21CS71 Big Data Analytics (BDA) Model Question Paper-1 with answers
Module-1
1.A] What is Big Data? Explain the evolution of big data and its characteristics. – 10 Marks
OR
2.A] What is Cloud Computing? Explain different services of Cloud. – 10 Marks
2.B] Explain any two big data applications. – 5 Marks
2.C] How does Berkeley Data Analytics Stack help in analytics tasks? – 5 Marks
Module-2
3.A] What is Hadoop? Explain the Hadoop ecosystem with a neat diagram. – 8 Marks
3.B] Explain with a neat diagram the components of HDFS. – 8 Marks
3.C] Write a short note on Apache Hive. – 4 Marks
OR
4.A] Explain Apache Sqoop Import and Export methods. – 8 Marks
4.B] Explain Apache Oozie with a neat diagram. – 7 Marks
4.C] Explain the YARN application framework. – 5 Marks
Module-3
5.A] What is NoSQL? Explain the CAP Theorem. – 10 Marks
5.B] Explain NoSQL data architecture patterns. – 10 Marks
OR
6.A] Explain the shared-nothing architecture for big data tasks. – 10 Marks
6.B] Explain MongoDB database. – 10 Marks
Module-4
7.A] Explain MapReduce execution steps with a neat diagram. – 10 Marks
7.B] What is Hive? Explain Hive architecture. – 10 Marks
OR
8.A] Explain Pig architecture for scripts data flow and processing. – 10 Marks
8.B] Explain key-value pairing in MapReduce. – 10 Marks
Module-5
9.A] What is machine learning? Explain different types of regression analysis. – 10 Marks
9.B] Explain with a neat diagram the K-means clustering algorithm. – 5 Marks
9.C] Explain Naïve Bayes Theorem with an example. – 5 Marks
OR
10.A] Explain the five phases in a process pipeline for text mining. – 10 Marks
10.B] Explain web usage mining. – 10 Marks