1. Define Big Data and discuss the Four Vs associated with it. 2. What is Big Data Analytics? Outline the Big Data Analytics Life Cycle. [7 marks]
What is MapReduce? Explain the working of the various phases of MapReduce with a word count example. [7 marks]
Differentiate between: 1. NoSQL vs. SQL vs. NewSQL 2. Hadoop vs. RDBMS. [7 marks]
What is Apache Hadoop? Explain how its ecosystem operates. [7 marks]
Explain the concept of Hive and illustrate its architecture using a figure. [7 marks]
What is Apache Pig? Discuss its importance and uses. [7 marks]
Define a distributed file system. Explain the architecture of the Hadoop Distributed File System. [7 marks]
Explain HBase and describe the data storage mechanism in HBase with an example. [7 marks]
What do you mean by ZooKeeper? Explain its role in monitoring a cluster. [7 marks]
What do you understand by Spark? Explain the concept and working of RDD. [7 marks]
List and explain the different categories of NoSQL databases in detail. [7 marks]
Explain HiveQL and discuss the detailed steps involved in querying data. [7 marks]
Describe the key principles of schema design in detail. [7 marks]
What are the main features of MongoDB? Write a short explanation. [7 marks]
Discuss CRUD operations in MongoDB along with suitable examples for every operation. [7 marks]
Describe the use of the aggregate function in MongoDB with an appropriate example. [7 marks]
Define NoSQL and discuss its advantages.