Answer the following: 1. List the applications of big data. 2. Define NewSQL. 3. Define RDD.
[3 marks]Discuss the key drivers of big data.
[4 marks]Discuss hadoop ecosystem in brief.
[7 marks]Differentiate following: 1. SQL vs NoSQL 2. Pig vs Hive
[7 marks]What is distributed file system? Discuss architecture of Hadoop Distributed File System.
[7 marks]Discuss the classification of big data.
[7 marks]What is MapReduce? Explain Mapper and Reducer phases in brief.
[7 marks]Discuss hive architecture in detail.
[7 marks]Define HQL. Discuss DDL and DML statements with reference to HQL.
[7 marks]Write and discuss the process of Extract, Transform and Load in PIG.
[7 marks]Write short note on HBase.
[7 marks]What is apache spark? List and explain apache spark components.
[7 marks]List and explain types of NoSQL databases.
[7 marks]What is zookeeper? Explain characteristics of zookeeper.
[7 marks]Discuss key features of MongoDB.
[7 marks]What is NOSQL? List the advantages and uses of NoSQL.
[7 marks]Explain following terms with reference to MongoDB. 1. Document 2. Collection 3. Object Id
[7 marks]Explain aggregate function in MongoDB with suitable example.
[7 marks]