Define : 1. Unstructured data 2. Sharding 3. Replication 4. Cluster 5. CAP theorem 6. Master – slave architecture 7. MongoDB
[7 marks]Give Syntax: 1. aggregate in MongoDB. 2. Creating index in MongoDB. 3. Arranging data in ascending order in Pig. 4. Limit and skip in MongoDB. 5. Grouping the data in Pig. 6. Creating view in Hive. 7. Copy a local file into HDFS.
[7 marks]What is big data? List and explain characteristics of big data in detail.
[7 marks]Write Pig Script for word count.
[7 marks]Write Hive query for word count.
[7 marks]Give differences : 1. SQL, NoSQL and NewSQL 2. Pig and Hive
[3 marks]Explain Map – Reduce process with proper example.
[7 marks]Explain daemons of HDFS and Map Reduce.
[7 marks]Explain mongoimport and mongexport in detail with syntax and e.g.
[7 marks]Explain Hive Architecture in detail.
[7 marks]What is NoSQL? Explain types of NoSQL databases in detail.
[7 marks]Explain Hadoop eco system in detail.
[7 marks]What is Pig? Explain Pig Anatomy and Pig Philosophy in detail.
[7 marks]What is spark? Give difference between Hadoop and Spark. Also explain RDD in detail.
[7 marks]Explain static and dynamic partition in Hive.
[7 marks]Write short note on MLlib, Tensor Flow and Theone.
[7 marks]Explain PiggyBank in detail with e.g.
[7 marks]