List various use cases of Big Data. Also enlist its importance over traditional database system.
[3 marks]Define: Big Data, Heartbeat, Volume, and Velocity
[4 marks]Draw different components of Hadoop with diagram. Explain each in details.
[7 marks]What is HDFS? List out different HDFS Commands.
[3 marks]Explain the working of MapReduce in short.
[4 marks]Explain Structured, Semi structured and Unstructured data with suitable example in brief.
[7 marks]Differentiate (1) Apache Pig Vs Map Reduce (2) Apache Spark Vs Map Reduce
[7 marks]List down any 3 Hadoop’s configuration files. Also explain in short.
[3 marks]How Hadoop 1.0 differs from Hadoop 2.0?
[4 marks]Write and Explain a ‘WordCount’ Java MapReduce program for having input Text: Rose red pink yellow white and peach Marigold orange or yellow Jasmine white Lotus pink and white Sunflower yellow Mogra pink purple yellow and white
[7 marks]What is full form YARN? Explain it in short.
[3 marks]Explain sharding process of MongoDB.
[4 marks]What is full form of RDD? Explain RDD operations in brief.
[7 marks]Explain Pig Data Model in short.
[3 marks]What is metastore in Hive. Explain it in short.
[4 marks]Justify the statement: Spark is faster than MapReduce
[7 marks]What is SQL?
[3 marks]What is NOSQL? How it differs from SQL?
[4 marks]Explain ‘WordCount’ Scala program for Apache Spark.
[7 marks]What is Zookeeper? Explain it in short.
[3 marks]Write a MongoDB query to demonstrate collection and document.
[4 marks]Draw and discuss Hive architecture.
[7 marks]Explain different components of HBase architecture in short.
[3 marks]What is the importance of HBase?
[4 marks]Explain MLib in detail.
[7 marks]