Define Big Data. Explain with suitable example.
[3 marks]Explain 4 V’s properties of Big Data.
[4 marks]Enlist the advantages of Hadoop. Draw and explain Hadoop Architecture with components.
[7 marks]Explain unstructured, semi-structured and structured data with one example of each.
[3 marks]What are the benefits of Big Data? Discuss challenges under Big Data.
[4 marks]Define HDFS. Discuss the HDFS Architecture and HDFS commands in brief.
[7 marks]Write Map Reduce code for counting occurrences of words in the input text file.
[7 marks]Explain in detail: Input and Output of MapReduce.
[3 marks]Explain CRUD operations in MongoDB with syntax.
[4 marks]Define RDD. Discuss transformations and actions in RDDs. State and explain RDD operations in brief.
[7 marks]Explain the following phases of MapReduce.
[3 marks]Map Phase ii) Shuffle and Sort Phase iii) Reduce Phase
[ marks]Implement Join operation using Hive.
[4 marks]Explain Pig Data Model in details. Discuss its effectiveness for data flow.
[7 marks]Define Zookeeper. List and explain its benefits.
[3 marks]Explain the terms:
[4 marks]Metastore in Hive ii) setrep iii) Fair Scheduler iv) YARN
[ marks]Discuss the concepts of regions in HBase and storing Big Data with HBase.
[7 marks]"Moving Computation is Cheaper than Moving Data" - Justify the sentence.
[3 marks]Differentiate: Apache pig Vs Map Reduce.
[4 marks]Draw and discuss the Architecture of Hive.
[7 marks]List out key elements of Ingress and Egress and explain them in detail.
[3 marks]Write a short note on NoSQL databases.
[4 marks]Discuss Big Data in Healthcare, Agriculture and Manufacturing.
[7 marks]Explain the terms:
[3 marks]Replication Factor ii) Data Serialization iii) Heartbeat Message
[ marks]List the differences between NoSQL and relational databases.
[4 marks]Discuss Big Data in Transportation, Education, Smart Cities.
[7 marks]