Define Big Data. Enlist its importance over traditional database system.
[3 marks]List out 4 V’s and Explain in detail.
[4 marks]List out the applications of Big data and explain them in details.
[7 marks]Implement Join operation using Hive.
[3 marks]List out key elements of Ingress and Egress and explain them in detail.
[4 marks]Define HDFS. Discuss the HDFS architecture and HDFS commands in brief.
[7 marks]Write a short note on NoSQL databases. List the differences between NoSQL and relational databases.
[7 marks]What is Zookeeper? List the benefits of it.
[3 marks]Explain Aggregation and Indexing for MongoDB.
[4 marks]What are the components of Spark? Also state the features of Spark.
[7 marks]Explain in detail: Input and Output of MapReduce.
[3 marks]Write a short note on Pig.
[4 marks]Explain CRUD operations in MongoDB.
[7 marks]Explain any three HiveQL DDL command with its syntax and example.
[3 marks]What is HBase? List out and explain the basic concepts of HBase in detail.
[4 marks]Explain Replication and scaling feature of MongoDB.
[7 marks]Explain the following commands of HDFS:
[3 marks]copyFromLocal ii) setrep iii) checksum
[ marks]As per your point of view, How Big data analytics can be useful in development of smart cities?
[4 marks]Explain working of Hive with proper steps and diagram.
[7 marks]Explain Metastore in Hive.
[3 marks]Differentiate: Apache pig Vs Map Reduce.
[4 marks]What are the problems related to Map Reduce data storage? How Apache Spark solves it using Resilient Distributed Dataset? Explain RDDs in detail.
[7 marks]Write a MapReduce code for Word Count.
[3 marks]"Moving Computation is Cheaper than Moving Data", Justify the sentence.
[4 marks]Explain Map-Reduce framework in detail. Draw the architectural diagram for Physical Organization of Computer Nodes.
[7 marks]