How can Big Data analytics be useful in the development of smart cities? [3 marks]
What is Big Data? Discuss it in terms of volume and velocity. [4 marks]
What are the advantages of Hadoop? Explain the Hadoop architecture and its components with a proper diagram. [7 marks]
Explain the storage mechanism in HBase. [3 marks]
In MapReduce, how is job scheduling done in the case of the Fair Scheduler? [4 marks]
Draw and explain the HDFS architecture. Explain the functions of the NameNode and DataNode. What is a Secondary NameNode? Is it a substitute for the NameNode? [7 marks]
Explain the following HDFS commands with syntax and at least one example of each.
(i) get (ii) cp (iii) chown [7 marks]
Explain the Metastore in Hive. [3 marks]
Explain the MapReduce framework in brief. [4 marks]
Explain the concept of Blocks and Heartbeat Messages in the HDFS architecture. What are the benefits of block transfer? [7 marks]
What is ZooKeeper? List its benefits. [3 marks]
Explain the 5 P's of Data Science in brief. [4 marks]
Explain the Spark components in detail. Also list the features of Spark. [7 marks]
Differentiate between NoSQL and relational databases. [3 marks]
What do you mean by HiveQL Data Definition Language? [4 marks]
Explain the following for MongoDB.
(i) Indexing (ii) Aggregation [7 marks]
Explain any three HiveQL DDL commands with syntax and an example of each. [3 marks]
Explain the scaling feature of MongoDB. [4 marks]
What are the problems related to MapReduce data storage? How does Apache Spark solve them using Resilient Distributed Datasets? Explain RDDs in detail. [7 marks]
Compare row-oriented and column-oriented database structures. [3 marks]
Discuss how the Pig data model helps in effective data flow. [4 marks]
Explain how HBase uses ZooKeeper to build applications. [7 marks]
OR
What is Spark? [3 marks]
Differentiate: Apache Pig vs. MapReduce. [4 marks]
Explain CRUD operations in MongoDB. [7 marks]