Answer in Brief: 1. How many bytes does Volume of big data generally refer to? 2. Give 3C’s Characteristics of big data. 3. Give any two characteristics of NOSQL databases. 4. Give full form of HDFS, YARN, JSON, HQL 5. What is the advantage of partitioning in Hive? 6. Give any two applications of Apache Spark. 7. In which form MongoDB document is stored?
[7 marks]Attempt the following a. Discuss any four components of Hadoop Ecosystem. b. Mention any three key differences between Pig and Hive.
[7 marks]Give sources of structured data, semi-structured data and unstructured data.
[7 marks]Write a short note on: Key aspects/characteristics of Hadoop
[7 marks]Discuss HDFS concept in detail.
[7 marks]What is MongoDB? How MongoDB differs from RDBMS? Explain.
[7 marks]Discuss Insert, Update and Delete operation in MongoDB giving examples.
[7 marks]Why there is a need of MongoDB? Discuss basic concept of Sharding and Replication in MongoDB.
[7 marks]Discuss the following concepts in MongoDB with example 1. Count ,Limit & Sort 2. Array Concept
[7 marks]Discuss MAPPER and REDUCER concept for Hadoop cluster.
[7 marks]Write the corresponding HiveQL Statements to achieve the following tasks in HIVE: 1. Display List of Databases 2. Create Database MYSTUDENTS 3. Describe Database MYSTUDENTS 4. Create a Managed Table EMPLOYEES 5. Create EXTERNAL table MYEMPLOYEES 6. Load Data into above table MYEMPLOYEES from file 7. Retrieve all records from MYEMPLOYEES table.
[7 marks]Write a note on Map-Reduce concept in MongoDB.
[7 marks]Explain the following OPERATORS in PIG with example: 1. FILTER 2. FOREACH
[7 marks]What is Pig? Give Execution modes of Pig. Discuss when to use and when not to use Pig.
[7 marks]Discuss aggregation commands of MongoDB giving example.
[7 marks]Write a short note on: Partitioning and Bucketing in Hive.
[7 marks]Discuss Spark architecture and explain briefly the concept of RDD creation, RDD transformation and RDD operation with example.
[7 marks]