i) Define Terms: DataNode, NameNode, Cluster, Big Table ii) Differentiate between File Systems and distributed file system.
[3 marks]Say TRUE or FALSE with justification.
[7 marks]MapReduce works on the principle of divide-and-conquer. ii) MongoDB scales horizontally using Replication. iii) MapReduce cannot be used for batch processing of data and aggregation operations. iv) There is no similarity between SQL and NoSQL databses.
[ marks]Hadoop can be used as an ETL engine. vi) Afundamental principle related to Big Data processing is the CAP principle. vii) Big data characteristic Value refers to the quality or fidelity of data.
[ marks]i) Differentiate between RDBMS and NoSQL Databases. ii) Why Document databases are schema-less in form?
[3 marks]Justify statement: Developments in ICT have accelerated the pace of Big Data adoption in businesses.
[7 marks]State the stages of Big Data analytics lifecycle.
[7 marks]What is Big Data? List and explain characteristics of big data with example
[7 marks]Describe the Speed, Consistency and Volume (SCV). why only two out of the three SCV properties are simultaneously supportable.
[7 marks]How Sharding and Replication can be combined benefit fault tolerance, availability and scalability?
[7 marks]Describe the Consistency, Availability, and Partition tolerance (CAP) theorem. why only two out of the three CAP theorem properties are simultaneously supportable.
[7 marks]Discuss A/B Testing, Correlation and Regression using suitable example.
[7 marks]What is MapReduce? Write a Mapper and Reducer function for word count.
[7 marks]Discuss Classification, Clustering and Filtering Machine learning techniques.
[7 marks]What is the objective of MapReduce? Explain the key considerations when developing a MapReduce algorithm.
[7 marks]Explain how to access data from MongoDB, Hbase and Redis databases.
[7 marks]Explain mongoimport, mongoexport and mongodump utility.
[7 marks]Explain the concepts “task parallelism” and “data parallelism” using suitable example. Which of these concepts is adopted by MapReduce? How?1
[7 marks]MongoDB : Create collection Election with fields : SeatID, SeatName, PartyName, CandidateName, Gender,Age 1. List all documents containing Patel in their CandidateName. 2. Take backup of collection. 3. Count no. of Party wise candidates.. 4. Create index on PartyName. 5. Arrange documents in ascending order of names. 6. Display 4th document to 5th document. 7. Remove documents whose Age is more than 65
[7 marks]