i) Explain Terms: IoE, KPI, cluster, Data Node ii) What is replication? State its types.
[3 marks]i) Explain 4 V’s of Big Data. ii) Explain in brief Structured, Semi-structured and unstructured Data.
[3 marks]i) Differentiate between traditional BI and Big Data BI. ii) Differentiate between RDBMS and Hadoop.
[3 marks]Discuss visual analysis with its importance.
[7 marks]State various categories of NoSQL databases and explain each of them in brief.
[7 marks]List and Explain Big Data analytics lifecycle.
[7 marks]Explain CAP and BASE Theorem in detail.
[7 marks]Explain SCV principal in detail.
[7 marks]Explain the term “sharding” along with an appropriate example.
[7 marks]Explain why MapReduce is not suitable for real-time data processing. How can it be enabled to work in a near-real-time scenario?
[7 marks]Compare parallel data processing and distributed data processing.
[7 marks]What is MapReduce? Explain using an appropriate example.
[7 marks]Explain the concepts “task parallelism” and “data parallelism” along with appropriate example.
[7 marks]Differentiate between “quantitative analysis” and “qualitative analysis”.
[7 marks]Discuss any 7 commands of Redis using suitable example.
[7 marks]Explain in brief Semantic analysis and A/B Testing.
[7 marks]MongoDB : Create collection students with fields : studID, studName, studDept, studGrade, studHobby,Salary 1. List students details 2. List student whose name ends with ‘P’. 3. Count no. of students department wise. 4. Create index on studID. 5. Arrange documents in ascending order of student Department. 6. Display 4th document to 5th document. 7. Remove documents whose hobby is “singing”.
[7 marks]