Q1. Write a short note on predictive analytics.

This is question 1 (4 marks) from the GTU Big Data Analytics BE Summer 2025 question paper. Subject code: 3164604.

Big Data Analytics | B.E. Semester VI Summer 2025 | GTU Papers

State the difference between traditional data and big data.

[3 marks]

Write a short note on predictive analytics.

[4 marks]

Explain the “V’s” of Big Data in detail with relevant examples.

[7 marks]

Write a Map-Reduce code for Word Count.

[3 marks]

Explain the steps to set up the Hadoop cluster.

[4 marks]

Draw and explain the Map-Reduce framework in detail.

[7 marks]

Draw and discuss HDFS architecture in detail.

[7 marks]

Compare and contrast NoSQL and relational databases.

[3 marks]

Write CQL queries for the following in Cassandra: 1. Create a keyspace named company 2. Create a table employee with columns: emp_id (PRIMARY KEY), name, dept, salary.

[4 marks]

Discuss the architecture and features of Cassandra. How does it manage data distribution and fault tolerance?

[7 marks]

Differentiate between master-slave and peer-to-peer distribution models.

[3 marks]

Write CQL queries for the following in Cassandra: 1. Create a keyspace named university 2. Create a table students with columns: student_id (PRIMARY KEY), name, course, marks.

[4 marks]

Describe the four ways in which NoSQL systems handle big data problems. Illustrate your answer with suitable examples.

[7 marks]

Differentiate between traditional batch processing and stream processing.

[3 marks]

Explain the concept of lazy evaluation in Spark with an example.

[4 marks]

Describe the Flajolet-Martin algorithm with a suitable example.

[7 marks]

Enlist the challenges in mining data streams.

[3 marks]

Explain the Spark execution workflow from job submission to task execution.

[4 marks]

Explain the concept of counting ones in a window using the DGIM algorithm. Illustrate with a bit stream example.

[7 marks]

Compare Apache Pig with Map-Reduce.

[3 marks]

Explain the architecture of ZooKeeper.

[4 marks]

Explain the working of Hive with the necessary steps and a diagram.

[7 marks]

Explain Metastore in Hive.

[3 marks]

Explain the data processing operators in Pig.

[4 marks]

Discuss the concept of regions in HBase and storing Big Data with HBase.

[7 marks]
