Q1. 1) Define the following terms:

This is question 1 (3 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q1. Big Data b) Machine Learning c) Heartbeat

This is question 1 (null marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q1. Explain 4Vs property of Big Data.

This is question 1 (4 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q1. a) Explain five P’s of Big Data in brief.

This is question 1 (3 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q2. Explain HDFS operations in detail.

This is question 2 (3 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q2. a) Define Schema.

This is question 2 (null marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Q2. Explain the importance of Big Data.

This is question 2 (4 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Big Data Analytics | B.E. Semester VII Winter 2022 | GTU Papers

Q: Q2. Define Distributed file system. Enlist and explain features of DFS.

This is question 2 (7 marks) from the GTU BIG DATA ANALYTICS (Department Elective-II) BE Winter 2022 question paper. Subject code: 2171607.

Questions29

1) Define the following terms:

[3 marks]

Big Data b) Machine Learning c) Heartbeat

[ marks]

Explain 4Vs property of Big Data.

[4 marks]

a) Explain five P’s of Big Data in brief.

[3 marks]

Explain job scheduling of fair scheduler in Map Reduce.

[4 marks]

Explain HDFS operations in detail.

[3 marks]

Enlist various applications of Big Data. How it can be used in weather forecasting.

[4 marks]

Define Distributed file system. Enlist and explain features of DFS.

[7 marks]

a) Define Schema.

[ marks]

Explain the importance of Big Data.

[4 marks]

Make a note on “how type of data affects data serialization.”

[2 marks]

Explain storage mechanism in HBase.

[3 marks]

Differentiate: Apache pig Vs Map Reduce.

[4 marks]

Explain Hadoop components with diagram.

[7 marks]

Define Zookeeper. Enlist and discuss the benefits of it.

[3 marks]

Explain SPARK unified stack.

[4 marks]

Justify “Spark is faster than MapReduce”.

[7 marks]

Define: Term Frequency and Inverse Document Frequency.

[3 marks]

List out the features of HIVE. Explain the architecture of HIVE.

[4 marks]

What is NoSQL? List out the features of NoSQL. Explain types of NoSQL databases in brief.

[7 marks]

Explain sharding process of MongoDB.

[3 marks]

Explain job scheduling of capacity scheduler in Map Reduce.

[4 marks]

Differentiate SQL and NoSQL. Enlist the industry applications of NoSQL.

[7 marks]

Discuss Machine Learning with MLlib in SPARK.

[3 marks]

Explain scaling feature of MongoDB.

[4 marks]

Explain following for MongoDB. 1) Indexing 2) Aggregation1

[7 marks]

Explain following in brief with respect to Mongo DB : 1) Collections and documents 2) Indexing and retrieval

[3 marks]

Explain metastore in Hive.

[4 marks]

Explain CRUD operations with suitable example in MongoDB.

[7 marks]

BIG DATA ANALYTICS (Department Elective-II) — Winter 2022

Questions29

Related Papers