Define: Data Mart, Enterprise Warehouse & Virtual Warehouse. [3 marks]
"A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data." Justify. [4 marks]
What are the different sources of information? Explain the terms data, information and knowledge with a suitable example. [7 marks]
Differentiate Fact table vs. Dimension table. [3 marks]
Define noise. Explain binning methods for data smoothing. [4 marks]
Explain the different OLAP operations with examples. [7 marks]
What is Data Mining? Explain Data Mining as one step of the Knowledge Discovery Process. [7 marks]
List and describe the methods for handling missing values in data cleaning. [3 marks]
What is market basket analysis? Explain the two measures of rule interestingness: support and confidence. [4 marks]
State the Apriori Property. Generate large itemsets and association rules using the Apriori algorithm on the following data set, with minimum support and minimum confidence set to 50% and 75% respectively. [7 marks]
Discuss click-stream analysis using data mining. [3 marks]
Explain the Min-max data normalization method with a suitable example. [4 marks]
Explain the three-tier Data Warehouse Architecture. [7 marks]
Discuss the following terms: 1) Supervised learning 2) Correlation analysis 3) Tree pruning [3 marks]
Differentiate Association vs. Classification. [4 marks]
Explain Bayes' Theorem and calculate the Naïve Bayesian Classification for the given example: [7 marks]
Draw the topology of a multilayer, feed-forward Neural Network. [3 marks]
Explain a data mining application for fraud detection. [4 marks]
Define linear and nonlinear regression using figures. Calculate the value of Y for X = 100 based on the linear regression prediction method. X Y4390 [7 marks]
Describe web mining using an example. [3 marks]
What is Big Data? What is big data analytics? [4 marks]
Define the term "Information Gain". Explain the steps of the ID3 algorithm for generating a Decision Tree. [7 marks]
Define: 1) Data Node 2) Name Node 3) Text mining [3 marks]
Explain partitioning and hierarchical methods of clustering. [4 marks]
What is a distributed file system? Explain the HDFS architecture in detail. [7 marks]