Compare data mart and data warehouse.
[3 marks]Adata warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data – Justify.
[4 marks]What is Cuboid? Explain various OLAP operations on data cube with example.
[7 marks]Differentiate Fact table vs. Dimension table.
[3 marks]briefly explain classification and prediction.
[4 marks]Explain the KDD process in detail.
[7 marks]Explain the major issues in data mining.
[7 marks]Briefly discuss the need for data preprocessing.
[3 marks]Explain the following terms with suitable example. 1) Data Integration 2) Data Transformation
[4 marks]Draw the diagram and describe the architecture of a data mining system.
[7 marks]Explain parametric and non-parametric methods of data reduction.
[3 marks]What is data cleaning? How to handle the missing value in data cleaning?
[4 marks]What is noise? Describe the possible reasons for noisy data. Explain the different techniques to remove the noise from data.
[7 marks]Briefly explain Linear and Non-linear regression.
[3 marks]What is market basket analysis? Explain the two measures of rule interestingness: support and confidence.
[4 marks]Explain the steps of the Apriori Algorithm for mining Frequent Itemsets with Candidate Generation. Use a suitable example to illustrate your answer.
[7 marks]Discuss : training and test dataset.
[3 marks]What is classification? Explain classification as a two-step process with a diagram.
[4 marks]Explain how the accuracy of a classifier/predictor can be measured.
[7 marks]Explain text mining using example.
[3 marks]Write a short note on tree pruning.
[4 marks]Explain the working of the k-Means clustering algorithm.
[7 marks]Write a note on web mining.
[3 marks]Explain the following as attribute selection measures: (i) Information Gain (ii) Gain Ratio.
[4 marks]Discuss Bayesian classification.
[7 marks]