Define data mining. Describe three challenges to data mining regarding data mining methodology and user interaction issues.
[3 marks]Explain the steps in knowledge discovery.
[4 marks]Explain the various data mining issues.
[7 marks]What are the smoothing techniques available to remove noise?
[3 marks]Discuss normalization in detail.
[4 marks]In real-world data, tuples with missing values for some attributes are a common occurrence. Describe various methods for handling this problem.
[7 marks]Discuss data discretization and concept hierarchy generation.
[7 marks]How are association rules mined from large databases?
[3 marks]Give the difference between Boolean association rule and quantitative association rule.
[4 marks]What are the limitations of the apriori approach for mining? Briefly describe the techniques to improve the efficiency of apriori algorithm.
[7 marks]Describe two interesting measures for association rules.
[3 marks]How Meta rules are useful in constraint based association mining.
[4 marks]Write an algorithm for finding frequent item-sets using candidate generation.
[7 marks]What are the difference between supervised learning and unsupervised learning?
[3 marks]Write down short note on backpropagation.
[4 marks]What is information gain? Explain the steps required to generate a decision tree from a training data set.
[7 marks]Differentiate between linear regression and nonlinear regression.
[3 marks]Explain various methods of evaluating accuracy of classifier.
[4 marks]Write a short on: web content mining.
[7 marks]Explain temporal mining.
[3 marks]Differentiate between partitioning and hierarchical methods for clustering.
[4 marks]Explain following clustering algorithm in details: 1) CLARA 2) BIRCH
[7 marks]List out the applications of distributed and parallel data mining.
[3 marks]Illustrate strength and weakness of k-mean in comparison with k-medoid algorithm.
[4 marks]Explain the typical requirements of clustering in data mining.
[7 marks]