What is the difference between discrimination and classification?
[3 marks]Describe the steps involved in data mining when viewed as a process of knowledge discovery.
[4 marks]Draw and explain the data mining architecture.
[7 marks]Discuss issues to consider during data integration.
[3 marks]Describe various methods for handling the missing values in the dataset.
[4 marks]Use the methods below to normalize the following group of data: 200, 300, 400, 600, 1000
[7 marks]min-max normalization by setting min = 0 and max =
[ marks]Calculate z-score normalization
[ marks]z-score normalization using the mean absolute deviation instead of standard deviation
[ marks]normalization by decimal scaling
[ marks]For the following data (in increasing order) for the attribute age: 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
[7 marks]Use smoothing by bin means to smooth these data, using a bin depth of 3. Illustrate your steps. Comment on the effect of this technique for the given data.
[ marks]How might you determine outliers in the data?
[ marks]What other methods are there for data smoothing?
[ marks]What is meant by association rule?
[3 marks]Write a short note on: Web content mining.
[4 marks]Explain about the Apriori algorithm for finding frequent item sets with an example.
[7 marks]What is meant by Maximal Frequent Item Set?
[3 marks]Write a short note on: Web usage mining.
[4 marks]Which patterns are interesting? Explain based on Pattern Evaluation Methods.1
[7 marks]What is the difference between “supervised” and unsupervised” learning?
[3 marks]What are the attribute selection measures? Explain any one in details.
[4 marks]Discuss about Decision tree induction algorithm with an example.
[7 marks]What do you meant by Bayesian Classification?
[3 marks]Describe the issues regarding classification and prediction.
[4 marks]Discuss about Neural Network-Based Algorithms.
[7 marks]What are the requirements of clustering?
[3 marks]Discuss about Outlier Detection.
[4 marks]Discuss about k-nearest neighbor classification algorithm with an example?
[7 marks]State the categories of clustering methods?
[3 marks]Explain the concept hierarchy.
[4 marks]Explain in detail about partitioned Clustering method.
[7 marks]