What is the importance of data analysis?
[3 marks]What is data warehousing? List the characteristics of data warehouse.
[4 marks]Explain the three tier architecture of data warehouse.
[7 marks]What is OLAP? How OLAP is differ from OLTP?
[3 marks]What is data cube? Explain rollup operation on data cube.
[4 marks]Explain different schema of data warehouse with suitable diagram.
[7 marks]Define: mean, median, mode, variance and standard deviation.
[7 marks]List different methods for data discretization and explain any one in detail.
[3 marks]Why data smoothing is required? Perform smoothing by bin means, by bin medians and by bin boundaries on the given data with bin size is 3. Consider the data for price (in dollars): 4, 8, 9, 15, 21, 21, 24, 25, 26, 28, 29, 33.
[4 marks]Explain the process of extracting the knowledge from the database.
[7 marks]Explain Association Rules with Confidence & Support.
[3 marks]Write a note on data cleaning process for missing value and noisy data treatment.
[4 marks]Explain various methods for normalization.
[7 marks]What are the limitations of the Apriori approach for mining?
[3 marks]What is data mining? How data mining techniques useful in super market?
[4 marks]State the Apriori Property. Find frequent item-sets and association rules using Apriori algorithm on the following data set with minimum support is 60% and minimum confidence=80%. Sr.No TID List of items 1 T100 M, O, N ,K, E, Y 2 T200 D, O, N, K, E, Y 3 T300 M, A, K, E 4 T400 M, U, C, K, Y 5 T500 C, O, O, K, I, E
[7 marks]Give the difference between supervised and un-supervised learning.
[3 marks]Explain attribute selection measures in decision tree.
[4 marks]Explain Baye’s Theorem and Naïve Bayesian Classification.1
[7 marks]Give the importance of the pruning in association rule mining.
[3 marks]What is outlier? Discuss different methods for outlier detection.
[4 marks]Briefly explain Linear and Non-linear regression.
[7 marks]Explain text mining in brief.
[3 marks]What is web mining? Explain types of web mining.
[4 marks]Explain k-mean clustering algorithm.
[7 marks]