Explain the various features of a data warehouse. [3 marks]
Explain why data warehouses are needed for developing business solutions from today’s perspective. [4 marks]
Explain the three-tier data warehouse architecture. [7 marks]
Compare OLAP and OLTP systems. [3 marks]
List the challenges of data mining regarding mining methodology and user interaction issues. [4 marks]
What is a cuboid? Explain OLAP operations with examples. [7 marks]
Explain the star, snowflake, and fact constellation schemas for multidimensional databases. [7 marks]
Explain various application areas of data mining. [3 marks]
How can missing values in a data set be handled? [4 marks]
With the help of a suitable diagram, explain the process of knowledge discovery from databases. [7 marks]
Why is preprocessing required? [3 marks]
Explain min-max and z-score normalization. [4 marks]
Explain mean, median, mode, variance, and standard deviation with a suitable database example. [7 marks]
Suppose a group of sales price records has been sorted as follows: 6, 9, 12, 13, 15, 25, 50, 70, 72, 92, 204, 232. Partition them into three bins by equal frequency and perform data smoothing by bin means. [3 marks]
Explain association rules with confidence and support. [4 marks]
State the Apriori property. Find the frequent itemsets and association rules using the Apriori algorithm on the following data set, with a minimum support count of 2 and a minimum confidence of 75%. [7 marks]
    Sr. No.  TID   List of item IDs
    1        T100  I1, I2, I5
    2        T200  I2, I4
    3        T300  I2, I3
    4        T400  I1, I2, I4
    5        T500  I1, I3
    6        T600  I2, I3
    7        T700  I1, I3
    8        T800  I1, I2, I3, I5
    9        T900  I1, I2, I3
Write the difference between classification and prediction. [3 marks]
Explain rule-based classification. [4 marks]
Explain the Naïve Bayesian classifier with an example. Why is it called naïve? [7 marks]
List the strengths and weaknesses of neural networks as classifiers. [3 marks]
Define true positive, true negative, false positive, and false negative. [4 marks]
Explain the decision tree algorithm with a suitable example. [7 marks]
Write a short note on web mining. [3 marks]
Explain text mining. [4 marks]
Explain the k-means clustering algorithm. [7 marks]