What is a data warehouse? Who needs it, and why? [3 marks]
How is a data warehouse managed? Explain with its methodology. [4 marks]
Draw the ETL architecture and explain it in detail. [7 marks]
Define the term “data mining”. List the major issues in data mining. [3 marks]
What are the limitations of the Apriori approach for mining? [4 marks]
State the Apriori property. Find the frequent itemsets and association rules using the Apriori algorithm on the following data set, with a minimum support count of 2 and a minimum confidence of 75%. [7 marks]
Sr. No.   TID    List of item IDs
1         T100   I1, I2, I5
2         T200   I2, I4
3         T300   I2, I3
4         T400   I1, I2, I4
5         T500   I1, I3
6         T600   I2, I3
7         T700   I1, I3
8         T800   I1, I2, I3, I5
9         T900   I1, I2, I3
Explain multi-dimensional association rules in detail. [7 marks]
What is classification? Give the difference between classification and prediction. [3 marks]
What are information gain and gain ratio? [4 marks]
Explain Bayes’ theorem and Naïve Bayesian classification. [7 marks]
What is market basket analysis? Explain association rules with confidence and support. [3 marks]
Give the difference between OLAP and OLTP. [4 marks]
Explain data smoothing methods. Divide the given data into bins of size 3 and smooth by bin means, by bin medians and by bin boundaries. Consider the data: 10, 2, 19, 18, 20, 18, 25, 28, 221 [7 marks]
What is clustering? Why is clustering an unsupervised learning technique? [3 marks]
Write a short note on text mining. [4 marks]
Explain the k-means algorithm of clustering. [7 marks]
Give the difference between spatial and temporal data mining. [3 marks]
Briefly explain linear and non-linear regression. [4 marks]
Explain neural network based algorithms in detail. [7 marks]
What is DBSCAN? Explain in brief. [3 marks]
Explain the partitioning method of clustering in brief. [4 marks]
Explain the different types of web mining with examples. [7 marks]
Why is web mining needed? Explain. [3 marks]
Explain web structure mining in brief. [4 marks]
What is logistic regression? Explain in detail. [7 marks]