Define following. 1. Data warehouse 2. Cluster centroid 3. Stemming 4. Term Frequency
[4 marks]Discuss the importance of analytic sandbox.
[3 marks]Explain Data preparation, Model planning and Model building phases of data analytic lifecycle.
[7 marks]Explain types and drivers of big data.
[7 marks]Discuss collaborative filtering vs association rules.
[7 marks]Discuss different metrics to measure the distance between two records.
[7 marks]Explain k-means clustering algorithm.
[7 marks]Write short note on item based collaborative filtering.
[7 marks]Discuss the concept of support and confidence for generating association rules in apriori algorithm.
[7 marks]Explain hierarchical clustering in brief.
[7 marks]Explain simple exponential smoothing in detail.
[7 marks]Write short note on: Amodel with Linear Trend.
[7 marks]Define autocorrelation. Discuss ARIMA models in brief.
[7 marks]Explain edge list and adjacency matrix with proper example.
[7 marks]Explain following: 1. Node level centrality metrics 2. Egocentric Network
[7 marks]Explain advantages and disadvantages of social media analytics.
[7 marks]Explain tokenization in brief.
[7 marks]Explain bag-of-words approach with suitable example.
[7 marks]