Find Q1, Q2, and Q3 for the following data set: {5, 40, 42, 46, 48, 49, 50, 50, 52, 53, 55, 56, 58, 75, 102}
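As a worked illustration of the quartile question, the sketch below uses Python's `statistics.quantiles` with `method="exclusive"`, which places Q_k at position k(n+1)/4 in the sorted data. That convention is an assumption; other textbook conventions (for example the inclusive method) give slightly different Q1 and Q3 for the same data.

```python
from statistics import quantiles

# Data set from the question (already sorted, n = 15).
data = [5, 40, 42, 46, 48, 49, 50, 50, 52, 53, 55, 56, 58, 75, 102]

# Assumption: "exclusive" convention, i.e. Q_k sits at position k*(n+1)/4,
# which here lands exactly on the 4th, 8th, and 12th values.
q1, q2, q3 = quantiles(data, n=4, method="exclusive")

print(q1, q2, q3)  # 46.0 50.0 56.0
```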
[3 marks]Draw the framework for data-driven decision making analytics.
[4 marks]Write a short note on web and social media analytics.
[7 marks]Determine the most appropriate data type for the following:
[3 marks]PIN code
Exam score out of
Ranking in a race (1st, 2nd, 3rd…)
Explain measures of shape in brief.
[4 marks]Derive the mean of the uniform distribution.
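One possible outline of the uniform-distribution derivation, assuming the continuous uniform distribution on an interval $[a, b]$ (the question does not say whether the continuous or discrete case is meant, and the symbols $a$ and $b$ are introduced here for illustration):

```latex
f(x) = \frac{1}{b-a}, \qquad a \le x \le b

E[X] = \int_a^b x \, f(x) \, dx
     = \frac{1}{b-a} \int_a^b x \, dx
     = \frac{1}{b-a} \cdot \frac{b^2 - a^2}{2}
     = \frac{(b-a)(b+a)}{2(b-a)}
     = \frac{a+b}{2}
```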
[7 marks]Derive the mean of the Poisson distribution.
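One possible outline of the Poisson derivation, assuming the usual parameterisation with rate $\lambda > 0$ (the symbol $\lambda$ is introduced here and does not appear in the question):

```latex
P(X = k) = \frac{e^{-\lambda} \lambda^{k}}{k!}, \qquad k = 0, 1, 2, \dots

E[X] = \sum_{k=0}^{\infty} k \, \frac{e^{-\lambda} \lambda^{k}}{k!}
     = \lambda e^{-\lambda} \sum_{k=1}^{\infty} \frac{\lambda^{k-1}}{(k-1)!}
     = \lambda e^{-\lambda} e^{\lambda}
     = \lambda
```

The $k = 0$ term vanishes, and the remaining sum is the Taylor series of $e^{\lambda}$ after re-indexing.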
[7 marks]In a game, a die is rolled once. What is the probability of rolling a number greater than 4?
[3 marks]A factory has three machines, M1, M2, and M3, producing 30%, 45%, and 25% of the total output, respectively. The defect rates for the machines are M1: 2%, M2: 3%, and M3: 5%. If a product is selected at random and found to be defective, what is the probability that it was produced by machine M2?
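The factory question is a direct application of Bayes' theorem. A minimal sketch of the arithmetic, using exact fractions so that no rounding enters (the variable names are illustrative only):

```python
from fractions import Fraction

# Share of total output and defect rate per machine, taken from the question.
share = {"M1": Fraction(30, 100), "M2": Fraction(45, 100), "M3": Fraction(25, 100)}
defect_rate = {"M1": Fraction(2, 100), "M2": Fraction(3, 100), "M3": Fraction(5, 100)}

# Law of total probability: P(D) = sum over machines of P(M) * P(D | M).
p_defective = sum(share[m] * defect_rate[m] for m in share)

# Bayes' theorem: P(M2 | D) = P(M2) * P(D | M2) / P(D).
p_m2_given_defective = share["M2"] * defect_rate["M2"] / p_defective

print(p_defective)           # 4/125
print(p_m2_given_defective)  # 27/64
```

As a decimal, 27/64 is about 0.42.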
[4 marks]Write a short note on Probabilistic Sampling.
[7 marks]A bag contains 5 red balls and 3 green balls. Two balls are selected at random from the bag. What is the probability that both balls selected are red?
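A short sketch of the counting argument for the bag question, assuming the two balls are drawn without replacement and that the event of interest is both balls being red (the question does not state either point explicitly):

```python
from fractions import Fraction
from math import comb

red, green = 5, 3

# Assumption: draw 2 balls without replacement; the event is "both are red".
# Favourable pairs: C(5, 2). All possible pairs: C(8, 2).
p_both_red = Fraction(comb(red, 2), comb(red + green, 2))

print(p_both_red)  # 5/14
```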
[3 marks]In a deck of 52 playing cards, 13 are spades, 13 are hearts, 13 are diamonds, and 13 are clubs. If one card is drawn at random, what is the probability that it is either a heart or a queen?
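The card question can be checked by enumerating a standard deck and counting the cards that are a heart, a queen, or both, which is the same as inclusion-exclusion (13 + 4 - 1 = 16). A minimal sketch with illustrative rank and suit labels:

```python
from fractions import Fraction
from itertools import product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["spades", "hearts", "diamonds", "clubs"]

# Build the 52-card deck as (rank, suit) pairs.
deck = list(product(ranks, suits))

# A card is favourable if it is a heart or a queen; the queen of hearts
# is counted only once because each card appears once in the deck.
favourable = [card for card in deck if card[1] == "hearts" or card[0] == "Q"]

p_heart_or_queen = Fraction(len(favourable), len(deck))

print(len(favourable), p_heart_or_queen)  # 16 4/13
```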
Explain classical methods of estimation.
[7 marks]Give the difference between classification and regression.
[3 marks]State and explain properties of the Least Squares Estimators.
[4 marks]Explain the Naïve Bayes classifier with a suitable example.
[7 marks]Briefly explain the application of logistic regression in credit rating.
[3 marks]Explain the simple linear regression model.
[4 marks]Explain the ID3 algorithm with a suitable example.
[7 marks]Define sensitivity. Also, write its formula.
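For reference, sensitivity (the true positive rate) is commonly written as TP / (TP + FN). A small sketch follows; the function name and the counts are hypothetical values chosen only to show the arithmetic:

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of actual positives that the classifier labels positive."""
    return true_positives / (true_positives + false_negatives)

# Hypothetical counts: 40 positives detected, 10 positives missed.
example_sensitivity = sensitivity(true_positives=40, false_negatives=10)

print(example_sensitivity)  # 0.8
```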
[3 marks]Briefly explain how data science is useful in the biotech sector.
[4 marks]An educational institution aims to identify students who are at risk of failing a course by analyzing historical data, such as attendance, grades, and participation in extracurricular activities. How would you apply data science techniques to develop a predictive model for identifying at-risk students? Explain the types of data preprocessing techniques you would use, the machine learning algorithms you would consider, and how you would handle class imbalance if more students are predicted to pass than fail.
[7 marks]Define specificity. Also, write its formula.
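For reference, specificity (the true negative rate) is commonly written as TN / (TN + FP). A small sketch follows; the function name and the counts are hypothetical values chosen only to show the arithmetic:

```python
def specificity(true_negatives: int, false_positives: int) -> float:
    """Fraction of actual negatives that the classifier labels negative."""
    return true_negatives / (true_negatives + false_positives)

# Hypothetical counts: 45 negatives correctly rejected, 5 false alarms.
example_specificity = specificity(true_negatives=45, false_positives=5)

print(example_specificity)  # 0.9
```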
[3 marks]Briefly explain how data science is useful for weather forecasting.
[4 marks]A social media platform wants to perform real-time sentiment analysis on user comments to understand public opinion about a new product launch. The company has access to large volumes of user-generated data and wants to classify comments as positive, negative, or neutral. Describe how you would build a real-time sentiment analysis system using data science. What type of model would you use, and how would you ensure that the system can handle large amounts of data efficiently? Also, discuss the challenges of sentiment analysis in real-time environments and how to overcome them.