Explain Data Analytics life cycle with the help of diagram.
Please rotate your device horizontally for split view
Access and download Data Science question papers from Savitribai Phule Pune University (SPPU). Our collection includes INSEM (Internal Semester) and ENDSEM (End Semester) exam papers.
We offer 7 question papers for Data Science, covering various exam patterns and years. All papers are in PDF format for easy viewing and download.
Prepare for mid-term evaluations with Data Science INSEM papers, aligned with the SPPU exam pattern and syllabus.
Access Data Science ENDSEM papers covering the entire syllabus, essential for final exam preparation.
Our question-paper viewer enables you to:
SPPU Question Papers Hub is focused entirely on SPPU previous year papers, with cleaner discovery by branch, semester, and subject.
Data Science is a key subject in the SPPU curriculum. Our question paper collection helps students understand exam patterns, practice effectively, and improve academic performance.
Explore Data Science resources including SPPU question papers from Savitribai Phule Pune University. Find INSEM and ENDSEM papers for effective examination preparation. Our platform offers academic resources, a PDF viewer for online study, university question papers, and materials for semester examinations.
Download all INSEM question papers as ZIP
Download all ENDSEM question papers as ZIP
Download all question papers (INSEM + ENDSEM) as ZIP
Explain Data Analytics life cycle with the help of diagram.
List different phases in data analytics life cycle and explain Model Building phase in detail.
What are different phases in data analytics life cycle? Explain Operationalize phase in detail.
Explain Model building phase with its challenges.
Explain association rules with example.
Explain Python Libraries for Data Processing, Modeling and Data Visualization.
Explain predictive, Descriptive, and Prescriptive data analysis. And also mention their difference.
Write a short notes on Global Innovation Social Network and Analysis.
Explain the use of logistic function in logistic regression in detail. List and explain the Types of Logistic regression.
Wirte short notes on ASM.
What is clustering? With suitable example explain the steps involved in k-means algorithm.
Discuss Holdout method and Random Sampling methods.
Wirte short note on i) Confusion matrix ii) AVC- ROC curve
What do you mean by text analysis? Why text analysis need to be done? Explain the following text analysis steps with suitable examples i) Part of speech (POS) tagging ii) Lemmatization iii) Stemming
Wirte short note on i) Time series Analysis ii) TF- IDF.
What is data visualization? What are the different methods of data visualization explain in detail.
Explain in detail the Hadoop Ecosystem with suitable diagram.
Describe the Data visualization tool “Tableau”. Explain its applications in brief.
With a suitable example explain and draw a Box plot and explain its usages.
With a suitable example explain Histogram and explain its usages.
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 70 |
| Total Questions | 8 |
| Duration | 2½ Hours |
| Paper Number | [6003]-542 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence and Data Science |
| Exam Type | ENDSEM |
| Exam Session | 2023 May Jun Endsem |
| Watermark | ['CEGP013091', '49.248.216.238 21/06/2023 10:39:16 static-238'] |
What is Data Analytics Life Cycle?
Draw the diagram of the data analytics life cycle in big data and briefly explain the Model Planning phase.
Write a note on the Data Preparation phase with its steps.
Explain in detail how the model-building phase is built by a team in the data analytics life cycle.
List and explain the steps in the Data Preparation phase of the data analytics lifecycle.
Explain challenges in the Model building phase.
What do you mean by Linear Regression? Elaborate the types.
Explain the Apriori algorithm with an example.
Write a short note on the following: i) FP growth ii) Decision Tree Classification
Explain Data transformation using function and mapping.
Write a short note on the following: i) Removing duplicates from the data set. ii) Handling missing data.
Explain association rules with examples.
What is clustering? Explain hierarchical clustering with an example.
Explain the Holdout method and Random Sub Sampling method.
Discuss parameter tuning and optimization.
Short Note on: i) AUC-ROC Curves ii) Elbow plot
What is clustering? Explain k - means algorithm.
Write a short note on : i) Time series Analysis ii) TF-IDF.
What is data Visualization? List and explain any one type of data visualization.
With a suitable example explain and draw a Box plot and explain its usage.
Discuss various challenges to Big data Visualization.
Define data visualization. What are the different methods of data visualization explain in detail.
Explain in detail the Hadoop Ecosystem with a suitable diagram.
With a suitable example explain the Scatter plot and explain its usage.
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 70 |
| Total Questions | 8 |
| Duration | 2½ Hours |
| Paper Number | [6262]-56 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence and Data Science |
| Exam Type | ENDSEM |
| Exam Session | 2024 May Jun Endsem |
| Watermark | ['CEGP013091', '49.248.216.238 15/05/2024 09:49:40 static-238'] |
Explain phases of data analytic life cycle in detail. Draw a suitable diagram.
What is model building elaborate this phase of Data analytics with the help of suitable example.
Write a short note on : i) ETL ii) Common tools for model building iii) Model selection for Data analytics
List and explain the various activities involved in identifying potential data resources as a part of discovery phase in Data analytics life cycle.
Write short note on the following : i) Removing duplicates from data set. ii) Data transformation iii) FP Growth
What are association rules? Explain Apriori Algorithm in brief.
Explain Predictive, Descriptive and Prescriptive analytics with example.
What is Classification? Explain Decision Tree Classification with example.
Write short note on the following : i) Confusion matrix. ii) AUC-ROC Curves. iii) Elbow plot.
Explain the following Text Analysis steps with suitable example : i) Part-of-speech(POS)tagging ii) Lemmatization
What is clustering? With suitable example explain the steps involved in Hierarchical Clustering.
What is confusion matrix? Given the confusion matrix, Calculate Accuracy, Precision, Recall, Error rate with description on diabetic risk. Predicted Classes: Actual Classes | Diabetic Risk- Yes | Diabetic Risk- No. Classes | Diabetic Risk- Yes | 90 | 210. Classes | Diabetic Risk- No | 140 | 9560
List the challenges of Big Data Visualization. Explain the types of visualization with example.
Explain in detail the Hadoop Ecosystem with suitable diagram and different components.
Write a short note on the following : i) Map reduce. ii) Pig iii) Hive
With suitable example draw histogram and boxplot. Also explain its uses.
| Subject Name | Data Science |
|---|---|
| Semester | VI |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 70 |
| Total Questions | 8 |
| Duration | 2½ Hours |
| Paper Number | [6403]-56 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence and Data Science |
| Exam Type | ENDSEM |
| Exam Session | 2025 May Jun Endsem |
| Watermark | ['CEGP013091', '49.248.216.237 26/05/2025 09:38:51 static-237'] |
Explain in Detail Discovery Phase of Data Analytics lifecycle.
Explain in Detail Data Preparation Phase of Data Analytics lifecycle.
Explain in Detail Model Building Phase of Data Analytics lifecycle.
Explain in Detail Communication Result Phase of Data Analytics lifecycle.
Write a Short Notes on Global Innovation Social Network and Analysis (GINA).
What are different python libraries used for data analysis?
Explain data preprocessing in detail.
Differentiate between Predictive, Descriptive, and Prescriptive data analysis.
What are association rules? Explain Apriori Algorithm in brief.
Write python program to apply the logistic regression model and check for goodness of fit.
What is the decisiontree? Explain with an example.
What do you mean by text analysis? Why text analysis needs to be done? Explain the following text analysis steps with suitable example. i) Part of speech (POS) tagging ii) Lemmatization iii) Stemming
Write short note on: i) Time series Analysis ii) TF - IDF.
What is clustering? With suitable example explain the steps involved in k - means algorithm.
Discuss Holdout method and Random Sampling methods.
Write short note on: i) Confusion matrix ii) AUC - ROC curve
Write a short note on the following i) Map reduce ii) Pig iii) Hive
Describe the Data visualization tool “Tableau”. Explain its applications in brief.
With a suitable example explain Histogram and explain its usages.
Explain in detail the Hadoop Ecosystem with suitable diagram.
With a suitable example explain and draw a Box plot and explain its usages.
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 70 |
| Total Questions | 8 |
| Duration | 2½ Hours |
| Paper Number | [6353]-56 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence & Data Science |
| Exam Type | ENDSEM |
| Exam Session | 2024 Nov Dec Endsem |
| Watermark | ['CEGP013091', '49.248.216.237 25/11/2024 09:48:43 static-237'] |
Compare Business Intelligence Vs. Data Science.
List and explain data reduction techniques with example.
What is Data wrangling? Explain various methods of Data wrangling.
What is Data Science? Differentiate between Data Science and Information Science?
Describe Bayes theorem with an example.
Differentiate Structured and Unstructured Data.
For the given numbers find out the variance and standard deviation.1, 2, 3, 4,5, 6.
Explain the use of hypothesis and hypothesis testing.
What is Data Integration and Discretization explain?
Define measures of central tendency and given the five annual salaries of an industry shown in Table, determine the mean and the median. Position: CEO, Salary: $10,00,000; Position: Manager, Salary: $55,000; Position: Developer, Salary: $40,000; Position: Tester, Salary: $30,000; Position: Custodian, Salary: $20,000
What does the Pearson correlation coefficient test do explain with an example.
Explain Type I and Type II errors with an example.
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 30 |
| Total Questions | 4 |
| Duration | 1 Hour |
| Paper Number | [6009]-424 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence & Data Science |
| Exam Type | INSEM |
| Exam Session | 2023 Feb Insem |
| Watermark | ['CEGP013091', '49.248.216.238 03/04/2023 12:11:40 static-238'] |
What is Data Science? Explain its use in AI.
Explain the various applications of Data Science.
Imagine you have given the data set with the Null or Nan values. Apply data wrangling methods used in data science.
Explain null and alternative hypothesis by considering the example for a flipping coin.
Describe Bayes theorem in detail with an example.
Explain Standard deviation and Inter Quartile range and write python code to compute the standard deviation and interquartile range.
What is data Wrangling? Explain various methods of data wrangling.
Explain Data Cleaning, Data Transformation.
Explain Data Integration and Discretization.
What is hypothesis testing?
Explain different measures of dispersion.
Solve the following. Suppose the probability of the weather being cloudy is 40%. Also suppose the probability of rain on a given day is 20%. Also suppose the probability of clouds on a rainy day is 85%. If it’s cloudy outside on a given day, what is the probability that it will rain that day?
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 30 |
| Total Questions | 4 |
| Duration | 1 Hour |
| Paper Number | [6269]-324 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence and Data Science |
| Exam Type | INSEM |
| Exam Session | 2024 March Insem |
| Watermark | ['CEGP013091', '49.248.216.238 20/03/2024 10:40:44 static-238'] |
Differentiate Structured and Unstructured Data.
List and explain various Data Discretization techniques.
What is Data Wrangling? Why data wrangling is important in data science?
What is Data Science? How data science is similar to and differ from Information Science?
Describe Bayes theorem with an example.
Explain the Data Transformation methods in detail?
What are different measures of dispersion? For the given numbers find out variance and standard deviation 4, 34, 11, 12, 2 and 26.
Here are the 19 scores listed out. 5, 7, 10, 15, 19, 21, 21, 22, 22, 23, 23, 23, 23, 23, 24, 24, 24, 24, 25 Calculate IQR for below the first quartile and above the third quartile. Identify outliers if any?
Define Type I and Type II Error. Give example to differentiate between the two types of error.
What are different measures of central tendency? A pizza outlet overview its weekly sales. They sold 57 cheese pizzas, 63 pasta pizzas, 53 veggies pizzas, 68 cottage cheese pizzas, and 56 max cheese pizzas, Find the mean of all the pizzas sold by them.
Explain the Chi-Square hypothesis testing with an example.
What is T-test? What are the types of T-tests? Explain by example the number of variables and degree of freedom.
| Subject Name | Data Science |
|---|---|
| Semester | II |
| Pattern Year | 2019 |
| Subject Code | 317529 |
| Max Marks | 30 |
| Total Questions | 4 |
| Duration | 1 Hour |
| Paper Number | [6410]-424 |
| Academic Year | T.E. |
| Branch Name | Artificial Intelligence and Data Science |
| Exam Type | INSEM |
| Exam Session | 2025 March Insem |
| Watermark | ['CEGP013091', '49.248.216.237 10/03/2025 10:30:17 static-237'] |
Select a question to generate an answer