Topic: Data Science Essentials
Fundamentals:
1. Define Data Science in your own words.
2. What is the difference between structured and unstructured data? Provide an example of each.
3. Explain the importance of data cleaning in the data science process.
4. What are the key steps involved in a typical data science workflow?
5. Differentiate between exploratory data analysis (EDA) and confirmatory data analysis.
Statistics and Probability:
1. Define mean, median, and mode. When is the median a better measure of central tendency than the mean?
2. What is standard deviation, and what does it tell you about a dataset?
3. Explain the concept of correlation. What is the difference between positive and negative correlation?
4. What is a probability distribution? Give an example of a common discrete probability distribution.
5. State the Central Limit Theorem and explain its significance in statistics.
Machine Learning Basics:
1. What is the difference between supervised and unsupervised learning? Provide an example of each.
2. Explain the concept of features and labels in supervised learning.
3. What is the purpose of splitting data into training and testing sets?
4. Define overfitting and underfitting in the context of machine learning models.
5. What is the bias-variance tradeoff?
Common Algorithms (Briefly Explain):
1. Briefly describe the K-Nearest Neighbors (KNN) algorithm.
2. What is a decision tree, and how does it make predictions?
3. Explain the basic idea behind linear regression.
4. What is the goal of clustering algorithms? Give an example of a clustering algorithm.
5. Briefly describe the concept of a neural network.
Evaluation and Metrics:
1. For a binary classification problem, what do True Positives, True Negatives, False Positives, and False Negatives
represent?
2. Define accuracy, precision, and recall. When might precision be more important than recall?
3. What is the F1-score, and why is it often a useful metric?
4. For a regression problem, what is Mean Squared Error (MSE)?
5. Explain the concept of cross-validation and why it's used.
Each Question Carrying: 2 Marks
Minimum 30% is Required to get Certificate
Instruction: All Students are requested to Mention Exact Name, Department of Study , University Roll Number,
Subject Name Properly Otherwise Answer-sheet will be canceled.
After Successful Completion of the Program Kindly Send Hand Written Answer-sheet to
[email protected]