PART I: MCQs 20 Marks
1. What do you mean by data science?
(a) Dealing with huge amounts of data to find marketing patterns is known as data science
(b) Extracting a meaningful insight from the data is what data science is
(c) It is a study that deals with a huge amount of all types of data: structured, unstructured, or semi-structured
(d) All of the above
2. What is the main goal of machine learning?
(a) To make computers intelligent
(b) To automate manual tasks
(c) To enable computers to learn from data
(d) To create self-aware machines
3. Which type of machine learning algorithm aims to mimic the process of human learning?
(a) Supervised learning
(b) Unsupervised learning
(c) Reinforcement learning
(d) Deep learning
4. Which technique is used to reduce the impact of noise and outliers in a dataset?
(a) Feature extraction
(b) Regularization
(c) Cross-validation
(d) Principal Component Analysis (PCA)
5. Identify the type of learning in which labeled training data is used.
(a) Semi unsupervised learning
(b) Supervised learning
(c) Reinforcement learning
(d) Unsupervised learning
6. Machine learning is a subset of which of the following?
(a) Artificial intelligence
(b) Deep learning
(c) Data learning
(d) None of the above
7. Which of the following are common classes of problems in machine learning?
(a) Regression
(b) Classification
(c) Clustering
(d) All of the above
8. Identify the successful applications of Machine Learning.
(a) Learning to classify new astronomical structures
(b) Learning to recognize spoken words
(c) Learning to drive an autonomous vehicle
(d) All of the above
9. Into how many groups can data be characterized?
(a) 1
(b) 2
(c) 3
(d) 4
10. What are the different types of machine learning?
(a) Supervised
(b) Unsupervised
(c) Reinforcement
(d) All of the above
11. What is the primary goal of descriptive statistics?
(a) To make predictions about the future based on past data
(b) To summarize and describe the main features of a dataset
(c) To infer conclusions about a population based on a sample
(d) To test hypotheses and relationships between variables
12. What does the mode of a dataset represent?
(a) The most frequently occurring value
(b) The middle value
(c) The arithmetic average
(d) The spread of data points
13. In a symmetrical dataset, which measures of central tendency will be approximately equal?
(a) Mean and median
(b) Mean and mode
(c) Median and mode
(d) Mean, median, and mode
14. What is common between the following? Facebook, Twitter, Instagram, Pinterest.
(a) All are accessible from the dark web.
(b) All are social networking sites.
(c) All are new web sites.
(d) All of the above
15. In terms of Facebook's ranking algorithm, what does "weight" refer to?
(a) The number of comments, likes, etc. a content post receives
(b) How often a profile produces content
(c) How many highly ranked content posts a profile has in a given week
(d) How many followers/friends a company has
16. Degree centrality is defined as
(a) The ratio of the number of neighbors of a vertex to the total number of possible neighbors
(b) Central nodes of all the given nodes in networks
(c) Total number of nodes in the given network
(d) None of the above
17. Which social network is considered the most popular for business-to-business marketing?
(a) Facebook
(b) LinkedIn
(c) TikTok
(d) WhatsApp
18. Which of the following is the most viral section of the internet?
(a) Chat Messenger
(b) Social networking sites
(c) Tutorial sites
(d) Chat-rooms
19. Which of the following is not an appropriate measure for securing social networking accounts?
(a) Strong passwords
(b) Link your account with a phone number
(c) Never write your password anywhere
(d) Always maintain a soft copy of all your passwords on your PC
20. Which type of data analysis focuses on the question "what might happen in the future" and helps in making predictions about some sort of data?
(a) Descriptive data analysis
(b) Diagnostic data analysis
(c) Predictive data analysis
(d) Prescriptive data analysis
PART II: Structured Questions 20 Marks
1. In descriptive statistics, differentiate between measures of central tendency and measures of variability. (4Mks)
2. List two advantages and two disadvantages of supervised learning. (4Mks)
3. List three (03) applications of unsupervised learning. (3Mks)
4. In Network Analysis, what are the indicators used to measure the centrality of a node? (3Mks)
5. List and explain the functioning of two (02) basic graphs in data science. (4Mks)
6. What are the similarities between data analysis and data science? (2Mks)
PART III: Application (Python programming basics) 20 Marks
Note: for each answer, give an application
1. What are the basic data types in Python? (Show how you can declare a variable of each type.) (4.5Mks)
2. Illustrate how to compute mean, median and mode in a given dataset. (4.5Mks)
3. What is the use of the range() function? (2Mks)
4. Distinguish between an array and a list. (3Mks)
5. How can you add comments in your code? (2Mks)
6. How can you add an element to a list? How can you remove an element from a list? (4Mks)