PADMANVANI ARTS AND COLLEGE FOR WOMEN,
SALEM-11
DEPARTMENT OF COMPUTER SCIENCE
FUNDAMENTALS OF DATA SCIENCE
MODEL – I
Class: II B.Sc [Computer Science] - [Artificial Intelligence & Data Science]
Duration:
Date: Max. Marks:
PART – A 15*1=15
Answer all Questions
1. In which field does data science contribute to scientific and social advancements,
like genomics and climate change research?
A. Retail B. Manufacturing
C. Research and development D. Hospitality
2. Which type of data does data science analyze to uncover trends in patient care?
A. Social media posts B. General internet data
C. Electronic Health Records (EHRs) D. Weather data
3. Which NLP task would most likely involve the use of a Generative Pretrained
Transformer (GPT) model?
A) Text summarization B) Sentiment analysis
C) Text generation D) Named Entity Recognition
4. Which type of frequency distribution is best suited for visualizing continuous data?
A. Cumulative frequency distribution B. Grouped frequency distribution
C. Raw frequency table D. Nominal frequency distribution
5. What is the sum of all relative frequencies in a dataset?
A. Equal to the total frequency. B. Equal to the range of the dataset.
C. Always equal to 1 (or 100%). D. Equal to the number of class intervals.
6. When a graph omits the units of measurement, it is considered:
A. Informative. B. Misleading. C. Unbiased. D. Accurate.
7. If a z-score is negative, it means the data point is:
A) Above the mean B) Below the mean C) Equal to the mean
D) Greater than one standard deviation away from the mean
8. What is the shape of a normal distribution curve?
A) Skewed to the right B) Skewed to the left C) Bell-shaped D) Uniform
9. A z-score of -0.5 means that the data point is:
A) 0.5 units above the mean B) 0.5 standard deviations above the mean
C) 0.5 standard deviations below the mean D) 0.5 units below the mean
10. What does arr[1, 2] return if arr = np.array([[10, 20, 30], [40, 50, 60]])?
A) 20 B) 30 C) 50 D) 60
11. What does np.array([True, False]) & np.array([False, True]) return?
A) [True, True] B) [False, True]
C) [False, False] D) [True, False]
12. Which function is used to set the limits of x and y axes?
A) plt.axis_range() B) plt.axis_limits()
C) plt.xlim() and plt.ylim() D) plt.set_axis()
13. Which parameter ensures shared x-axis or y-axis in subplots?
A) sharex and sharey B) align_x and align_y
C) linkx and linky D) syncx and syncy
14. To display horizontal error bars, you need to specify:
A) Only yerr B) Only xerr C) Both xerr and yerr D) Neither
15. Which function is used to create a 3D surface plot in Matplotlib?
A) ax.plot_surface() B) ax.surface3d()
C) plt.surface() D) ax.plot_surface3d()
PART – B 5*2=10
Answer any two
1. What is the role of data integration in data science?
2. How would you construct a frequency distribution for nominal data?
3. What is the computational formula for the correlation coefficient, and how is it used?
4. What are pivot tables, and how are they useful in data analysis?
5. How do you create a histogram in Matplotlib, and what does it represent?
PART – C 10*5=50
Answer all the questions
6. A) Describe the different facets of data in data science. Include the distinction
between structured, semi-structured, and unstructured data.
(OR)
B) Explain how integrating data from multiple sources can improve the accuracy of
machine learning models and decision-making.
7. A) Explain the measures of central tendency: mode, median, and mean. How do you
calculate each, and in what cases is one measure preferred over the others?
(OR)
B) Discuss the importance of graphs in interpreting frequency distributions. Include
a discussion on histograms, bar charts, and cumulative frequency graphs.
8. A) What are the properties of the normal curve, and how do they relate to the
concept of standard deviation? How can you find proportions and probabilities
from the normal distribution?
(OR)
B) Explain how to compute the correlation coefficient for quantitative data.
Provide the formula for the correlation coefficient and explain each term.
9. A) What are some common methods for performing computations on NumPy
arrays, such as element-wise operations? Provide examples of addition,
multiplication, and more complex mathematical operations.
(OR)
B) Discuss the role of pivot tables in data analysis. How can you create and
manipulate pivot tables in Python using NumPy or Pandas? Include examples
of aggregation and summarization.
10. A) Describe the basic steps to create a line plot in Matplotlib. Include the purpose
of plt.plot() and how you can customize the plot with labels, colors, and titles.
(OR)
B) Describe the creation of advanced visualizations like contour plots and
heatmaps using Matplotlib. Discuss their use cases, such as visualizing
gradients or relationships in large datasets.