COURSE PLAN
Course code
CS3352 FOUNDATIONS OF DATA SCIENCE LTPC 3003
and title
Class SECOND YEAR Semester 03
Academic
Regulation 2021 2023-2024
year
Course
prerequisites
Basic computer knowledge (Windows Os and MS office packages)
To understand the data science fundamentals and process.
To learn to describe the data for the data science process.
Course To learn to describe the relationship between data
objectives
To utilize the Python libraries for Data Wrangling.
To present and interpret data using visualization libraries in Python
COURSE OUTCOMES
At the end of the course, students can able to
CO1 Define the data science process
CO2 Understand different types of data description for data science process
CO3
Gain knowledge on relationships between data
CO4
Use the Python Libraries for Data Wrangling
CO5
Apply visualization Libraries in Python to interpret and explore data
MAPPING OF PROGRAM OUTCOMES
Engineering knowledge: Apply the knowledge of mathematics, science, engineering
PO1 fundamentals, and an engineering specialization for the solution of complex engineering
problems.
Problem analysis: Identify, formulate, research literature, and analyze complex
PO2 engineering problems reaching substantiated conclusions using first principles of
mathematics, natural sciences, and engineering sciences.
Design/development of solutions: Design solutions for complex engineering problems
and design system components or processes that meet the specified needs with appropriate
PO3
consideration for public health and safety, and cultural, societal, and environmental
considerations.
PROGRAM SPECIFIC OUTCOMES
PSO1 Analyze, design and develop computing solutions by applying foundational concepts of
computer science and engineering.
Apply software engineering principles and practices for developing quality software for
PSO2
scientific and business applications.
Adapt to emerging information and communication technologies (ICT) to innovate ideas
PSO3
and solutions to existing/novel problems.
RELATIONSHIP OF COURSE TO PROGRAMME
EDUCATIONAL OBJECTIVES
Graduates will be able to design and develop projects in industry/ research organization
PEO1 or pursue higher studies with a strong foundation in Computer fundamentals,
mathematical, scientific and engineering principles.
Graduates will be able to work in a team and/or as an individual by effective
PEO2
communication to provide innovative solutions through continuous learning.
Graduates will be able to establish themselves as effective professionals and satisfy the
PEO3
recent trends in business and industry with ethical and social responsibilities.
TEXT BOOKS :
1. David Cielen, Arno D. B. Meysman, and Mohamed Ali, “Introducing Data Science”, Manning Publications,
2016. (Unit I)
2. Robert S. Witte and John S. Witte, “Statistics”, Eleventh Edition, Wiley Publications, 2017. (Units II and III)
3. Jake VanderPlas, “Python Data Science Handbook”, O’Reilly, 2016. (Units IV and V) REFERENCES: 1.
Allen B. Downey, “Think Stats: Exploratory Data Analysis in Python”, Green Tea Press,2014.
REFERENCES :
R1. 1. Allen B. Downey, “Think Stats: Exploratory Data Analysis in Python”, Green Tea Press,2014.
e-learning 1.https://www.coursera.org/specializations/data science process
resources 2. https://www.udemy.com/learning- data science process
Mode of
Evaluation
Internal Mark (20%), End Semester Examination (80%)
Faculty Dr.D.Karthikeswaran
e-mail id
[email protected] LESSON PLAN
Reference/
No of
Planned Teaching Actual Weekly
lecture Topics to be covered
Date aids and date review
hours
methods
UNIT 1 - INTRODUCTION
T1/SB /PPT
1 Data Science: Benefits And Uses
T1/SB /PPT
2 Facets Of Data
T1/SB /PPT
3 Data Science Process
T1/SB /PPT
4 Defining Research Goals , Retrieving Data
T1/SB /PPT
5 Data Preparation , Exploratory Data Analysis
T1/SB /PPT
6 Build The Mode
T1/SB /PPT
7 Presenting Findings And Building Applications
T1/SB /PPT
8 Data Mining , Data Warehousing
T1/SB /PPT
9 Basic Statistical Descriptions Of Data
Planned Actual
Date Sign of Faculty Review by HoD
hours hours
UNIT II - DESCRIBING DATA
T1/SB /PPT
10 Describing Data
T1/SB /PPT
11 Types of Data
T1/SB /PPT
12 Types of Variables
T1/SB /PPT
13 Describing Data with Tables
T1/SB /PPT
14 Describing Graphs
T1/SB /PPT
15 Describing Variability
T1/SB /PPT
16 Normal Distributions
17 Standard (z) Scores T1/SB /PPT
Circulate the values of n variables, Distance T1/SB /PPT
18
between two points
Planned Actual Review by HoD
Date Sign of Faculty
hours hours
UNIT III - DESCRIBING RELATIONSHIPS
T1/SB /PPT
19 Correlation –Scatter Plots
T1/SB /PPT
20 Correlation Coefficient For Quantitative Data
T1/SB /PPT
21 Computational Formula For Correlation Coefficient
T1/SB /PPT
22 Regression –Regression Line
T1/SB /PPT
23 Least Squares Regression Line
T1/SB /PPT
24 Standard Error Of Estimate
T1/SB /PPT
25 Interpretation Of R2
T1/SB /PPT
26 Multiple Regression Equations
27 T1/SB /PPT
Regression Towards The Mean
Planned Actual Review by HoD
Date Sign of Faculty
hours hours
UNIT IV - PYTHON LIBRARIES FOR DATA WRANGLING
T1/SB /PPT
28 Basics Of Numpy Arrays, Aggregations ,
T1/SB /PPT
29 Computations On Arrays
T1/SB /PPT
30 Computations On Arrays , Fancy Indexing
T1/SB /PPT
31 Structured Arrays , Data Manipulation With Pandas
T1/SB /PPT
32 Data Indexing And Selection,
33 Operating On Data –, Missing Data T1/SB /PPT
T1/SB /PPT
34 Hierarchical Indexing
T1/SB /PPT
35 Combining Datasets
Aggregation And Grouping , Pivot Tables T1/SB /PPT
36
Planned Actual Review by HoD
Date Sign of Faculty
hours hours
UNIT V - DATA VISUALIZATION
T1/SB /PPT
37 Importing Matplotlib
T1/SB /PPT
38 Line Plots – Scatter Plots
T1/SB /PPT
39 Visualizing Errors
T1/SB /PPT
40 Density And Contour Plots
T1/SB /PPT
41 Histograms – Legends – Colors
T1/SB /PPT
42 Subplots – Text And Annotation
T1/SB /PPT
43 Customization – Three Dimensional Plotting
T1/SB /PPT
45 Geographic Data With Basemap
45 Visualization With Seaborn T1/SB /PPT
Planned Actual Review by HoD
Date Sign of Faculty
hours hours
Topics beyond the Curriculum / Guest lecture(s) / Industry / Laboratory Visit Proposed
Planned Actual Review by HoD
hours hours
Date Topics beyond curriculum
Explosion In Deepfake Video And
45 45
Audio
Course outcome –Program outcome Mapping Table
Program outcomes
1.Low Correlation 2. Moderate Correlation 3- High
correlation
Cognitive
Course code and Title: level
PO10
PO11
PO12
PO5
PO1
PO2
PO3
PO4
PO6
PO7
PO8
PO9
DIRECT METHOD
Define the data science
CO1 Understand 2 1 1
process
Understand different types
CO2 of data description for data Understand 2 1 1
science process
Gain knowledge on
CO3 Analyze 2 2 1
relationships between data
Use the Python Libraries
CO4 Evaluate 1 1 1
for Data Wrangling
Apply visualization
CO5 Libraries in Python to Apply 2 1 1
interpret and explore data
INDIRECT METHOD (Based on Classroom activity/ Event)
Data Science Dumb charade
Analyze 1 2 2
Course outcome –Program Specific outcome Mapping Table
(1.Low Correlation 2. Moderate Correlation 3- High correlation)
Cognitive Program Specific outcomes
Course code and Title: Level
PSO1 PSO2 PSO3
Define the data science process Understand 2 2 1
Understand different types of data
Understand 2 1 1
description for data science process
Gain knowledge on relationships
Analyze 2 2 1
between data
Use the Python Libraries for Data
Evaluate 1 1 1
Wrangling
Apply visualization Libraries in Python
Apply 2 1 1
to interpret and explore data
Indirect Method (Based On Class Room Activity / Event)
Data Science Dumb charade
Analyze 2 1 1
Date of creation: 22-08-2022
Course Instructor IQAC Member HoD