0% found this document useful (0 votes)
72 views7 pages

Data Analytics Course Handout

The document is a course handout for a B.Tech. Data Analytics course (CSS 2103) scheduled from July to December 2025, detailing the course structure, assessment methods, and syllabus. It covers key topics such as data exploration, preparation, recommender systems, and time series analysis, with a focus on practical applications and real-world case studies. The course aims to equip students with essential skills in data analysis and decision-making, supported by various learning outcomes and references.

Uploaded by

manyu123.5a
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
72 views7 pages

Data Analytics Course Handout

The document is a course handout for a B.Tech. Data Analytics course (CSS 2103) scheduled from July to December 2025, detailing the course structure, assessment methods, and syllabus. It covers key topics such as data exploration, preparation, recommender systems, and time series analysis, with a focus on practical applications and real-world case studies. The course aims to equip students with essential skills in data analysis and decision-making, supported by various learning outcomes and references.

Uploaded by

manyu123.5a
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

[Link].

Second Year

Course Handout
Session (Semester): July 2025 to December 2025
Branch: School of Computer Engineering
Class: Theory Course
Course Name (Code): DATA ANALYTICS (CSS 2103)
Contact Hours/Week:

L T P C
3 1 0 4

Course Coordinator: Dr. Napa Lakshmi

Course Instructor: Dr. Jayita Saha, Dr. Shilpa Anakalki, Dr. Niranjan Kumar,
Dr. Raghavendra M Devdas, Dr. Sangeeta Sangani, Dr. Amreen Ayesha, Dr. Snigdha
Sen

A. Introduction: This course on Data Analytics introduces students to the fundamentals


of analyzing and interpreting data to support informed decision-making. It covers key
types of analytics—descriptive, predictive, and prescriptive—along with the steps
involved in a typical analytics project. Students will explore techniques for data
collection, cleaning, visualization, and statistical analysis. The course includes
multivariate analysis, feature engineering, and dimensionality reduction methods like
PCA. Learners will also gain insights into building recommender systems using
association rules and collaborative filtering. In addition, the course introduces time
series analysis for handling and forecasting temporal data. Real-world case studies and
hands-on practice will help students apply analytical tools effectively.

B. Assessment Rubrics:

Criteria Description Duration Maximum


Marks
Mid Term Test 120 Minutes 30
Internal Assessment Quiz Will be informed 10
(Summative) by Instructor
Assignment 10
End Term Exam End Semester / Make-up 180 minutes 50
(Summative) Exam
Total 100
Attendance A minimum of 75% Attendance is required to be maintained by a
(Formative) student to be qualified for taking up the End Semester examination. The
allowance of 25% includes all types of leaves including medical
leaves.
C. Syllabus:

Module -1 INTRODUCTION TO ANALYTICS:


Overview, Sources of Data, Process for making sense of data, Descriptive, Predictive,
Prescriptive Analytics. 4 Hours

Module -2 DATA EXPLORATION


Overview, Observations and variables, Types of Variables, Central Tendency, Distribution
of the data, confidence intervals, preparing data tables, visualizing relationships between
variables calculating metrics about relationships, case studies. 8 Hours

Module – 3 DATA PREPARATION


Data preprocessing an overview, Major tasks in data preprocessing, data cleaning,
missing values, Noisy data, data cleaning as a process, data integration, entity
identification problem, redundancy and correlation analysis, data reduction strategies,
Principal component analysis, attribute subset selection, regression and Log linear
Models; parametric Data reduction, histograms, clustering, sampling, and data cube
aggregation , data transformation strategies, data transformation by normalization,
discretization by binning, discretization by histogram analysis, discretization by cluster,
decision tree, and correlation analyses, concept hierarchy generation for nominal data.
Hypothesis testing : setting up a hypothesis test, one-tailed and two-tailed test, Type I
error, Type II error , One sample and two sample Z-test and t-Test. Nonparametric tests:
chi-Square tests Analysis variance : One-way, and two way. 16 Hours

Module – 4 RECOMMENDER SYSTEMS


Frequent Item sets, closed Item sets, and association rules, Apriori algorithm, generating
association rules from frequent item sets, improving the efficiency of Apriori, A pattern
growth approach for mining frequent item sets, mining frequent item sets, using vertical
data format, mining closed and max patterns, pattern evaluation methods
10 Hours
Module – 5: Time Series:
Introduction to forecasting, time series Data and components of time series data,
forecasting techniques and forecasting accuracy, Moving average method, Single,
Double and triple exponential smoothing, Regression model for forecasting, ARMA,
ARIMA models, Case studies 10 Hours
D. References:

1. Glenn J. Myatt, Wayne P. Johnson, Making Sense of Data: A Practical Guide to


Exploratory Data Analysis and Data Mining, John Wiley Publication, Second Edition,
2014.
2. Jiawei Han, Micheline Kamber, Jian Pei, Data Mining Concepts and Techniques
Morgan Kaufmann Publishers, Third Edition, 2012.
3. U. Dinesh Kumar, Business Analytics: The Science of Data-Driven Decision Making,
Second Edition, Wiley Publications, 2021
4. Joel Grus, "Data Science from Scratch: First Principles with Python", O'Reilly Media,
2019.
5. Anil Maheshwari, Data analytics: A comprehensive guide to data analysis and decision-
making, Wiley Publications, 2021.
6. [Link] -Introduction to Data Analytics.
7. [Link] -Data Analytics with Python.

Course Outcomes:
At the end of the course, students will be able to:

CO1: Interpret diverse datasets using data exploratory techniques.

CO2: Analyse different data preparation techniques to improve data quality and model
performance.

CO3: Implement multivariate data analysis methods for efficient decision making.

CO4: Evaluate association rule mining techniques to develop recommender systems.

CO5: Apply time series analysis techniques for forecasting and trend analysis in data-
driven applications.

Program Outcomes and Program Specific Outcomes

[PO.1]. Engineering knowledge: Apply the knowledge of mathematics, science,


engineering fundamentals, and engineering specialization to the solution of complex
engineering problems.
[PO.2]. Problem analysis: Identify, formulate, research literature, and analyze complex
engineering problems reaching substantiated conclusions using first principles of
mathematics, natural sciences, and engineering sciences.
[PO.3]. Design/development of solutions: Design solutions for complex engineering
problems and design system components or processes that meet the specified needs with
appropriate consideration for public health and safety, and the cultural, societal, and
environmental considerations.
[PO.4]. Conduct investigations of complex problems: Use research-based knowledge
and research methods including design of experiments, analysis and interpretation of data,
and synthesis of the information to provide valid conclusions.
[PO.5]. Modern tool usage: Create, select, and apply appropriate techniques, resources,
and modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations.
[PO.6]. The engineer and society: Apply reasoning informed by contextual knowledge to
assess societal, health, safety, legal, and cultural issues, and the consequent
responsibilities relevant to professional engineering practice.
[PO.7]. Environment and sustainability: Understand the impact of professional
engineering solutions in societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
[PO.8]. Ethics: Apply ethical principles and commit to professional ethics and
responsibilities and norms of the engineering practices.
[PO.9]. Individual and teamwork: Function effectively as an individual, and as a member
or leader in diverse teams, and in multidisciplinary settings.
[PO.10]. Communication: Communicate effectively on complex engineering activities
with the engineering community and with society at large, such as, being able to
comprehend and write effective reports and design documentation, make effective
presentations, and give and receive clear instructions.
[PO.11]. Project management and finance: Demonstrate knowledge and understanding
of the engineering and management principles and apply these to one’s own work, as a
member and leader in a team, to manage projects and in multidisciplinary environments.
[PO.12]. Life-long learning: Recognize the need for and have the preparation and ability
to engage in independent and life-long learning in the broadest context of technological
change.

PROGRAM SPECIFIC OUTCOMES (PSOs)

PSO1: Analyse and solve real world problems by using computational models of software
and hardware.
PSO2: Apply mathematical and algorithmic principles to build system and application- level
software.
PSO3: Design and develop intelligent systems by applying prediction models, computational
linguistics, vision-based and data interpretation techniques to meet future challenges.
PSO4: Design & develop automated systems to solve societal issues using Artificial
Intelligence.

E. Lecture Plan
L No TOPICS Course
Outcome
Addressed

1 Overview, Sources of Data, Process for making sense of data CO1

2 Descriptive, and Predictive analytics CO1

3 Continuation of Predictive analytics, and Prescriptive Analytics CO1

4 T1 Data analytics basics CO1


5 Overview, Observations and variables, Types of Variables CO1
6 Central Tendency, Distribution of the data CO1
7 confidence intervals, preparing data tables CO1
8 T2 Examples on L4 to L6 CO1
9 Visualizing relationships between variables CO1
10 Calculating metrics about relationships, CO1

11 Case Studies CO1


T3 12 Examples on Visualizing relationships and Calculating metrics CO1
about relationships
13 Data preprocessing an overview, Major tasks in data CO2
preprocessing
14 Data cleaning, missing values, Noisy data, data cleaning as a CO2
process, data integration
15 Entity identification problem, redundancy and correlation CO2
analysis
T4 16 Examples on L12, L13 and L14 CO2
17 data reduction strategies, PCA CO2

18 Principal component analysis, attribute subset selection, CO2


19 regression and Log linear Models; CO2
T5 20 Examples on L17, L18 and L19 CO2
SDL21 Parametric Data reduction, histograms, clustering, sampling, CO3
and data cube aggregation
22 data transformation strategies, data transformation by CO3
normalization analysis, discretization by binning, discretization
by histogram
23 Discretization by cluster Decision tree, and correlation CO3
analyses, Concept hierarchy generation for nominal data.

T6 24 Examples on Data reduction, histograms, clustering etc. CO3

25 Hypothesis testing : setting up a hypothesis test, one-tailed and CO3


two-tailed test, Type I error, Type II error , One sample and two
sample Z-test and t-Test.
26 One sample and two sample Z-test and t-Test. CO3
27 Non parametric tests: chi-Square tests Analysis variance : One- CO3
way, and two way

T7 28 Examples on Hypothesis testing CO3

29 Frequent Item sets, closed Item sets, and association rules CO4

30 Apriori algorithm, generating association rules from frequent CO4


item sets
31 SDL improving the efficiency of Apriori CO4
T8 32 Examples on Apriori CO4
33 A pattern growth approach for mining frequent item sets CO4

34 mining frequent item sets, using vertical data format, CO4

35 SDL mining closed and max pattern CO4


T9 36 Examples on L33 to L35 CO4

37 Continuation mining closed and max patterns, CO4

38 pattern evaluation methods CO4

39 Introduction to forecasting, time series Data and components of CO5


time series data
T10 40 Examples on pattern evaluation CO5

SDL 41 Forecasting techniques and forecasting accuracy CO5


42 Moving average method CO5

43 Single, Double and triple exponential smoothing, CO5


T11 44 Examples on Forecasting CO5
45 Regression model for forecasting CO5

46 ARMA, ARIMA models CO5


47 Case studies CO5

T 12 Examples on Time Series data and analysis CO5


48
F. Course Articulation Matrix (Mapping of COs with POs)

CO STATEMENT CORRELATION WITH PROGRAM


OUTCOMES
PO PO2 PO3 PO 4 PO 5 PO 6 PO 7 PO 8 PO 9 PO PO PO
1 10 11 12

CO1 Interpret diverse 1 3 2 2 3 1 1


datasets using data
exploratory
techniques.
CO2 Analyse different 1 3 3 3 3 2 1 1
data preparation
techniques to
improve data quality
and model
performance.
CO3 Implement 1 3 3 3 3 2 1 1 1 1
multivariate data
analysis methods
for efficient
decision making.
CO4 Evaluate 1 3 3 3 3 2 2 1 1 1 1
association rule
mining techniques
to develop
recommender
systems
CO5 Apply time series 1 3 3 3 3 2 1 1 1 1
analysis
techniques for
forecasting and
trend analysis in
data-driven
applications.

Note: CO to PO & PSO mapping level (1-low, 2-moderate and 3-substantial). PSO (If applicable)

Submitted By: (Dr. Napa Lakshmi) HOD Director

You might also like