M.Sc. COMPUTER SCIENCE
SYLLABUS
SECOND SEMESTER
CORE PAPER
Course code: 23PVPCSC04
Course title: DATA MINING AND WAREHOUSING
Core/Elective/Supportive: Core
L T P C: 5 5
Pre-requisite: Basics of RDBMS & Algorithms
Course Objectives:
The main objectives of this course are to:
1. Enable the students to learn the concepts of mining tasks, classification, clustering and data warehousing.
2. Develop skills of using recent data mining software for solving practical problems.
3. Develop and apply critical thinking, problem-solving, and decision-making skills.
Expected Course Outcomes:
On successful completion of the course, the student will be able to:
1. Understand the basic data mining techniques and algorithms. (K1, K2)
2. Understand the association rules, clustering techniques and data warehousing contents. (K2, K3)
3. Compare and evaluate different data mining techniques like classification, prediction, clustering and association rule mining. (K4, K5)
4. Design a data warehouse with dimensional modeling and apply OLAP operations. (K5, K6)
5. Identify appropriate data mining algorithms to solve real-world problems. (K6)
K1 - Remember; K2 - Understand; K3 - Apply; K4 - Analyze; K5 - Evaluate; K6 - Create
Unit:1 BASICS AND TECHNIQUES 12 hours
Basic data mining tasks – data mining versus knowledge discovery in databases – data mining issues –
data mining metrics – social implications of data mining – data mining from a database perspective.
Data mining techniques: Introduction – a statistical perspective on data mining – similarity measures –
decision trees – neural networks – genetic algorithms.
Unit:2 ALGORITHMS 12 hours
Classification: Introduction – statistical-based algorithms – distance-based algorithms –
decision tree-based algorithms – neural network-based algorithms – rule-based algorithms –
combining techniques.
Unit:3 CLUSTERING AND ASSOCIATION 12 hours
Clustering: Introduction – similarity and distance measures – outliers – hierarchical
algorithms – partitional algorithms. Association rules: Introduction – large itemsets – basic
algorithms – parallel & distributed algorithms – comparing approaches – incremental rules –
advanced association rules techniques – measuring the quality of rules.
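As an illustration of "measuring the quality of rules", the two most common measures, support and confidence, can be sketched in a few lines of Python. This is a minimal sketch only: the function names and the toy transactions are assumptions made for illustration and do not come from the prescribed text books.

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimate of P(consequent | antecedent) for the rule antecedent -> consequent."""
    base = support(antecedent, transactions)
    if base == 0:
        return 0.0  # rule never fires; treat confidence as 0 by convention
    both = support(set(antecedent) | set(consequent), transactions)
    return both / base


# Hypothetical market-basket data, invented for this example.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
```

Here the rule bread -> milk has support 2/4 and confidence 2/3, since two of the three transactions containing bread also contain milk.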
Unit:4 DATA WAREHOUSING AND MODELING 11 hours
Data warehousing: Introduction – characteristics of a data warehouse – data marts – other
aspects of data mart. Online analytical processing: Introduction – OLTP & OLAP systems.
Data modeling – star schema for multidimensional view – data modeling – multifact star schema
or snowflake schema – OLAP tools – state of the market – OLAP tools and the internet.
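As an illustration of the star schema and an OLAP roll-up, the sketch below models one fact table that references two dimension tables and aggregates a measure by a dimension attribute. It is a minimal in-memory sketch: the table names, keys and figures are hypothetical, and a real warehouse would hold these tables in a database.

```python
from collections import defaultdict

# Dimension tables: surrogate key -> descriptive attributes (hypothetical data).
product_dim = {
    1: {"name": "pen", "category": "stationery"},
    2: {"name": "notebook", "category": "stationery"},
    3: {"name": "lamp", "category": "electrical"},
}
store_dim = {
    10: {"city": "Salem"},
    11: {"city": "Chennai"},
}

# Fact table: one row per sale, holding foreign keys and a numeric measure.
sales_fact = [
    {"product_id": 1, "store_id": 10, "amount": 20},
    {"product_id": 2, "store_id": 10, "amount": 50},
    {"product_id": 3, "store_id": 11, "amount": 300},
    {"product_id": 1, "store_id": 11, "amount": 30},
]


def roll_up(facts, dimension, key, attribute):
    """Sum the `amount` measure grouped by one attribute of a dimension."""
    totals = defaultdict(int)
    for row in facts:
        group = dimension[row[key]][attribute]
        totals[group] += row["amount"]
    return dict(totals)


sales_by_category = roll_up(sales_fact, product_dim, "product_id", "category")
sales_by_city = roll_up(sales_fact, store_dim, "store_id", "city")
```

Rolling up by category groups the pen and notebook sales together, while rolling up by city gives a second view of the same facts, which is the multidimensional view the star schema is meant to support.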
Unit:5 APPLICATIONS OF DATA WAREHOUSE 11 hours
Developing a data warehouse: why and how to build a data warehouse – data warehouse
architectural strategies and organization issues – design consideration – data content – metadata –
distribution of data – tools for data warehousing – performance considerations – crucial
decisions in designing a data warehouse. Applications of data warehousing and data mining in
government: Introduction – national data warehouses – other areas for data warehousing and
data mining.
Unit:6 Contemporary Issues 2 hours
Expert lectures, online seminars – webinars
Total Lecture hours 60 hours
Text Books
1 Margaret H. Dunham, “Data Mining: Introductory and Advanced Topics”, Pearson Education, 2003.
2 C.S.R. Prabhu, “Data Warehousing Concepts, Techniques, Products and Applications”, PHI, Second Edition.
Reference Books
1 Arun K. Pujari, “Data Mining Techniques”, Universities Press (India) Pvt. Ltd., 2003.
2 Alex Berson, Stephen J. Smith, “Data Warehousing, Data Mining and OLAP”,
TMCH, 2001.
3 Jiawei Han & Micheline Kamber, Academic Press.
Related Online Contents [MOOC, SWAYAM, NPTEL, Websites etc.]
1 https://www.javatpoint.com/data-warehouse
2 https://nptel.ac.in/noc/courses/noc20/SEM1/noc20-cs12/
3 https://www.btechguru.com/training--it--database-management-systems--file-structures--introduction-to-data-warehousing-and-olap-2-video-lecture--12054--26--151.html