KARATINA UNIVERSITY
COURSE CODE: COM 428 COURSE NAME: DATA WAREHOUSING AND MINING
Aim of the course
This course is designed to equip the learners with an overview of introducing the learner to the concept of data
mining, Data warehousing process data pre-processing, data integration, on-line analytical processing (OLAP)
tools for the interactive analysis of multidimensional data and effective data mining techniques and their tools.
Objectives
By the end of the course, the student should be able to:
i) Design a data warehouse or data mart to present information needed by management in a form that is
usable for management clients.
ii) Implement a high-quality data warehouse or data mart.
iii) Effectively administer a corporate data resource in such a way that it will truly meet management’s
needs.
iv) Evaluate standards and new technologies to determine their potential impact on information resources.
Course Content
Introduction to Data Mining, Data Pre-processing, Data Warehouse and OLAP technology, Mining frequent
patterns, Classification & Prediction and Cluster analysis. Data mining definition, databases, machine learning,
algorithms, information retrieval, and statistics. Data warehousing process data pre-processing, data
integration, on-line analytical processing (OLAP) tools for the interactive analysis of multidimensional data,
effective data mining. data warehousing and data mining techniques and their tools.
WEEK CHAPTER / SUB METHODOLOGY COMPETENCIES
TOPIC TOPICS/CONTENTS
1 Introduction to Introduction to data ➢ Lectures Analytical skills
Data warehousing, definition of Class discussions
Warehousing terms and Concepts, benefits
of data warehousing,
operational vs informational
databases, characteristics of a
data warehouse, data
warehouse vs operational data
store. Data marts; Data
warehouse administration and
management; and Information
delivery system, The three-tier
data warehouse architecture
2 Data TYPES OF DATA, Meta data; ➢ Lectures ➢ Analytical skills
Access tools; ➢ Class discussions Creative thinking
Video demonstrations skills
3 Building a Considerations in building a ➢ Lectures ➢ Analytical skills
Data data warehouse: business ➢ Class discussions Problem solving skills
Warehouse considerations; design
considerations; technical
considerations; and
implementation
considerations,
4 Data Pre- Data Pre-processing– Class Presentations Presentation skills
processing
Data Integration and Video demonstrations Problem solving skills
Transformation, Data
Reduction, Data Mining
Primitives: Task-
Relevant Data, The Kind
of Knowledge to be
Mined, KDD
5 CAT ONE CAT ONE
➢ CAT ONE
➢ Assignment Team work
6 Building a DBMS schemas for decision ➢ Lectures ➢ Analytical skills
Data support: Star schema; ➢ Class discussions Creative thinking
Warehouse Snowflake schema; and Fact skills
constellation schema
7 Data OLAP overview, categories of Class discussions Presentation skills
Warehousing OLAP tools, OLAP guidelines,
and OLAP OLAP vs OLTP
8 Introduction to Data Mining overview, scope ➢ Lectures Analytical skills
Data Mining of data mining, tasks of data ➢ Class discussions
mining, architecture of data
mining
9 Data Mining Data mining process: state the ➢ Lectures ➢ Analytical skills
Process problem and formulate the ➢ Class discussions Problem solving skills
hypothesis; collect the data; Team work
pre-processing the data;
estimate the model; interpret
the model and draw
conclusion, Classifications of
data mining systems, major
issues in data mining,
knowledge discovery in
databases
10 CAT II ➢ CAT TWO
11 Data Data Mining Techniques and ➢ Lectures ➢ Analytical skills
Techniques Tools ➢ Class discussions ➢ Innovativeness
and Tools ➢ Demonstration Presentation skills
12 Mining Association rule mining, ➢ Lectures ➢ Creative thinking
frequent market basket analysis, ➢ Class discussions skills
patterns and frequent pattern mining, ➢ Innovativeness
web mining efficient frequent item-set
mining methods, approaches
for mining multilevel
association rule
13 Classification Classification and prediction ➢ Lectures ➢ Innovativeness
and Prediction overview, issues regarding ➢ Class discussions
classification and prediction, ➢ Demonstration
comparing classification and
prediction methods, detection
methods
14 EXAMS
15 EXMAS
Mode of Delivery
The course will be conducted using lectures, case studies and group presentations
Teaching Equipment/Materials
The course delivery requires audio-visual devices, computers with internet services, journals, newspapers,
chalk/pens, whiteboard markers, whiteboards/ blackboards, flip charts and learning centres
Assessment
Continuous Assessment Tests (CATs): 20%
Practical Based Assessment 10%
End of Semester Written Examinations: 70%
Course Textbook and Journal
i. Jiawei Han, Micheline Kamber, "Data Mining: Concepts and Techniques", Morgan Kaufmann
Publishers, 2002.
ii. Alex Berson,Stephen J. Smith, “Data Warehousing, Data Mining,& OLAP”, Tata Mcgraw- Hill, 2004.
Recommended Textbooks and Journal for Further Reading
i. Ralph Kimball, "The Data Warehouse Life Cycle Toolkit", John Wiley & Sons Inc., 1998.
ii. Sean Kelly, "Data Warehousing In Action", John Wiley & Sons Inc., 1997.
iii. Margaret H. Dunham, Data Mining – Introductory and Advanced Topics, Prentice-Hall, 2003.
LECTURER’S NAME: MR. ZABLON OKARI SIGNATURE……………………
APPROVED BY THE HOD…MS. VANCY KEBUT……………………………………….
RECEIVED BY THE CLASS REPRESENTATIVE………………………………………