Data Science
Course Title Data Science
Credit 2
Teaching per Week 3 hrs.
Minimum weeks per 15 (Including Class work, examination, preparation etc.)
Semester
Purpose of Course This course aims at introducing the students into the world of
Data Science. Understand the basic concepts of Data Science, Life
Cycle, Big Data, Advance Database, Data Warehouse
Course Objective Provide fundamental knowledge about Data Science in real world.
Pre-requisite Basic Knowledge of Database Management System.
Course Out come After successful completion of the course a student will be
Able to understand about need of Data Science
Able to understand about Big Data and Data Warehouse
Able to understand about Advance Database System
Differentiate Data Science and Data Analysis
Unit Content
1 Introduction to Data Science
What is Data Science
Need of data Science
Business Intelligence v/s Data Science
Component of data science
Data Science Life Cycle
Tools for Data Science
2 Introduction to Big Data
Classification of Data
Definition and Evolution of Big Data
Challenges of Big Data
Characteristics of Big Data
Big Data Applications
Big Data Architecture.
3 Introduction to HADOOP
Apache Hadoop
Hadoop Architecture & Hadoop Ecosystem
Hadoop ecosystem components
MapReduce
HDFS
YARN
4 Advance Database System
Types of Databases
Introduction of NoSQL
Need of NoSQL
Advantages of NoSQL
SQL vs NoSQL
Introduction to different type of NoSQL databases.
5 Data Analytics
What is Data analytics
Use of Data Analytics
Data Analytics Life Cycle
Types of analysis
Predictive
Descriptive
Prescriptive
Diagnostic.
Reference Books
1) Thomas Erl, Wajid Khattak, Paul Buhler, Big data Fundamentals Concepts, Driver &
Techniques, Pearson
2) Tom White, “HADOOP: The definitive Guide”, O Reilly
3) Dan Sullivan,"NoSQL for Mere Mortals", Pearson Education.
4) David Dietrich - Barry Hiller - “Data Science & Big Data Analytics” - EMC education services -
Wiley publications