Introduction to Data Science
Course Structure
(CS2004)
Dr. Kusum Kumari Bharti
Computer Science and Engineering Department,
PDPM-Indian Institute of Information Technology Design and
Manufacturing, Jabalpur
Dumna Airport Road - 482005
Email: kusum@[Link]
Content
Course Objective
Course Learning Outcome
Course Home Page
Lecture Plan
Evaluation Scheme
Self Learning Resources
Course Objective
To introduce the basics of data science and provide a foundation for
understanding its challenges and applications.
Components of Data Science (DS):
Data Analytics (FE, data wrangling, EDA)
Data Visualization (Matplotlib, seaborn, Tableau, Power BI)
Programming (R, Python, etc.)
Math (Statistics, Linear Algebra, Calculus)
Machine Learning (Regression, Classification)
Web Scraping
Course Learning Outcomes (CLOs)
On completion of this course, students will be able to:
Analyse the need for and usage of analytics and visualization techniques.
Manage, manipulate, cleanse and analyse data.
Implement various data manipulation approaches in Python.
Develop different types of data dashboards for data analysis using different tools.
Course Home Page (Canvas)
The course home page consists of:
Notifications related to exams, quizzes, assignments, project work, lab
evaluation, submission deadlines, etc.
Slides
Lab Experiments
Assignments and Projects
Books
Supplementary Resources
Lecture Plan
Lectures per week = 3 (1 hr)
Labs per week = 2 (2 hr)
Evaluation Scheme*
Quizzes : 20 (Quiz 1: 02.03.2023; Quiz 2: 04.10.2023)
Mid Term : 25 (18.09.2023 to 23.09.2023)
End Term : 35 (20.11.2023 to 25.11.2023)
Lab Exam : 20 (to be announced in the labs)
* Tentative
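The component weights above add up to 100 marks. As a minimal sketch, assuming the final score is a plain sum of the marks earned in each component (the slides do not state how components are combined), a total could be checked as follows. The function name, dictionary keys, and example marks are hypothetical and used only for illustration.

```python
# Maximum marks per component, taken from the (tentative) evaluation scheme.
WEIGHTS = {"quizzes": 20, "mid_term": 25, "end_term": 35, "lab_exam": 20}


def total_marks(scored):
    """Sum the marks earned per component.

    Assumption: the final score is a simple sum of component marks.
    Missing components count as 0; marks outside [0, maximum] are rejected.
    """
    total = 0.0
    for component, maximum in WEIGHTS.items():
        marks = scored.get(component, 0.0)
        if not 0 <= marks <= maximum:
            raise ValueError(f"{component}: {marks} is outside 0..{maximum}")
        total += marks
    return total


# Hypothetical marks for one student, for illustration only.
example = {"quizzes": 15, "mid_term": 18, "end_term": 28, "lab_exam": 16}
print(sum(WEIGHTS.values()))  # 100
print(total_marks(example))   # 77.0
```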
Lab plan
Total marks: 20
Tasks:
Lab assignment: 10 marks
Group-based project on real-life applications (group size 3): 10 marks
Course plan
Part I (Aug 2023): Introduction to Data Science, Data, Basic statistics, Intermediate statistics
Part II (Sep 2023): Intermediate statistics, Advanced statistics, Exploratory data analysis
Part III (Oct 2023): Feature Engineering, Database
Part IV (Nov 2023): Machine Learning
Lab plan
Part I (Aug 2023): Crash course on Python (basic syntax, advanced Python)
Part II (Sep 2023): Basics of Data Science (exploratory data analysis)
Part III (Oct 2023): Basics of Data Science (exploratory data analysis, databases)
Part IV (Nov 2023): Advances in Data Science (machine learning for Data Science: data collection and experimentation)
Self Learning Resources
Coursera
Codecademy
edX
Udemy
DataCamp
Udacity
MIT OpenCourseWare
Kaggle
Alison
Treehouse
FutureLearn
Harvard Extension School
Papers with Code ([Link])
Projects ([Link])
Data Science news ([Link] and-Data-Science)
Join and explore LinkedIn
Explore internships
Explore freelancing projects
Explore current trends in the job market
Connect with your seniors
TOP COMPETITIVE DATA SCIENCE PLATFORMS OTHER THAN KAGGLE
DrivenData hosts data science competitions for social good in areas like international development, health, education, research and conservation, and public services.
CrowdANALYTIX is a crowdsourced analytics platform that converts business challenges and problems into competitions.
InnoCentive mainly focuses on problems dealing with life sciences.
TunedIT Challenges is a platform for hosting data competitions for educational, scientific and business purposes.
Codalab enables researchers, developers, and data scientists to collaborate, with the goal of advancing research fields.
Zindi focuses on solving Africa’s most pressing problems.
Analytics Vidhya provides a community-based knowledge portal for Analytics and Data Science professionals. Analytics Vidhya also offers job opportunities to the top scorers.
Data scientists cooperate with each other and connect with businesses and governments globally to solve the hardest business problems across industries.
The data science challenge platform crowdAI hosts multiple open data science challenges each year.
The challenges are designed to encourage the brightest minds in data science to help solve real-world problems.
KDD Cup is the annual Data Mining and Knowledge Discovery competition.
ViZDoom is a Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.
Machine Learning Contests is a data science competition aggregator site.
Source: [Link] other-than-kaggle-2995e9dad93c
Finally
Learning by Doing