Lecture 1 Course Introduction
Topics
• What is data science?
• Course objectives and overview
• Example of a data science project
Activities
• Set up a Python programming environment
• Write and run a first Python program
© 2021 Datapot. All rights reserved.
Why?
Jobs?
Memes!
Course Introduction
• A long time ago (thousands of years), science was purely empirical, and people
counted stars
History
Khufu, Khafre, Menkaure
Constellation Orion
Pyramid (Antarctica)
History (cont)
• A long time ago (thousands of years), science was purely empirical, and people
counted stars or crops
History (cont)
• A long time ago (thousands of years), science was purely empirical, and people
counted stars or crops and used the data to create machines to
describe the phenomena
Stonehenge
Antikythera mechanism
History (cont)
• And then … data science: “Data Science is a multidisciplinary field that uses
scientific methods, processes, algorithms and systems to extract knowledge
and insights from structured and unstructured data.” – Wikipedia.
• Inter-disciplinary
• Data and task focused
• Adaptable to changes in the
environment and needs
The Potential of Data Science
• Business Analytics: getting insights from business data
• Disease Diagnosis: detecting malaria from blood smears
• Drug Discovery
• Agriculture
Get the Data
What would you do if you had all of the data?
The Data Science Process
Lecture 1: Course Introduction & Environment Setup
This course is designed to help you start your data science career
journey. After completing this course, you should be able to:
• Define basic concepts in programming for data analytics and scientific
computing.
• Use the Python programming language to import and read different types of data.
• Use the Python programming language and libraries (Pandas) to clean data.
• Use the Python programming language to analyze and visualize data.
• Use libraries (Scikit Learn) to build and evaluate basic machine learning models
and run inference on data.
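As a rough preview of the import, clean, and analyze steps listed above, here is a minimal sketch that uses only the Python standard library. The course itself uses Pandas and Scikit Learn for this kind of work; the sample data, column names, and function names below are made up for illustration.

```python
import csv
import io
import statistics

# Hypothetical sample data; the empty field stands in for a missing value.
RAW_CSV = """name,height_cm
An,170
Binh,
Chi,160
"""


def read_rows(text):
    """Import: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_rows(rows):
    """Clean: drop rows with a missing height and convert the rest to float."""
    cleaned = []
    for row in rows:
        if row["height_cm"].strip():
            cleaned.append(
                {"name": row["name"], "height_cm": float(row["height_cm"])}
            )
    return cleaned


def mean_height(rows):
    """Analyze: compute the average height over the cleaned rows."""
    return statistics.mean(row["height_cm"] for row in rows)


rows = clean_rows(read_rows(RAW_CSV))
print(len(rows), "usable rows, mean height:", mean_height(rows))
```

Each function maps to one objective, so the same three-stage shape should carry over when the standard-library calls are replaced with the libraries taught in later lectures.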
Course outline
Course prerequisites
• Preferred knowledge
• Basic knowledge of algebra and calculus
• Other requirements
• You must have a computer to do the coding work
Course Introduction
- You will get the full 10% if you attend all classes.
- You will not receive a grade for the course if you miss 20% of the classes (3 classes).

- You will need to finish all exercises & homework for each module.
- All homework assignments are weighted equally.
- Homework is due at the beginning of the next lecture.

- You will work on course projects in groups of 3-4 students.
- You can choose one of the predefined projects or propose your own project.
Course Projects
Tools for the course
• JupyterHub
• Canvas
• Remove an environment
conda remove --name fdc104 --all
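import sys

def say_hello(name):  # body assumed; the original definition is not shown here
    print(f"Hello, {name}!")

if __name__ == "__main__":
    user = sys.argv[1]  # first command-line argument
    say_hello(user)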
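The fragment above reads sys.argv[1] directly, which raises an IndexError when the script is run without a name. A slightly more defensive version might look like the sketch below. The file name hello.py, the helper build_greeting, and the greeting text are assumptions for illustration, since the body of say_hello is not shown in the slides.

```python
import sys


def build_greeting(name):
    """Return the greeting text for the given name (wording is assumed)."""
    return f"Hello, {name}!"


def say_hello(name):
    """Print a greeting for the given name."""
    print(build_greeting(name))


def main(argv):
    # argv[0] is the script name; argv[1] is the first command-line argument.
    if len(argv) < 2:
        print("Usage: python hello.py <name>")
        return
    say_hello(argv[1])


if __name__ == "__main__":
    main(sys.argv)
```

Run from a terminal as `python hello.py Ada` to print the greeting; running it with no argument prints the usage line instead of a traceback.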
Wrap-up
© 2019 Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you