
For Students Copy Intro To Data Science

Uploaded by

zyrine paus
Copyright © All Rights Reserved

Course Description

Data Science is the study of the generalizable extraction of knowledge from data. Being a data
scientist or data analyst requires an integrated skill set spanning mathematics, statistics, machine
learning, databases and other branches of computer science along with a good understanding of the
craft of problem formulation to engineer effective solutions. This course will introduce students to this
rapidly growing field and equip them with some of its basic principles and tools as well as its general
mindset.
Students will learn concepts, techniques and tools they need to deal with various facets of data science
practice, including data collection and integration, exploratory data analysis, predictive modeling,
descriptive modeling, data product creation, evaluation, and effective communication.
The focus in the treatment of these topics will be on breadth, rather than depth, and emphasis will be
placed on integration and synthesis of concepts and their application to solving problems. To make the
learning contextual, real datasets from a variety of disciplines will be used.
Apply Data Science in a CASE STUDY. Explain the significance of Exploratory Data Analysis (EDA) in
data science.
COURSE OBJECTIVES
Upon successful completion of this course, students will have acquired the following
knowledge and skills:
1. Define and analyze the issues and challenges related to the data science domain
including issues related to machine learning, data mining and data visualization.
2. Identify, analyze and use data science strategies and tools to solve business
problems and improve business decision making.
3. Identify and examine ethical issues related to the data science domain,
particularly issues related to privacy, data sharing and algorithmic decision-
making.
Grading
Your final grade will be determined based on your performance on each of
the following items; the percentages in parentheses show the weight each
item carries toward the final grade.
• Class participation (10%)
• Assignments (30%)
• Project (30%)
• Final exam (30%)
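As a rough illustration of how these weights combine, the sketch below computes a weighted final grade in Python. The weights come from the list above; the 0-100 score scale, the function name `final_grade`, and the sample scores are assumptions made only for this example.

```python
# Weights are taken from the grading list; everything else is illustrative.
WEIGHTS = {
    "class_participation": 0.10,
    "assignments": 0.30,
    "project": 0.30,
    "final_exam": 0.30,
}


def final_grade(scores):
    """Return the weighted average of the component scores (assumed 0-100 each)."""
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing scores for: {sorted(missing)}")
    return sum(WEIGHTS[item] * scores[item] for item in WEIGHTS)


# Hypothetical scores for one student.
sample_scores = {
    "class_participation": 90,
    "assignments": 80,
    "project": 85,
    "final_exam": 70,
}

print(round(final_grade(sample_scores), 2))
```

Each component contributes its score multiplied by its weight, so a weak result on a 30% item moves the final grade three times as much as the same result on the 10% item.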
INTRODUCTION TO DATA SCIENCE

• Turning DATA into INFORMATION
• ANALYZING DATA to get INSIGHTS
• Identifying TRENDS, PATTERNS, and CORRELATIONS
• CONTEXTUALIZING, APPLYING and UNDERSTANDING them
What does a DATA SCIENTIST do?

• Get and process data to convert it from its raw format to a cleaner format.
• Calculate and interpret statistical variables.
• Create visualizations and draw conclusions from the analysis.
• Suggest applications from the information and develop machine learning
The Components and Essentials of Data Science
STATISTICS
• Understanding the different types of data you can encounter.
• Understanding statistical terms.
  - Types of means.
  - Fluctuations in data.
• Splitting up, grouping, and segmenting data.
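A minimal sketch of these statistical ideas, using only Python's standard `statistics` module. It assumes "types of means" refers to the arithmetic, geometric and harmonic means, and that "fluctuations" can be summarized by the standard deviation; the slides do not say which measures are intended. All data values are made up for illustration.

```python
import statistics
from collections import defaultdict

# Hypothetical sample values.
values = [2, 4, 4, 8]

# Three common types of means.
arithmetic = statistics.mean(values)            # sum of values / number of values
geometric = statistics.geometric_mean(values)   # n-th root of the product
harmonic = statistics.harmonic_mean(values)     # n / sum of reciprocals

# One way to quantify fluctuation: the population standard deviation.
spread = statistics.pstdev(values)

# Splitting and grouping: collect values per group, then summarize each group.
records = [("A", 10), ("B", 20), ("A", 30), ("B", 40)]
groups = defaultdict(list)
for group, value in records:
    groups[group].append(value)
group_means = {group: statistics.mean(vals) for group, vals in groups.items()}

print(arithmetic, geometric, harmonic, spread)
print(group_means)
```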
The Components and Essentials of Data Science
DATA VISUALIZATION
• Why is data visualization a key skill for data scientists?
• Graphs for showing and comparing different numbers of variables.
  - One variable graphs
  - Two variable graphs
  - Three variable graphs
DESCRIPTIVE STATISTICS
(Data Visualization)
Primary Goal of Descriptive Statistics
Is to provide a clear and concise summary of the data,
enabling researchers or analysts to gain insights and
understand patterns, trends, and distributions within the
dataset.
Statistics
Is the brief overview section
included in a long report or
document. This part of the report
primarily focuses on the key topics
and most important data within it.
The Components and Essentials of Data Science

PROGRAMMING
• Why knowing how to program makes your life so much easier:
  - Ease of automation.
  - Being able to customize, explore, prototype and test.
• Essential packages to use in Python:
  - Pandas for data analysis (other types of programs).
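To make the automation point concrete, here is a small sketch of a pandas workflow: clean a raw table, then summarize it by group. The column names and values are hypothetical, and the cleaning steps shown are only one reasonable choice.

```python
import pandas as pd

# Hypothetical raw records: inconsistent case, stray spaces, one missing value.
raw = pd.DataFrame({
    "region": [" North", "south", "North ", "South", "East"],
    "sales": [100.0, 250.0, None, 150.0, 300.0],
})

# Clean: normalize the text labels and drop rows with no sales figure.
clean = raw.copy()
clean["region"] = clean["region"].str.strip().str.title()
clean = clean.dropna(subset=["sales"])

# Analyze: count and average sales per region.
summary = clean.groupby("region")["sales"].agg(["count", "mean"])
print(summary)
```

Once written, the same few lines can be rerun on a new file without repeating the manual steps.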
Statistical Types
3 TYPES OF DATA
1. Numerical (Quantitative Data)
2. Categorical (Qualitative Data)
3. Ordinal
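The three data types differ mainly in which operations make sense on them. The sketch below illustrates this with made-up values: arithmetic for numerical data, counting for categorical data, and ordering for ordinal data. The variable names and the `RATING_ORDER` mapping are assumptions for this example.

```python
from collections import Counter

# Hypothetical sample values, one list per data type.
ages = [18, 21, 19, 22]                            # numerical: arithmetic is meaningful
eye_colors = ["brown", "blue", "brown", "green"]   # categorical: labels with no order
ratings = ["low", "high", "medium", "low"]         # ordinal: labels with a defined order

# Numerical data can be averaged.
average_age = sum(ages) / len(ages)

# Categorical data can be counted, but not averaged or ranked.
color_counts = Counter(eye_colors)

# Ordinal data can be ranked once the order of the labels is stated explicitly.
RATING_ORDER = {"low": 0, "medium": 1, "high": 2}
sorted_ratings = sorted(ratings, key=RATING_ORDER.get)
highest_rating = max(ratings, key=RATING_ORDER.get)

print(average_age, color_counts.most_common(1), sorted_ratings, highest_rating)
```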
Statistical Types
- Regression
- Mean
- Standard Deviation
- Sample Size Determination
- Hypothesis Testing
- and more…

EXERCISES
Classify the following data values by their data type:

Mean
• The typical “average”.
• Sum of all your values divided by the number of values.
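The definition of the mean translates directly into code. This is a minimal sketch; the function name `mean` and the sample scores are chosen only for illustration.

```python
def mean(values):
    """Sum of all values divided by the number of values."""
    if not values:
        raise ValueError("mean of an empty list is undefined")
    return sum(values) / len(values)


# Hypothetical exam scores.
scores = [70, 80, 90]
print(mean(scores))  # 240 / 3 = 80.0
```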
PROS
Easy to understand.
Takes into account all the
data.
CONS
Data Science (BIG DATA)
Goals:
• Know where to look
• Terminology
• How to ask/answer the right questions
• The value of the data
• Opportunities
• Challenges
Questions?
Data (Identify the right data)
Data Science (BIG DATA)
Questions?
Data (Identify the right data)
• Collecting
• Curating
• Labelled or not
Team Experts
Analyze
Model (Machine Learning)
• supervised
• unsupervised
Visualize (Communication)
Reproducibility
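Whether the data is labelled decides which kind of model applies. The sketch below contrasts the two cases on the same made-up height measurements: a nearest-centroid classifier that uses the labels (supervised) and a two-cluster k-means that ignores them (unsupervised). Both algorithms, the function names and the data are illustrative assumptions, not methods named in the slides.

```python
from statistics import mean


def fit_centroids(values, labels):
    """Supervised: use the known labels to compute one centroid per class."""
    centroids = {}
    for label in set(labels):
        members = [v for v, l in zip(values, labels) if l == label]
        centroids[label] = mean(members)
    return centroids


def predict(centroids, x):
    """Assign x to the class whose centroid is closest."""
    return min(centroids, key=lambda label: abs(centroids[label] - x))


def two_means(values, iterations=10):
    """Unsupervised: split unlabelled 1-D values into two clusters (k-means, k=2)."""
    low, high = min(values), max(values)
    for _ in range(iterations):
        group_low = [v for v in values if abs(v - low) <= abs(v - high)]
        group_high = [v for v in values if abs(v - low) > abs(v - high)]
        if not group_low or not group_high:
            break  # all values fell into one cluster; nothing left to update
        low, high = mean(group_low), mean(group_high)
    return low, high


# Hypothetical heights in centimetres.
heights = [150, 152, 155, 180, 183, 185]
labels = ["short", "short", "short", "tall", "tall", "tall"]

centroids = fit_centroids(heights, labels)   # needs labels
print(predict(centroids, 160))

cluster_centers = two_means(heights)         # needs no labels
print(cluster_centers)
```

The supervised model can name its prediction because the labels were given; the unsupervised one only finds two groups and leaves their interpretation to the analyst.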
Data Science (BIG DATA)
Questions?
• Does the past represent the future?
• What do you want to model?
• How will the model be used?
• What data do you need? Or do you have?
• How hard is it to get the data?
Team Experts
Data (Identify the right data)
• Labelled or not
  - supervised
  - unsupervised
Reproducibility
Inputs /
A Systems View of Project Management

A systems approach emerged in the 1950s to describe a more analytical
approach to management and problem solving.
Three parts include:
• Systems philosophy: an overall model for thinking about things as
systems.
• Systems analysis: a problem-solving approach.
• Systems management: addressing business, technological and
organizational issues before making changes to systems.
IT Project Management with Accountancy
IT project management is the process of managing, planning, and developing information
technology projects. Project managers can use software to move through the five
phases of the IT project management life cycle and accomplish complex tasks more
effectively. They also need to be adaptable and resourceful leaders.

An IT project manager is a professional charged with overseeing the process of
planning, executing and delegating responsibilities around an organization's
information technology (IT) pursuits and goals.
What does an IT Project Manager Do?
• Ensure the product functions
• Assign tasks to team members
• Track progress and performance
• Lead Agile meetings with stakeholders
Unit Overview MIND MAP

Project Management
• Techniques for Managing a Project
  - Communication Skills
  - Consequences for Team Failure: financial loss, employment loss, missed opportunities
  - Project Management Tools: Gantt charts, scheduling tasks, journals/diaries,
    funding mgt plan, communication mgt plan
  - Social & Ethical Issues
• Understanding the Problem
  - Interviewing / Survey Users and Participants
  - Analyzing the existing system by determining how it works, what it does, who uses it
  - Identify Problems with existing System
  - Requirement Report
  - Requirement Prototype
• Planning
  - Feasibility Study: economic feasibility, technical feasibility, operational
    feasibility, scheduling
  - Choosing Best Solution
  - Requirement Report. Details: time frame, subprojects, participants,
    information tech, data/information, needs of users
  - Development Approaches: traditional, outsourcing, prototyping, customization,
    participant dev't, Agile methods
• Designing
  - Clarifying new System Benefits
  - Participant Development/Role
  - Refining Prototypes
  - Tools used in Designing: context diagrams, DFD, decision trees, decision tables
• Implementation
  - Acquiring IT: Hardware/Software
  - Implementation Plan: participating training, testing, conversion of data
    for the new system
  - Operation Manual for Participants
  - Conversion Methods: parallel conversion, direct conversion, phased conversion,
    pilot conversion
• Testing, Evaluating and Maintaining
  - Test Data: volume data, simulated data, live data
  - Original system requirements have been achieved?
  - Trial with Operation Manual
  - Effect on users
  - Modification where problems identified
Project Management Challenges
Organizational Project Management (OPM)

Project Program Portfolio

A portfolio is a collection of programs and projects that support a specific business goal.
Portfolio management is the process of managing investments with the help of the right
tools and strategy to generate optimum returns while minimizing risk within a given time
horizon.
A portfolio manager is a person who understands a client’s investment needs and
suggests a suitable investment mix to meet the client’s investment objectives while
maintaining the risk-return balance.
The Objectives:
• Capital Appreciation • Asset Allocation
• Investment Goals • Liquidity
• Portfolio Efficiency • Diversification
• Risk Mitigation • Tax Planning
Process of Portfolio Management

1. Investment Objective
2. Client Profiling
3. Estimating Market
4. Asset Allocation
5. Portfolio Strategy Formulation
6. Portfolio Selection
7. Portfolio Implementation
8. Evaluation & Review
9. Portfolio Rebalancing
IT program management is primarily implemented in government and large
enterprises having a number of IT projects in development. The key objective behind
all such projects is to reach a bigger goal.

For example, for an IT health care program, IT program management may include
developing one or more key health care applications, upgrading network infrastructure,
implementing cloud computing technologies and similar projects. IT program
management is generally overseen by an IT program manager with multiple IT project
managers working under them.
Comparison between Project, Program and Portfolio
Comparison Factor | Projects | Programs | Portfolios
Scope | Narrow Scope | Wide Scope | Business Scope
Change | Minimum | Expect change & even embrace it | Continually Monitor Changes
Planning | Detailed Planning | High Level Planning | Maintain necessary process & communication
Monitoring | Monitor & Control Task and Work | Monitor Projects & Ongoing Work | Monitor Aggregate Performance & Value Indicators
Role | Team Players | Leaders | Leaders
Management | Manage the Team Members | Manage Project Managers | May Manage with Portfolio Management Staff
Success | Budget, Schedule & Product Delivered | ROI, New Capabilities & Benefit Delivered | Aggregate Performance of Portfolio Components
Instructions:
1. Group yourselves into groups of 3 members each.
2. Please see the format.
3. Present to the class any topic about Data Mining, Machine Learning, or Data
Visualization.
SEE EXAMPLES OF CASE STUDY
Thank you!
