0% found this document useful (0 votes)
14 views71 pages

Data Analysis Intro Session

The document outlines an introduction to data analysis, covering its definition, process, and importance in business decision-making. It includes a detailed breakdown of the data analysis process, types, and common responsibilities of data analysts, as well as essential skills and tools required for the role. Additionally, it discusses diploma goals for aspiring data analysts, including project work, technical skills, and personal branding.

Uploaded by

elhady59
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views71 pages

Data Analysis Intro Session

The document outlines an introduction to data analysis, covering its definition, process, and importance in business decision-making. It includes a detailed breakdown of the data analysis process, types, and common responsibilities of data analysts, as well as essential skills and tools required for the role. Additionally, it discusses diploma goals for aspiring data analysts, including project work, technical skills, and personal branding.

Uploaded by

elhady59
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 71

Data Analysis

Welcome All
About yourself?
Introduce yourself(Name-Age-Department-Experience)
Why data analysis
Future
About Instructor
Data Analyst @ AlliedConslulting.
Data Analyst @ Joeex.
Data Analysis Instructor @Machinfy.
Data Analysis and AI Instructor @Tech Mind.
Data Scientists and Analyst @Mindset.
Data Scientists and Analyst @‫رواد الحضارة‬.
Top 100 Arabic content creators in programming for 2021-2022.
Programming Instructor.
Founder and Team Leader @EGI Team.
Data Science leader at Google Developer.
Data Science Mentor @CIS & EGI.
Former AI intern at NTI.
Former Data Analyst intern at FWD.
Session one topics
Discussion
What is data analysis?
Data Analysis Process?
Data Analysis types?
Skills & tools for the data analyst.
Our roadmap.
Dashboards
Data Analyst Job Requirements
Task.
Diploma Goals
Do projects in different business domains.
Strong technical information and recommendations for business.
Build a data analyst mindset.
Presentation skills and documentation skills.
How to search alone
Make a good CV & Portfolio
how to brand yourself.
How to be a data analyst freelancer?
Data Analysis
Data Analysis
Data analysis is the process of cleaning, changing, and processing raw
data and extracting actionable, relevant information that helps
businesses make informed decisions.

The procedure helps reducethe risks inherent


in decision-making by providing useful insights
and statistics, often presented in charts,
images, tables, and graphs.
Data Analysis
Data Analysis Process?

1. Data Requirement Gathering.


2. Data Collection
3. Data Cleaning
4. Data Analysis
5. Data Interpretation
6. Data Visualization
Data Requirement Gathering.
Data Cleaning
EDA
Data Analysis Process?
Let's consider a case study where a retail company wants to analyze
sales data to identify factors influencing customer purchasing
behavior.
Data Analysis Process?
Let's consider a case study where a retail company wants to analyze
sales data to identify factors influencing customer purchasing
behavior.
Define the Problem
Define the Problem: The retail company aims to understand
customer purchasing behavior and identify factors that impact sales.
Gather Data
Gather Data: Collect relevant sales data from various sources, such
as transaction records, customer demographics, product attributes,
pricing data, and promotional activities.
Exploratory Data Analysis (EDA)
Exploratory Data Analysis (EDA): Perform EDA to explore the sales data.
Calculate basic statistics like total sales, average purchase amount, and
analyze trends over time. Generate visualizations such as line plots or bar
charts to identify patterns, seasonality, and correlations between
variables.
Interpretation of Results
Interpretation of Results: Analyze the output from the data analysis.
Identify significant factors influencing customer purchasing behavior,
such as price sensitivity, the impact of promotions, or the popularity of
specific products. Interpret the findings to gain insights into customer
preferences and behavior.
Data Analysis Process?
Data Analysis Process?
Data Analysis Types?
Data Analysts: Common Responsibilities

Report production
Pattern recognition
Data collection and administration
Inter and cross-departmental collaborations
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
DE vs DS vs DA
Data Analyst VS AI
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
EXCEL BASICS

Chatgpt - Linkedin - MS365.


Uses of Excel Tables
1. Easy Formatting
2. Filter & Sort Buttons
3. Auto-Freeze Headers
4. Single-Click Selections

Benefits of 5. Easy Rearrangement


6. Easy Growth of Data
Using Excel 7. Special Printing Option

Tables 8. Automatically Copy formulas


9. Special Total Row
10. Insert Slicer
11. Dynamic Named Range ( Best with Pivot
Tables)
12. Structured References
Proper Data Sets
Acc,Jan,Feb,Mar Acc Jan-18 Feb-18 Mar-18
1007,1359.89,9084.37,3067.48 1007 1,360 9,084 3,067
1007,7704.9,1835.01,8984.49 1007 7,705 1,835 8,984
1007,3236.85,9965.21,5068.39 1007 3,237 9,965 5,068
1007,4964.71,5691,9912.81 1007 4,965 5,691 9,913
1007,1813.13,8390.1,2811.06 1007 1,813 8,390 2,811
1008,8384.85,2688.01,2330.52 1008 8,385 2,688 2,331
1008,4536.33,4987.61,2170.42 1008 4,536 4,988 2,170
1008,6064.04,9574.15,7256.14 1008 6,064 9,574 7,256

Raw 1008,5450.77,6068.59,3026.44
1008,5834.98,8664.09,5118.04
1009,6837.75,9304.22,2248.21
1008
1008
1009
5,451
5,835
6,838
6,069
8,664
9,304
3,026
5,118
2,248

Data
1009,2774.84,6321.8,1258.83 1009 2,775 6,322 1,259
1009,4335,1405.15,9760.86 1009 4,335 1,405 9,761

Account Jan Feb Mar


1007 19,079 34,966 29,844
1008 30,271 31,982 19,902
1009 13,948 17,031 13,268
Grand Total 63,298 83,979 63,014
HFM Acc Jan-18 Feb-18
1007 1,360 9,084 3,067 6,134.9
1007 7,705 1,835 8,984 6
1007 3,237 9,965 5,068 748.71
1007 4,965 5,691 9,913

1007 1,813 8,390 2,811


1008 8,385 2,688 2,331
Subtotal 32,175 -37
1008 4,536 4,988 2,170 %

Improper 1008

1008
6,064

5,451
9,574

6,069
Mar
7,256

3,026

Data Set 1008


1009
1009
5,835
6,838
2,775
8,664
9,304
6,322
5,118
2,248
1,259
1009 4,335 1,405 9,761

1. Missing headers 1. Adjacent calculation


2.Blank rows 2.Inconsistent data types
3. Subtotals in between
rows
Column
or Field

Acc Jan-18 Feb-18 Mar-18 Header in First raw


1007 1,360 9,084 3,067
1007 7,705 1,835 8,984
1007 3,237 9,965 5,068 Row / Record
1007 4,965 5,691 9,913
1007 1,813 8,390 2,811
1008 8,385 2,688 2,331
1008 4,536 4,988 2,170
1008 6,064 9,574 7,256

Proper 1008
1008
1009
5,451
5,835
6,838
6,069
8,664
9,304
3,026
5,118
2,248

Data Set
1009 2,775 6,322 1,259
1009 4,335 1,405 9,761

1. No blanks, 1. Header = Column name =


2. No totals or subtotals Field Name
3. No adjacent cells 2. Row = Record = one data
4. Complete headers point for each field
5. Consistent data types 3. Each cell contains one value
Text Date Number

Acc Date Amount


1007 01-01-18 1,360
1008 01-01-18 8,385
1007 01-02-18 9,084

Data Types
1008 01-02-18 2,688
1007 01-03-18 3,067
1008 01-03-18 2,331

1. Text: (Text, String)


2. Date: (Date, Time, Duration)
3. Number: (Whole Number, Decimal Number, Integer, Double,
Currency, Percentage, etc..)
1. The dates in Excel are stored as numbers, and then formatted
to display the date.

2. The dates are referred to as serial numbers in Excel. The date


Date & calendar in Excel starts on January 1st, 1900. As far as Excel is
concerned this day starts the beginning of time.

Time 1. One hour in Excel is represented by the number: 1/24 = 0.04167

In Excel 2. One minute in Excel is represented by the number:


1/(24*60) =
1/1440 = 0.000694

5. So 8:30 AM can be calculated as: (8 * (1/24)) + (30 *


(1/1440))
= .354167
The big thing to be aware of here is that data types and formats are
not even close to the same thing:

Formats: Control how a number is displayed, without

Data Type affecting


the underlying precision in any way.
Data Types: Control the type of data, and will change the
precision of the value to become consistent with the type of data
vs Formats you have declared.

This is obviously a very important distinction that you should be aware


of. Setting a data type can (and often does) change the underlying
value in some way, while formatting never does.
Data Sets Demo

You might also like