Data Analysis
Welcome All
About yourself?
Introduce yourself(Name-Age-Department-Experience)
Why data analysis
Future
About Instructor
Data Analyst @ AlliedConslulting.
Data Analyst @ Joeex.
Data Analysis Instructor @Machinfy.
Data Analysis and AI Instructor @Tech Mind.
Data Scientists and Analyst @Mindset.
Data Scientists and Analyst @رواد الحضارة.
Top 100 Arabic content creators in programming for 2021-2022.
Programming Instructor.
Founder and Team Leader @EGI Team.
Data Science leader at Google Developer.
Data Science Mentor @CIS & EGI.
Former AI intern at NTI.
Former Data Analyst intern at FWD.
Session one topics
Discussion
What is data analysis?
Data Analysis Process?
Data Analysis types?
Skills & tools for the data analyst.
Our roadmap.
Dashboards
Data Analyst Job Requirements
Task.
Diploma Goals
Do projects in different business domains.
Strong technical information and recommendations for business.
Build a data analyst mindset.
Presentation skills and documentation skills.
How to search alone
Make a good CV & Portfolio
how to brand yourself.
How to be a data analyst freelancer?
Data Analysis
Data Analysis
Data analysis is the process of cleaning, changing, and processing raw
data and extracting actionable, relevant information that helps
businesses make informed decisions.
The procedure helps reducethe risks inherent
in decision-making by providing useful insights
and statistics, often presented in charts,
images, tables, and graphs.
Data Analysis
Data Analysis Process?
1. Data Requirement Gathering.
2. Data Collection
3. Data Cleaning
4. Data Analysis
5. Data Interpretation
6. Data Visualization
Data Requirement Gathering.
Data Cleaning
EDA
Data Analysis Process?
Let's consider a case study where a retail company wants to analyze
sales data to identify factors influencing customer purchasing
behavior.
Data Analysis Process?
Let's consider a case study where a retail company wants to analyze
sales data to identify factors influencing customer purchasing
behavior.
Define the Problem
Define the Problem: The retail company aims to understand
customer purchasing behavior and identify factors that impact sales.
Gather Data
Gather Data: Collect relevant sales data from various sources, such
as transaction records, customer demographics, product attributes,
pricing data, and promotional activities.
Exploratory Data Analysis (EDA)
Exploratory Data Analysis (EDA): Perform EDA to explore the sales data.
Calculate basic statistics like total sales, average purchase amount, and
analyze trends over time. Generate visualizations such as line plots or bar
charts to identify patterns, seasonality, and correlations between
variables.
Interpretation of Results
Interpretation of Results: Analyze the output from the data analysis.
Identify significant factors influencing customer purchasing behavior,
such as price sensitivity, the impact of promotions, or the popularity of
specific products. Interpret the findings to gain insights into customer
preferences and behavior.
Data Analysis Process?
Data Analysis Process?
Data Analysis Types?
Data Analysts: Common Responsibilities
Report production
Pattern recognition
Data collection and administration
Inter and cross-departmental collaborations
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
Data Analysts: Common Responsibilities
DE vs DS vs DA
Data Analyst VS AI
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
Data Analyst Job Requirements
EXCEL BASICS
Chatgpt - Linkedin - MS365.
Uses of Excel Tables
1. Easy Formatting
2. Filter & Sort Buttons
3. Auto-Freeze Headers
4. Single-Click Selections
Benefits of 5. Easy Rearrangement
6. Easy Growth of Data
Using Excel 7. Special Printing Option
Tables 8. Automatically Copy formulas
9. Special Total Row
10. Insert Slicer
11. Dynamic Named Range ( Best with Pivot
Tables)
12. Structured References
Proper Data Sets
Acc,Jan,Feb,Mar Acc Jan-18 Feb-18 Mar-18
1007,1359.89,9084.37,3067.48 1007 1,360 9,084 3,067
1007,7704.9,1835.01,8984.49 1007 7,705 1,835 8,984
1007,3236.85,9965.21,5068.39 1007 3,237 9,965 5,068
1007,4964.71,5691,9912.81 1007 4,965 5,691 9,913
1007,1813.13,8390.1,2811.06 1007 1,813 8,390 2,811
1008,8384.85,2688.01,2330.52 1008 8,385 2,688 2,331
1008,4536.33,4987.61,2170.42 1008 4,536 4,988 2,170
1008,6064.04,9574.15,7256.14 1008 6,064 9,574 7,256
Raw 1008,5450.77,6068.59,3026.44
1008,5834.98,8664.09,5118.04
1009,6837.75,9304.22,2248.21
1008
1008
1009
5,451
5,835
6,838
6,069
8,664
9,304
3,026
5,118
2,248
Data
1009,2774.84,6321.8,1258.83 1009 2,775 6,322 1,259
1009,4335,1405.15,9760.86 1009 4,335 1,405 9,761
Account Jan Feb Mar
1007 19,079 34,966 29,844
1008 30,271 31,982 19,902
1009 13,948 17,031 13,268
Grand Total 63,298 83,979 63,014
HFM Acc Jan-18 Feb-18
1007 1,360 9,084 3,067 6,134.9
1007 7,705 1,835 8,984 6
1007 3,237 9,965 5,068 748.71
1007 4,965 5,691 9,913
1007 1,813 8,390 2,811
1008 8,385 2,688 2,331
Subtotal 32,175 -37
1008 4,536 4,988 2,170 %
Improper 1008
1008
6,064
5,451
9,574
6,069
Mar
7,256
3,026
Data Set 1008
1009
1009
5,835
6,838
2,775
8,664
9,304
6,322
5,118
2,248
1,259
1009 4,335 1,405 9,761
1. Missing headers 1. Adjacent calculation
2.Blank rows 2.Inconsistent data types
3. Subtotals in between
rows
Column
or Field
Acc Jan-18 Feb-18 Mar-18 Header in First raw
1007 1,360 9,084 3,067
1007 7,705 1,835 8,984
1007 3,237 9,965 5,068 Row / Record
1007 4,965 5,691 9,913
1007 1,813 8,390 2,811
1008 8,385 2,688 2,331
1008 4,536 4,988 2,170
1008 6,064 9,574 7,256
Proper 1008
1008
1009
5,451
5,835
6,838
6,069
8,664
9,304
3,026
5,118
2,248
Data Set
1009 2,775 6,322 1,259
1009 4,335 1,405 9,761
1. No blanks, 1. Header = Column name =
2. No totals or subtotals Field Name
3. No adjacent cells 2. Row = Record = one data
4. Complete headers point for each field
5. Consistent data types 3. Each cell contains one value
Text Date Number
Acc Date Amount
1007 01-01-18 1,360
1008 01-01-18 8,385
1007 01-02-18 9,084
Data Types
1008 01-02-18 2,688
1007 01-03-18 3,067
1008 01-03-18 2,331
1. Text: (Text, String)
2. Date: (Date, Time, Duration)
3. Number: (Whole Number, Decimal Number, Integer, Double,
Currency, Percentage, etc..)
1. The dates in Excel are stored as numbers, and then formatted
to display the date.
2. The dates are referred to as serial numbers in Excel. The date
Date & calendar in Excel starts on January 1st, 1900. As far as Excel is
concerned this day starts the beginning of time.
Time 1. One hour in Excel is represented by the number: 1/24 = 0.04167
In Excel 2. One minute in Excel is represented by the number:
1/(24*60) =
1/1440 = 0.000694
5. So 8:30 AM can be calculated as: (8 * (1/24)) + (30 *
(1/1440))
= .354167
The big thing to be aware of here is that data types and formats are
not even close to the same thing:
Formats: Control how a number is displayed, without
Data Type affecting
the underlying precision in any way.
Data Types: Control the type of data, and will change the
precision of the value to become consistent with the type of data
vs Formats you have declared.
This is obviously a very important distinction that you should be aware
of. Setting a data type can (and often does) change the underlying
value in some way, while formatting never does.
Data Sets Demo