0% found this document useful (0 votes)

21 views9 pages

EDA - Reviewer Midterm

Uploaded by

s.bakansa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views9 pages

EDA - Reviewer Midterm

Uploaded by

s.bakansa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Introduction to Business Statistics (Chapter 1)  Descriptive Statistics: Summarizes and describes the

main features of a data set (e.g., mean, median).

1.1 Data
 Statistical Inference: Uses sample data to make
 Data: Facts and figures used to draw conclusions.
conclusions about the population.
 Data Set: Collection of data for a specific study.
1.4 Case Studies on Sampling and Statistical Inference
 Elements: The entities (people, objects, events)
 Cell Phone Case: Estimating cell phone costs using
being studied.
sample data.
 Variable: A characteristic of an element that can be
 Marketing Research Case: Rating a new bottle design
measured.
based on consumer feedback.
o Quantitative Variable: Numerical values
 Car Mileage Case: Estimating average car mileage
representing quantities (e.g., age, income).
using sample data.
o Qualitative Variable: Categorical values (e.g.,
Importance of Random Sampling:
gender, color).
 Ensures that the sample is representative of the
Types of Data:
population, reducing bias.
 Cross-Sectional Data: Collected at the same point in
1.5 Scales of Measurement (Optional)
time.
 Nominal Scale: Categories with no order (e.g.,
 Time Series Data: Collected over different time
gender, colors).
periods (e.g., monthly sales data).
 Ordinal Scale: Categories with a specific order (e.g.,
1.2 Data Sources
rankings, satisfaction levels).
 Existing Sources: Data already collected by others
 Interval Scale: Numerical values with equal intervals
(e.g., government reports, libraries, internet).
but no true zero (e.g., temperature in Celsius).
 Experimental Studies: Data collected by
 Ratio Scale: Numerical values with equal intervals
manipulating independent variables to observe
and a true zero (e.g., weight, income).
effects on a response variable.
Key Takeaways:
 Observational Studies: Data collected without
manipulating variables (e.g., surveys).  Data is the foundation of statistical analysis, and
understanding variables and data types is crucial.
Steps in Initiating a Study:
 Data Sources can be existing or collected through
1. Define the response variable (variable of interest).
experimental/observational studies.
2. Identify independent variables (related factors).
 Populations and Samples help generalize findings
3. Decide if the study is experimental (manipulate from a subset to the entire group.
variables) or observational (no manipulation).
 Random Sampling ensures unbiased and
1.3 Populations and Samples representative data collection.

 Population: The entire set of elements of interest.  Scales of Measurement help classify data for
appropriate analysis.
 Census: Data collected from every element in the
population. Descriptive Statistics - Tabular and Graphical Methods

 Sample: A subset of the population used to draw 1. Summarizing Qualitative Data

conclusions about the entire population.
 Frequency Distribution: A table that summarizes the
Descriptive Statistics vs. Statistical Inference: number (frequency) of items in each category.
o Relative Frequency: Proportion of items in  Interpretation: Clusters, gaps, and outliers can be
each class (frequency ÷ total observations). easily identified.

o Percent Frequency: Relative frequency 4. Stem-and-Leaf Displays

multiplied by 100.
 Definition: A graphical method that splits each data
 Graphical Methods: point into a "stem" (leading digit(s)) and a "leaf"
(trailing digit).
o Bar Charts: Represent frequencies of
categories using bars.  Use: Display the distribution of data while retaining
the original values.
o Pie Charts: Show proportions of categories
as slices of a pie.  Example: For the number 23, the stem is 2 and the
leaf is 3.
o Pareto Chart: A bar chart where categories
are ordered by frequency, highlighting the 5. Contingency Tables (Optional)
most significant categories.
 Definition: A table that classifies data based on two
2. Summarizing Quantitative Data dimensions (rows and columns).

 Frequency Distribution: Group quantitative data into  Use: Examine relationships between two categorical
classes (intervals) and count the number of variables.
observations in each class.
 Example: Rows could represent gender, and columns
o Steps: could represent product preferences.

1. Determine the number of classes. 6. Scatter Plots (Optional)

2. Calculate class length (range ÷  Definition: A graph that shows the relationship
number of classes). between two quantitative variables.

3. Form non-overlapping classes of o X-axis: Independent variable.

equal width.
o Y-axis: Dependent variable.
4. Tally and count observations in each
 Types of Relationships:
class.
o Linear: Data points form a straight line.
5. Graph the histogram.
 Positive: As one variable increases,
 Graphical Methods:
the other increases.
o Histogram: A bar chart for quantitative data,
 Negative: As one variable increases,
showing the distribution of data across
the other decreases.
classes.
o No Linear Relationship: No clear pattern
o Frequency Polygon: A line graph connecting
between variables.
the midpoints of the tops of the bars in a
histogram. 7. Misleading Graphs and Charts (Optional)

o Ogive: A line graph that shows cumulative  Common Issues:

frequencies.
o Scaling: Manipulating the axis scale to
3. Dot Plots exaggerate or minimize trends.

 Definition: A simple graphical display where each o Truncated Axes: Starting the axis at a value
data point is represented by a dot along a number other than zero to distort proportions.
line.
o 3D Effects: Using 3D visuals that can distort
 Use: Visualize the distribution of small data sets. the perception of data.
 How to Spot: Always check the axes, scales, and o Standard Deviation: The square root of the
context of the graph. variance. Measures the spread of data
around the mean.
Key Concepts
Empirical Rule:
 Frequency Distribution: Summarizes data by
counting occurrences in categories or classes.  For normal distributions:

 Bar Charts & Pie Charts: Used for qualitative data to o ~68% of data falls within ±1 standard
show frequencies or proportions. deviation of the mean.

 Histograms & Frequency Polygons: Used for o ~95% within ±2 standard deviations.
quantitative data to show distributions.
o ~99.7% within ±3 standard deviations.
 Dot Plots & Stem-and-Leaf Displays: Simple
Chebyshev’s Theorem:
graphical methods for small data sets.
 Applies to any distribution:
 Contingency Tables: Analyze relationships between
two categorical variables. o At least 75% of data falls within ±2 standard
 Scatter Plots: Visualize relationships between two deviations.
quantitative variables. o At least 89% within ±3 standard deviations.
 Misleading Graphs: Be cautious of graphs that z-Scores:
distort data through scaling or visual effects.
 Measures how many standard deviations a value (x)
Descriptive Statistics - Numerical Methods (Chapter 3) is from the mean.
3.1 Describing Central Tendency o Positive z-score: x is above the mean.
 Central Tendency: Represents the center or middle o Negative z-score: x is below the mean.
of a data set.
o z = 0: x is equal to the mean.
 Measures of Central Tendency:

o Mean (μ): The average value. Calculated as

the sum of all values divided by the number 3.3 Percentiles, Quartiles, and Box-and-Whiskers
of values. Displays

o Median (Md): The middle value when data is  Percentile: A value below which a given percentage
ordered. If there’s an even number of of data falls.
observations, it’s the average of the two o 1st Quartile (Q1): 25th percentile.
middle values.
o 2nd Quartile (Median): 50th percentile.
o Mode (Mo): The most frequently occurring
value in the data set. o 3rd Quartile (Q3): 75th percentile.

3.2 Measures of Variation  Interquartile Range (IQR): Q3 - Q1. Measures the

spread of the middle 50% of data.
 Variation: Describes how spread out the data is.
 Box-and-Whisker Plot: Visualizes the distribution of
 Measures of Variation: data using quartiles, median, and outliers.
o Range: The difference between the largest 3.4 Covariance, Correlation, and Least Squares Line
and smallest values. (Optional)
o Variance: The average of the squared  Covariance: Measures the relationship between two
deviations from the mean. variables (x and y).
o Positive Covariance: As x increases, y  Covariance and Correlation: Measure relationships
increases. between variables.

o Negative Covariance: As x increases, y  Weighted Mean and Geometric Mean: Useful for
decreases. specialized data analysis.

 Correlation Coefficient (r): Measures the strength

and direction of the linear relationship between two
Probability
variables.

o Ranges from -1 to 1.

o r = 1: Perfect positive correlation.

o r = -1: Perfect negative correlation.

o r = 0: No correlation.

 Least Squares Line: A line that minimizes the sum of

squared differences between observed and predicted
values (used in regression analysis).

3.5 Weighted Means and Grouped Data (Optional)

 Weighted Mean: Used when some data points are

more important than others. Calculated by
multiplying each value by its weight and dividing by
the sum of weights.

 Grouped Data: Data organized into intervals. Mean

and standard deviation can be estimated using
midpoint values and frequencies.

3.6 Geometric Mean (Optional)

 Geometric Mean: Used for rates of return or growth

rates.

o Calculated as the nth root of the product of

(1 + R₁) × (1 + R₂) × ... × (1 + Rₙ), where Rᵢ are
the rates of return.

o Useful for calculating average growth over

multiple periods.

Key Takeaways:

 Central Tendency: Mean, median, and mode

describe the center of data.

 Variation: Range, variance, and standard deviation

measure data spread.

 Percentiles and Quartiles: Help understand data

distribution and identify outliers.
Discrete Random Variable
Continuous Random Variable
Sampling and Sampling Distribution 5. Stratified Random, Cluster, and Systematic Sampling
(Optional)

 Stratified Random Sampling:

o Divide the population into non-overlapping

groups (strata) based on similarity.

o Randomly sample from each stratum.

o Combine the samples to form the full

sample.

o Use: When the population has distinct

subgroups (e.g., age, gender, income).

 Cluster Sampling:

o Divide the population into clusters (e.g.,

schools, neighborhoods).

o Randomly select entire clusters for

sampling.

o Use: When it is difficult to sample

individuals directly.

 Systematic Sampling:

o Select every kk-th element from a list after

a random start.

o Use: When the population is ordered in

some way.

6. Surveys and Errors in Survey Sampling (Optional)

 Types of Survey Questions:

o Dichotomous: Yes/No questions.

o Multiple Choice: List of options to choose

from.

o Open-Ended: Respondents answer in their

own words.

 Sources of Error:

o Sampling Error: Differences between the

sample and the population.

o Non-Sampling Error: Errors due to data

collection, processing, or respondent bias.

Key Concepts

 Random Sampling: Ensures every subset of the

population has an equal chance of being selected.
 Sampling Distribution: The distribution of a statistic
(e.g., mean, proportion) over all possible samples.

 Central Limit Theorem: The sampling distribution of

the mean is approximately normal for large nn.

 Stratified, Cluster, and Systematic Sampling:

Alternative sampling methods for specific scenarios.

 Survey Errors: Sampling and non-sampling errors

can affect the accuracy of survey results.

Confidence Intervals

EDA - Reviewer Midterm
No ratings yet
EDA - Reviewer Midterm
8 pages
Statistics - Material
No ratings yet
Statistics - Material
12 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
13 pages
Pointers To Review Statistics
No ratings yet
Pointers To Review Statistics
6 pages
Introduction to Data & Statistics
No ratings yet
Introduction to Data & Statistics
21 pages
Business Statstics Complete
No ratings yet
Business Statstics Complete
13 pages
Tutoring Session 2023 - Statistics For Business
No ratings yet
Tutoring Session 2023 - Statistics For Business
65 pages
Chapter 2 Descriptive Statistics
No ratings yet
Chapter 2 Descriptive Statistics
3 pages
Final SB: Chapter1: Overview of Statistics
No ratings yet
Final SB: Chapter1: Overview of Statistics
32 pages
Introduction To Statistics 2024-2025
No ratings yet
Introduction To Statistics 2024-2025
40 pages
Descriptive Statistics Guide
No ratings yet
Descriptive Statistics Guide
9 pages
Understanding Statistics Basics
No ratings yet
Understanding Statistics Basics
4 pages
Bustat Reviewer
No ratings yet
Bustat Reviewer
6 pages
Power BI
No ratings yet
Power BI
8 pages
Business Analytics
No ratings yet
Business Analytics
44 pages
Statistics
No ratings yet
Statistics
7 pages
Unit 2 - Merged
No ratings yet
Unit 2 - Merged
17 pages
Ôn tập lý thuyết - SB - chap 1-5
No ratings yet
Ôn tập lý thuyết - SB - chap 1-5
12 pages
Business Analytics Overview (MIS171)
No ratings yet
Business Analytics Overview (MIS171)
6 pages
Unit 6 - Data and Sampling Methods
No ratings yet
Unit 6 - Data and Sampling Methods
5 pages
Statistical Studies
No ratings yet
Statistical Studies
18 pages
Probability and Statistics
No ratings yet
Probability and Statistics
50 pages
Reasearch Methodology and Statistics
No ratings yet
Reasearch Methodology and Statistics
13 pages
MR Kinyera
No ratings yet
MR Kinyera
6 pages
BSC First Year Syllabus
100% (1)
BSC First Year Syllabus
6 pages
Stats Reviewer
No ratings yet
Stats Reviewer
5 pages
Stats Midterms Cheat Sheet
No ratings yet
Stats Midterms Cheat Sheet
3 pages
Quantitative Skills 1 Graphing
No ratings yet
Quantitative Skills 1 Graphing
40 pages
Statistics Referesher
No ratings yet
Statistics Referesher
30 pages
Statistics (Curso Completo)
No ratings yet
Statistics (Curso Completo)
9 pages
Creative and Minimal Portfolio Presentation
No ratings yet
Creative and Minimal Portfolio Presentation
5 pages
Statistics and Data Analysis Overview
No ratings yet
Statistics and Data Analysis Overview
5 pages
Chapter2-Statistical Analysis
No ratings yet
Chapter2-Statistical Analysis
86 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
45 pages
1.1 CS3352-FDS - Unit 1
No ratings yet
1.1 CS3352-FDS - Unit 1
42 pages
UNIT-I Study Guide - Introduction To Business Stati
No ratings yet
UNIT-I Study Guide - Introduction To Business Stati
6 pages
Introductory Statistics Class Notes
No ratings yet
Introductory Statistics Class Notes
3 pages
BRM Unit-1
No ratings yet
BRM Unit-1
25 pages
BasicStatistics I
No ratings yet
BasicStatistics I
90 pages
Iba Unit - Ii
No ratings yet
Iba Unit - Ii
31 pages
Inferential Statistics
No ratings yet
Inferential Statistics
92 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
4 pages
Sasa Reviewer P1, P4 at P5
No ratings yet
Sasa Reviewer P1, P4 at P5
10 pages
STAB22 Lecture's Notes
No ratings yet
STAB22 Lecture's Notes
64 pages
Introduction To Statistics
100% (3)
Introduction To Statistics
43 pages
Statistics in Research Processing and Data Analysis
No ratings yet
Statistics in Research Processing and Data Analysis
34 pages
MPC 006 2024-25 For SSC and All Educational Needs
No ratings yet
MPC 006 2024-25 For SSC and All Educational Needs
27 pages
Research Method Lecture Notes
No ratings yet
Research Method Lecture Notes
32 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
30 pages
Business Statistics for Decision Making
No ratings yet
Business Statistics for Decision Making
6 pages
Module 3 Data Analysis Techniques
No ratings yet
Module 3 Data Analysis Techniques
55 pages
Basic Statistical Data Descriptions
No ratings yet
Basic Statistical Data Descriptions
7 pages
BUSS1020
No ratings yet
BUSS1020
6 pages
ISDS 361A - Cheat Sheet Exam 1 PDF
No ratings yet
ISDS 361A - Cheat Sheet Exam 1 PDF
2 pages
Central Tendencies
No ratings yet
Central Tendencies
5 pages
Economics Stats Guide
No ratings yet
Economics Stats Guide
10 pages
Statisitcs
No ratings yet
Statisitcs
22 pages
Weekly Journal
No ratings yet
Weekly Journal
3 pages
Weekly Journal Template
No ratings yet
Weekly Journal Template
1 page
CEIT FilmSynopsis HindiNaAkoMagaling
No ratings yet
CEIT FilmSynopsis HindiNaAkoMagaling
1 page
CEIT FilmMakersBio HindiNaAkoMagaling
No ratings yet
CEIT FilmMakersBio HindiNaAkoMagaling
1 page
Pearson'S Product-Moment Correlation Coefficient: X First Variable y Other Variable
No ratings yet
Pearson'S Product-Moment Correlation Coefficient: X First Variable y Other Variable
3 pages
Chapter 03 - Descriptive Statistics: Numerical Measures: Page 1
100% (3)
Chapter 03 - Descriptive Statistics: Numerical Measures: Page 1
50 pages
Effect Size in Research Analysis
No ratings yet
Effect Size in Research Analysis
10 pages
Analysis of Multifactor Experiments: Corresponds To Chapter 13 of Tamhane and Dunlop
No ratings yet
Analysis of Multifactor Experiments: Corresponds To Chapter 13 of Tamhane and Dunlop
74 pages
Regression Analysis in Malayalam
No ratings yet
Regression Analysis in Malayalam
22 pages
BST Assignment Questions - 2025
No ratings yet
BST Assignment Questions - 2025
5 pages
Sample Final Exam Mathematics in The Modern World For Review
No ratings yet
Sample Final Exam Mathematics in The Modern World For Review
2 pages
AGB Unit
No ratings yet
AGB Unit
63 pages
Measures of Central Tendency & Variability in Student Performance Analysis
No ratings yet
Measures of Central Tendency & Variability in Student Performance Analysis
20 pages
Bma2202 Bbusiness Statistics 1 PDF
No ratings yet
Bma2202 Bbusiness Statistics 1 PDF
241 pages
Statistics Course Syllabus Overview
No ratings yet
Statistics Course Syllabus Overview
3 pages
Data Fun
No ratings yet
Data Fun
20 pages
Pham Khoa Vien GBS190915 GBS0903 Truong Ngoc Thinh
No ratings yet
Pham Khoa Vien GBS190915 GBS0903 Truong Ngoc Thinh
58 pages
Types of Mean in Statistics Explained
No ratings yet
Types of Mean in Statistics Explained
7 pages
Define Mean Square Error
No ratings yet
Define Mean Square Error
3 pages
SPSS Practical MS Word PDF
No ratings yet
SPSS Practical MS Word PDF
67 pages
A Caution Regarding Rules of Thumb For Variance in
No ratings yet
A Caution Regarding Rules of Thumb For Variance in
19 pages
Core Concepts in Clinical Research Data and Basic Statistics
No ratings yet
Core Concepts in Clinical Research Data and Basic Statistics
18 pages
Correlation and Regression
No ratings yet
Correlation and Regression
39 pages
Analysis of Variance (1 & 2 Way)
No ratings yet
Analysis of Variance (1 & 2 Way)
15 pages
Statistics Book PDF
No ratings yet
Statistics Book PDF
271 pages
Lean Six Sigma Green Belt Certification Training Manual CSSC 2018 06b (1) (201 250)
No ratings yet
Lean Six Sigma Green Belt Certification Training Manual CSSC 2018 06b (1) (201 250)
50 pages
Class 10 Maths Statistics Solutions
100% (1)
Class 10 Maths Statistics Solutions
23 pages
Teacher Achievement Performance: Exploring The Impact of Organization Culture, Achievement Motivation, and Job Satisfaction
No ratings yet
Teacher Achievement Performance: Exploring The Impact of Organization Culture, Achievement Motivation, and Job Satisfaction
15 pages
PSMCL Stock Price Data Analysis
No ratings yet
PSMCL Stock Price Data Analysis
21 pages
Business Analytic Shubham Jindal
No ratings yet
Business Analytic Shubham Jindal
11 pages
Iim Iprobability
No ratings yet
Iim Iprobability
43 pages
CH 01 - Data and Statistics: Page 1
100% (6)
CH 01 - Data and Statistics: Page 1
35 pages
Chapter 1 Quiz
No ratings yet
Chapter 1 Quiz
4 pages
Pooja Kabadi - Predictive Modelling Project
No ratings yet
Pooja Kabadi - Predictive Modelling Project
70 pages

EDA - Reviewer Midterm

Uploaded by

EDA - Reviewer Midterm

Uploaded by

Introduction to Business Statistics (Chapter 1)  Descriptive Statistics: Summarizes and describes the

main features of a data set (e.g., mean, median).

 Sample: A subset of the population used to draw 1. Summarizing Qualitative Data

o Percent Frequency: Relative frequency 4. Stem-and-Leaf Displays

1. Determine the number of classes. 6. Scatter Plots (Optional)

3. Form non-overlapping classes of o X-axis: Independent variable.

o Ogive: A line graph that shows cumulative  Common Issues:

o Mean (μ): The average value. Calculated as

3.2 Measures of Variation  Interquartile Range (IQR): Q3 - Q1. Measures the

 Correlation Coefficient (r): Measures the strength

o r = 1: Perfect positive correlation.

o r = -1: Perfect negative correlation.

 Least Squares Line: A line that minimizes the sum of

3.5 Weighted Means and Grouped Data (Optional)

 Weighted Mean: Used when some data points are

 Grouped Data: Data organized into intervals. Mean

3.6 Geometric Mean (Optional)

 Geometric Mean: Used for rates of return or growth

o Calculated as the nth root of the product of

o Useful for calculating average growth over

 Central Tendency: Mean, median, and mode

 Variation: Range, variance, and standard deviation

 Percentiles and Quartiles: Help understand data

 Stratified Random Sampling:

o Divide the population into non-overlapping

o Randomly sample from each stratum.

o Combine the samples to form the full

o Use: When the population has distinct

o Divide the population into clusters (e.g.,

o Randomly select entire clusters for

o Use: When it is difficult to sample

o Select every kk-th element from a list after

o Use: When the population is ordered in

6. Surveys and Errors in Survey Sampling (Optional)

 Types of Survey Questions:

o Dichotomous: Yes/No questions.

o Multiple Choice: List of options to choose

o Open-Ended: Respondents answer in their

o Sampling Error: Differences between the

o Non-Sampling Error: Errors due to data

 Random Sampling: Ensures every subset of the

 Central Limit Theorem: The sampling distribution of

 Stratified, Cluster, and Systematic Sampling:

 Survey Errors: Sampling and non-sampling errors

You might also like