Probability Distributions and Hypothesis Testing

The document discusses probability distributions and hypothesis testing, essential tools for data analysis and statistical modeling. It covers types of probability distributions (discrete and continuous), their parameters, and the process of hypothesis testing, including formulating null and alternative hypotheses, significance levels, and common tests. Practical examples illustrate the application of these concepts in real-world scenarios.


Probability Distributions and Hypothesis Testing

Probability distributions and hypothesis testing are fundamental tools in data analysis and statistical modeling. They allow us to understand the underlying patterns in data and make informed decisions based on evidence. Probability distributions describe the likelihood of different outcomes, while hypothesis testing provides a framework for evaluating claims about populations based on sample data. Mastering these concepts is crucial for drawing meaningful conclusions from data and building robust statistical models.

Probability Distributions
A probability distribution is a mathematical function that describes the likelihood of obtaining the possible values that a random variable can assume. In simpler terms, it's a way to visualize and understand the range of possible outcomes for a given event and how likely each outcome is.

Types of Probability Distributions

Probability distributions are broadly classified into two types: discrete and continuous.

Discrete Probability Distributions

Discrete probability distributions deal with random variables that can only take on a finite number of values or a countably infinite number of values. These values are typically integers.

● Bernoulli Distribution: Represents the probability of success or failure of a single trial. It's characterized by a single parameter, p, which represents the probability of success.
●​ Example: Flipping a coin once. The outcome is either heads (success) or tails
(failure).
●​ Real-world example: Whether a customer clicks on an advertisement
(success) or not (failure).
●​ Hypothetical scenario: A quality control inspector checks a single item to see
if it's defective. The item is either defective (success) or not defective (failure).
●​ Binomial Distribution: Represents the probability of obtaining a certain number of
successes in a fixed number of independent trials. It's characterized by two
parameters: n, the number of trials, and p, the probability of success on each trial.
●​ Example: Flipping a coin 10 times and counting the number of heads.
●​ Real-world example: The number of defective items in a batch of 100
products.
●​ Hypothetical scenario: A salesperson makes 20 sales calls and counts the
number of successful sales.
●​ Poisson Distribution: Represents the probability of a certain number of events
occurring in a fixed interval of time or space. It's characterized by a single parameter,
λ (lambda), which represents the average rate of events.
●​ Example: The number of customers arriving at a store in an hour.
●​ Real-world example: The number of emails received per day.
●​ Hypothetical scenario: The number of accidents at an intersection in a week.
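
The three discrete distributions above all have simple closed-form probability mass functions. As an illustration, they can be computed directly with the Python standard library (a minimal sketch; the helper names are our own, not from any particular library):

```python
import math

def bernoulli_pmf(k: int, p: float) -> float:
    """P(X = k) for a single trial with success probability p (k is 0 or 1)."""
    return p if k == 1 else 1 - p

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k): exactly k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k): exactly k events in an interval with average rate lam."""
    return math.exp(-lam) * lam**k / math.factorial(k)

# 10 coin flips: probability of exactly 5 heads
print(binomial_pmf(5, 10, 0.5))   # ≈ 0.2461
# An intersection averaging 3 accidents per week: probability of exactly 2
print(poisson_pmf(2, 3.0))        # ≈ 0.2240
```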

Continuous Probability Distributions

Continuous probability distributions deal with random variables that can take on any value

within a given range.

●​ Normal Distribution: Also known as the Gaussian distribution, it's one of the most
important distributions in statistics. It's characterized by two parameters: μ (mu), the
mean, and σ (sigma), the standard deviation. The normal distribution is symmetrical
and bell-shaped.
●​ Example: The height of adult humans.
●​ Real-world example: The distribution of test scores in a large class.
●​ Hypothetical scenario: The daily temperature in a city over a year.
●​ Exponential Distribution: Represents the time until an event occurs. It's characterized
by a single parameter, λ (lambda), which represents the rate of events.
●​ Example: The time until a machine fails.
●​ Real-world example: The time between customer arrivals at a call center.
●​ Hypothetical scenario: The lifespan of a light bulb.
●​ Uniform Distribution: Represents a situation where all values within a given range are
equally likely. It's characterized by two parameters: a, the minimum value, and b, the
maximum value.
●​ Example: A random number generator that produces numbers between 0 and
1 with equal probability.
●​ Real-world example: The waiting time for a bus that arrives every 15 minutes
(assuming you arrive at a random time).
●​ Hypothetical scenario: The thickness of a metal sheet produced by a
machine, where the thickness is equally likely to be any value within a certain
tolerance range.
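
The continuous distributions above can likewise be written down directly. A minimal Python sketch of the normal density, the exponential CDF, and the uniform density (function names are illustrative):

```python
import math

def normal_pdf(x: float, mu: float, sigma: float) -> float:
    """Density of the normal distribution with mean mu and std dev sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma**2)) / (sigma * math.sqrt(2 * math.pi))

def exponential_cdf(x: float, lam: float) -> float:
    """P(T <= x): probability the event has occurred by time x, rate lam."""
    return 1 - math.exp(-lam * x) if x >= 0 else 0.0

def uniform_pdf(x: float, a: float, b: float) -> float:
    """Density of the uniform distribution on [a, b]."""
    return 1 / (b - a) if a <= x <= b else 0.0

print(normal_pdf(0, 0, 1))        # ≈ 0.3989 (peak of the standard normal)
print(exponential_cdf(1.0, 1.0))  # ≈ 0.6321
print(uniform_pdf(7, 0, 15))      # ≈ 0.0667 (bus waiting time, 0-15 minutes)
```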

Parameters of Probability Distributions

Each probability distribution is defined by one or more parameters that determine its shape and location. Understanding these parameters is crucial for selecting the appropriate distribution for a given situation and interpreting the results. For example, the normal distribution is defined by its mean (μ) and standard deviation (σ), while the Poisson distribution is defined by its rate parameter (λ).

Probability Density Function (PDF) and Cumulative Distribution Function (CDF)

●​ Probability Density Function (PDF): For continuous distributions, the PDF represents
the probability density at each point. The area under the PDF curve between two
points represents the probability that the random variable falls within that range.
●​ Cumulative Distribution Function (CDF): The CDF represents the probability that the
random variable is less than or equal to a given value. It's calculated by integrating
the PDF from negative infinity to the given value.
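
One way to see the PDF/CDF relationship concretely is to integrate the standard normal PDF numerically and compare the result with the closed-form CDF, which can be written with the error function available in Python's standard library. A rough sketch:

```python
import math

def normal_pdf(x: float) -> float:
    """Standard normal density."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def normal_cdf(x: float) -> float:
    """Standard normal CDF in closed form via the error function."""
    return 0.5 * (1 + math.erf(x / math.sqrt(2)))

def cdf_by_integration(x: float, lower: float = -8.0, steps: int = 100_000) -> float:
    """Midpoint-rule area under the PDF from 'negative infinity' (truncated)."""
    h = (x - lower) / steps
    return sum(normal_pdf(lower + (i + 0.5) * h) for i in range(steps)) * h

print(normal_cdf(1.0))            # ≈ 0.8413
print(cdf_by_integration(1.0))    # ≈ 0.8413: area under the PDF matches the CDF
```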

Hypothesis Testing

Hypothesis testing is a statistical method used to evaluate a claim or hypothesis about a population based on sample data. It involves formulating a null hypothesis (H0) and an alternative hypothesis (H1), and then using statistical tests to determine whether there is enough evidence to reject the null hypothesis in favor of the alternative hypothesis.

Null and Alternative Hypotheses

●​ Null Hypothesis (H0): A statement about the population that we assume to be true
unless there is sufficient evidence to reject it. It often represents the status quo or a
commonly accepted belief.
●​ Alternative Hypothesis (H1): A statement that contradicts the null hypothesis and
represents what we are trying to prove.
●​ Example:
●​ H0: The average height of adult males is 5'10".
●​ H1: The average height of adult males is not 5'10".

Steps in Hypothesis Testing

1.​ State the null and alternative hypotheses: Clearly define the hypotheses you want to
test.
2.​ Choose a significance level (α): The significance level represents the probability of
rejecting the null hypothesis when it is actually true (Type I error). Common values for
α are 0.05 and 0.01.
3.​ Select a test statistic: Choose an appropriate test statistic based on the type of data
and the hypotheses being tested. Examples include the t-statistic, z-statistic, and
chi-square statistic.
4.​ Calculate the test statistic and p-value: Calculate the value of the test statistic using
the sample data and determine the p-value. The p-value represents the probability of
observing a test statistic as extreme as or more extreme than the one calculated,
assuming the null hypothesis is true.
5.​ Make a decision: Compare the p-value to the significance level (α). If the p-value is
less than α, reject the null hypothesis in favor of the alternative hypothesis.
Otherwise, fail to reject the null hypothesis.
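
The five steps can be sketched end to end for a simple case, a one-sample z-test with a known population standard deviation (the sample numbers below are hypothetical, chosen only to illustrate the mechanics):

```python
import math

mu0, sigma, n, x_bar = 10.0, 2.0, 25, 10.8   # step 1: H0: mu = 10, H1: mu != 10
alpha = 0.05                                  # step 2: significance level

z = (x_bar - mu0) / (sigma / math.sqrt(n))    # steps 3-4: test statistic
# Two-tailed p-value from the standard normal CDF (via math.erf)
p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

print(round(z, 2), round(p_value, 4))         # 2.0 0.0455
reject = p_value < alpha                      # step 5: decision
print("reject H0" if reject else "fail to reject H0")
```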

Types of Errors in Hypothesis Testing

●​ Type I Error (False Positive): Rejecting the null hypothesis when it is actually true.
The probability of making a Type I error is equal to the significance level (α).
●​ Type II Error (False Negative): Failing to reject the null hypothesis when it is actually
false. The probability of making a Type II error is denoted by β.
●​ Power of a Test (1 - β): The probability of correctly rejecting the null hypothesis when
it is false.
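
The definition of a Type I error can be checked by simulation: if we generate many datasets for which H0 is actually true and test each one at α = 0.05, roughly 5% of the tests should reject. A rough Monte Carlo sketch (the experiment sizes are arbitrary):

```python
import random

random.seed(42)
n_flips, n_experiments, rejections = 100, 2000, 0

for _ in range(n_experiments):
    # H0 is TRUE here: the simulated coin really is fair (p = 0.5)
    heads = sum(random.random() < 0.5 for _ in range(n_flips))
    p_hat = heads / n_flips
    z = (p_hat - 0.5) / (0.5 * 0.5 / n_flips) ** 0.5
    if abs(z) >= 1.96:          # reject at alpha = 0.05 (two-tailed)
        rejections += 1

# The observed rejection rate estimates the Type I error rate
print(rejections / n_experiments)   # close to 0.05
```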

Common Hypothesis Tests

● T-tests: Used to compare the means of two groups.
●​ One-sample t-test: Compares the mean of a single sample to a known value.
●​ Two-sample t-test: Compares the means of two independent samples.
●​ Paired t-test: Compares the means of two related samples (e.g., before and
after measurements).
●​ Z-tests: Used to compare the means of two groups when the population standard
deviations are known or the sample sizes are large.
●​ Chi-square tests: Used to test for associations between categorical variables.
●​ Chi-square test of independence: Tests whether two categorical variables are
independent.
●​ Chi-square goodness-of-fit test: Tests whether a sample distribution fits a
hypothesized distribution.
●​ ANOVA (Analysis of Variance): Used to compare the means of three or more groups.
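
As an illustration of the chi-square goodness-of-fit test, the statistic itself takes only a few lines. The roll counts below are hypothetical, and the critical value 11.07 for 5 degrees of freedom at α = 0.05 comes from a standard chi-square table:

```python
# H0: the die is fair, so each face is expected sum(observed)/6 times.
observed = [18, 22, 16, 14, 12, 18]                          # hypothetical counts
expected = [sum(observed) / len(observed)] * len(observed)   # 100/6 each

chi_sq = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
print(round(chi_sq, 2))   # 3.68

# 3.68 < 11.07 (critical value, df = 5, alpha = 0.05), so we fail to
# reject H0: these counts are consistent with a fair die.
```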

P-value

The p-value is a crucial concept in hypothesis testing. It represents the probability of observing a test statistic as extreme as, or more extreme than, the one calculated from the sample data, assuming the null hypothesis is true. A small p-value (typically less than the significance level α) provides evidence against the null hypothesis, leading to its rejection.

●​ Example: Suppose you are testing the hypothesis that a new drug is effective in
reducing blood pressure. You conduct a clinical trial and obtain a p-value of 0.03. If
your significance level is 0.05, you would reject the null hypothesis and conclude that
the drug is effective. However, if your significance level is 0.01, you would fail to
reject the null hypothesis.

Significance Level (α)

The significance level (α) is a pre-determined threshold that represents the probability of making a Type I error (rejecting the null hypothesis when it is true). It is typically set at 0.05 or 0.01, meaning that there is a 5% or 1% chance of rejecting the null hypothesis when it is actually true.

One-Tailed vs. Two-Tailed Tests

●​ One-Tailed Test: Used when the alternative hypothesis specifies a direction (e.g., the
mean is greater than a certain value).
●​ Two-Tailed Test: Used when the alternative hypothesis does not specify a direction
(e.g., the mean is not equal to a certain value).
●​ Example:
●​ One-tailed: H0: μ = 10, H1: μ > 10
●​ Two-tailed: H0: μ = 10, H1: μ ≠ 10
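
For a z-based test the two kinds of p-value differ only in which tails are counted: one tail for a directional H1, both tails otherwise. A quick sketch:

```python
import math

def normal_sf(z: float) -> float:
    """P(Z > z): upper-tail probability of the standard normal."""
    return 1 - 0.5 * (1 + math.erf(z / math.sqrt(2)))

z = 2.0
print(round(normal_sf(z), 4))            # one-tailed (H1: mu > 10): 0.0228
print(round(2 * normal_sf(abs(z)), 4))   # two-tailed (H1: mu != 10): 0.0455
```

The same statistic can thus be significant one-tailed but not two-tailed, which is why the direction of H1 must be fixed before looking at the data.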
Practical Examples and Demonstrations

Let's consider a few practical examples to illustrate the application of probability distributions and hypothesis testing.

Example 1: Coin Flipping

Suppose you flip a coin 100 times and observe 60 heads. You want to test the hypothesis that the coin is fair (i.e., the probability of heads is 0.5).

1. Null Hypothesis (H0): The coin is fair (p = 0.5).
2. Alternative Hypothesis (H1): The coin is not fair (p ≠ 0.5).
3. Significance Level (α): 0.05.
4. Test Statistic: We can use a z-test for proportions. The test statistic is calculated as z = (p̂ - p) / sqrt(p(1-p)/n), where p̂ is the sample proportion (60/100 = 0.6), p is the hypothesized proportion (0.5), and n is the sample size (100).
5. Calculation: z = (0.6 - 0.5) / sqrt(0.5(1-0.5)/100) = 2.
6. P-value: The p-value for a two-tailed z-test with a test statistic of 2 is approximately 0.0455.
7. Decision: Since the p-value (0.0455) is less than the significance level (0.05), we reject the null hypothesis and conclude that the coin is not fair.
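
The whole coin-flip calculation fits in a few lines of Python, using `math.erf` for the normal CDF:

```python
import math

p_hat, p0, n = 0.60, 0.50, 100   # 60 heads in 100 flips; H0: p = 0.5

z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)
p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

print(round(z, 2))         # 2.0
print(round(p_value, 4))   # 0.0455 < 0.05, so reject H0: the coin is not fair
```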

Example 2: Comparing Two Groups

Suppose you want to compare the average test scores of two groups of students: a control group and an experimental group. You collect data on the test scores of 30 students in each group.

1. Null Hypothesis (H0): The average test scores of the two groups are equal (μ1 = μ2).
2. Alternative Hypothesis (H1): The average test scores of the two groups are not equal (μ1 ≠ μ2).
3. Significance Level (α): 0.05.
4. Test Statistic: We can use a two-sample t-test. The test statistic is calculated as t = (x̄1 - x̄2) / sqrt(s1^2/n1 + s2^2/n2), where x̄1 and x̄2 are the sample means, s1 and s2 are the sample standard deviations, and n1 and n2 are the sample sizes.
5. Calculation: Suppose the sample mean and standard deviation for the control group are 75 and 10, respectively, and the sample mean and standard deviation for the experimental group are 80 and 12, respectively. Then the test statistic is t = (80 - 75) / sqrt(10^2/30 + 12^2/30) ≈ 1.75.
6. P-value: The p-value for a two-tailed t-test with a test statistic of 1.75 and 58 degrees of freedom is approximately 0.085.
7. Decision: Since the p-value (0.085) is greater than the significance level (0.05), we fail to reject the null hypothesis and conclude that there is not enough evidence to suggest that the average test scores of the two groups are different.
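
The t statistic can be checked in a couple of lines (with equal sample sizes, the pooled and Welch formulas coincide). Evaluating the p-value exactly requires the t distribution's CDF, which the Python standard library does not provide, so this sketch computes only the statistic:

```python
import math

x1, s1, n1 = 75.0, 10.0, 30   # control group: mean, std dev, size
x2, s2, n2 = 80.0, 12.0, 30   # experimental group: mean, std dev, size

t = (x2 - x1) / math.sqrt(s1**2 / n1 + s2**2 / n2)
print(round(t, 2))   # 1.75
```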
Example 3: A/B Testing

A/B testing is a common application of hypothesis testing in marketing and web development. Suppose a company wants to test two different versions of a website landing page to see which one leads to a higher conversion rate (e.g., the percentage of visitors who make a purchase).

1.​ Null Hypothesis (H0): There is no difference in conversion rates between the two
landing pages (p1 = p2).
2.​ Alternative Hypothesis (H1): There is a difference in conversion rates between the
two landing pages (p1 ≠ p2).
3.​ Significance Level (α): 0.05.
4.​ Test Statistic: We can use a z-test for comparing two proportions.
5.​ Calculation: Suppose landing page A has 1000 visitors and 50 conversions
(conversion rate of 5%), and landing page B has 1000 visitors and 65 conversions
(conversion rate of 6.5%). The z-test statistic can be calculated, and based on that,
the p-value can be determined.
6.​ Decision: If the p-value is less than 0.05, we reject the null hypothesis and conclude
that there is a statistically significant difference in conversion rates between the two
landing pages. The company would then choose the landing page with the higher
conversion rate.
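
A sketch of the calculation left implicit in step 5, using the pooled two-proportion z-test; with these particular counts the difference turns out not to be statistically significant at α = 0.05:

```python
import math

x_a, n_a = 50, 1000    # landing page A: 50 conversions out of 1000 visitors
x_b, n_b = 65, 1000    # landing page B: 65 conversions out of 1000 visitors

p_a, p_b = x_a / n_a, x_b / n_b
p_pool = (x_a + x_b) / (n_a + n_b)    # pooled proportion under H0: p1 = p2
se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))

z = (p_b - p_a) / se
p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

print(round(z, 2))         # 1.44
print(round(p_value, 3))   # ≈ 0.15 > 0.05: fail to reject H0 for this data
```

In practice the company would either collect more traffic or accept that the observed 5% vs. 6.5% difference could plausibly be due to chance.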
