UNIT - 2
Que 1 : How do you calculate the chi-square (χ²) test? Explain with an example. x2
Que 2 : Discuss in detail the maximum likelihood estimate with an example. x2
Que 3 : Explain the different types of variables used in regression modelling. x1
Que 4 : What is multivariate analysis? Describe it in detail. x1
Que 5 : What is regression analysis? Explain simple & multiple regression. x1
Que 6 : What is Bayesian modelling? How does it work? Describe its advantages and disadvantages. x1
Chi Square Test :
The chi-square test is a statistical procedure used to determine whether there is a significant association between two categorical variables, or whether there is a significant difference between observed and expected frequencies in categorical data.
It is a non-parametric test, meaning it does not assume any particular distribution for the data.
Types:
1. Chi-Square Test of Independence: Tests whether two categorical variables are related or independent of each other (used with contingency tables).
2. Chi-Square Goodness-of-Fit Test: Tests whether the observed distribution of data matches an expected theoretical distribution.
Formula:
χ² = Σ (Oᵢ − Eᵢ)² / Eᵢ
where Oᵢ is the observed frequency and Eᵢ is the expected frequency in category i.
Assumptions:
1. Data must be categorical (nominal or ordinal).
2. Observations should be independent.
3. Categories of the variables must be mutually exclusive.
4. Expected frequency in each category should be at least 5.
5. Data should be randomly selected to minimize bias.
Steps of Chi Square Test :
1. State the Hypotheses
2. Construct a Contingency Table
3. Calculate Expected Frequencies
4. Calculate the Chi-Square Statistic
5. Determine Degrees of Freedom (df = (rows − 1) × (columns − 1) for a test of independence)
6. Compare the statistic to the critical value at the chosen significance level
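The steps above can be sketched in pure Python for a 2×2 contingency table. The observed counts below are made-up illustrative data; 3.841 is the standard χ² critical value for df = 1 at α = 0.05.

```python
# Chi-square test of independence on a 2x2 contingency table.
# Rows: group A / group B; columns: preference 1 / preference 2 (illustrative data).
observed = [[20, 30],
            [30, 20]]

row_totals = [sum(row) for row in observed]          # [50, 50]
col_totals = [sum(col) for col in zip(*observed)]    # [50, 50]
grand_total = sum(row_totals)                        # 100

# Expected frequency for each cell: (row total * column total) / grand total
expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

# Chi-square statistic: sum of (O - E)^2 / E over all cells
chi2 = sum((o - e) ** 2 / e
           for o_row, e_row in zip(observed, expected)
           for o, e in zip(o_row, e_row))

df = (len(observed) - 1) * (len(observed[0]) - 1)

critical = 3.841  # chi-square critical value for df = 1, alpha = 0.05
print(f"chi2 = {chi2:.3f}, df = {df}")
print("Reject H0: variables are associated" if chi2 > critical
      else "Fail to reject H0")
```

Here χ² = 4.0 > 3.841, so the null hypothesis of independence is rejected at the 5% level.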
Correlation analysis :
Correlation analysis is a statistical method used to measure and
evaluate the strength and direction of the relationship between two or
more variables.
1. It helps identify whether changes in one variable are associated
with changes in another and quantifies the degree of this
association.
2. Used to discover if there is a relationship between variables and
how strong that relationship may be.
3. Commonly applied in market research, social sciences, and data analysis to identify patterns, trends, and significant connections.
Correlation Coefficient:
The correlation coefficient (often denoted as r) quantifies the strength
and direction of the linear relationship between two variables.
It ranges from −1 to +1:
+1: Perfect positive correlation (both variables increase together).
-1: Perfect negative correlation (one variable increases as the other
decreases).
0: No linear correlation.
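As a sketch, r can be computed directly from its definition (covariance divided by the product of the standard deviations); the study-hours and score values below are made up for illustration:

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient: cov(x, y) / (std(x) * std(y))."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)

hours_studied = [1, 2, 3, 4, 5]        # illustrative data
exam_scores   = [52, 58, 63, 70, 77]   # rises with hours -> strong positive r

r = pearson_r(hours_studied, exam_scores)
print(f"r = {r:.3f}")  # close to +1: strong positive correlation
```

Because scores rise almost linearly with hours, r comes out very close to +1.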
Types of Correlation :
Positive Correlation: Both variables move in the same direction (as one
increases, so does the other).
Negative Correlation: Variables move in opposite directions (as one
increases, the other decreases).
No Correlation: No relationship between variables.
[Diagram: scatter plots of positive, negative, and no correlation]
Application :
1. Business analytics
2. Medical Research
3. Weather forecast
4. Scientific research
Maximum Likelihood Estimation (MLE) :
1. Maximum likelihood estimation is a statistical method used to estimate the parameters of a probability distribution based on observed data.
2. The fundamental idea behind MLE is to find the parameter values that maximize the likelihood of the observed data.
Likelihood Function: The likelihood function is defined as the joint
probability of the observed data given the parameters. It is denoted
as L(θ; x), where θ represents the parameters and x represents the
observed data.
Steps to Perform MLE
1. Assume a probability distribution for your data (e.g., normal,
binomial).
2. Write the likelihood function using the assumed distribution
and the sample.
3. Take the log of the likelihood function (log-likelihood).
4. Differentiate the log-likelihood with respect to the
parameter(s).
5. Set derivative = 0 and solve to find the parameter(s) that
maximize the function.
Applications of MLE
Estimating parameters in machine learning models (e.g., logistic
regression).
Used in Bayesian analysis (the likelihood function is combined with a prior).
Widely used in probability modeling and hypothesis testing.
Example: MLE for Estimating the Mean of a Normal Distribution
1. Sample data: x = [2, 3, 4]
2. Assume: the data come from a normal distribution N(μ, σ²).
3. Assume the variance σ² = 1 is known.
4. Goal: estimate the mean μ using MLE.
Working through the steps above, the log-likelihood is maximized when μ̂ equals the sample mean: μ̂ = (2 + 3 + 4) / 3 = 3.
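The worked example can be checked numerically: for N(μ, 1) the log-likelihood is maximized at the sample mean. A minimal sketch, using a grid search only to make the maximization visible (analytically the answer is μ̂ = x̄):

```python
import math

data = [2, 3, 4]   # sample from the example
sigma2 = 1.0       # variance assumed known

def log_likelihood(mu):
    """Log-likelihood of the sample under N(mu, sigma2)."""
    n = len(data)
    return (-n / 2 * math.log(2 * math.pi * sigma2)
            - sum((x - mu) ** 2 for x in data) / (2 * sigma2))

# Grid search over candidate means in [1.00, 5.00] to locate the maximum.
candidates = [i / 100 for i in range(100, 501)]
mle_mu = max(candidates, key=log_likelihood)

sample_mean = sum(data) / len(data)
print(f"MLE of mu = {mle_mu}, sample mean = {sample_mean}")  # both 3.0
```

The grid maximum lands exactly on the sample mean, matching the closed-form result from setting the derivative of the log-likelihood to zero.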
Multivariate Analysis :
1. Multivariate analysis refers to statistical techniques that
simultaneously examine three or more variables to understand
the relationships and patterns between them.
2. It is generally performed to uncover patterns, correlations, and
dependencies among multiple variables.
3. Unlike univariate (one variable) or bivariate (two variables)
analysis, multivariate analysis provides a more comprehensive
view by considering multiple variables simultaneously.
Assumptions in MVA
Normality: Data should follow a normal distribution.
Linearity: Relationship among variables should be linear.
Homogeneity of variance: Variances across groups should be
equal.
No multicollinearity: Independent variables should not be
highly correlated.
Advantages :
a. Helps in dimensionality reduction and model performance.
b. More efficient than univariate and bivariate analysis.
c. Reveals hidden patterns and relationships.
Technique | Purpose | Example Use
Multiple Linear Regression (MLR) | Predict a continuous dependent variable based on several independent variables. | Predict house price based on area, location, and number of rooms.
Multiple Logistic Regression | Predicts a yes/no outcome using several variables. | Predicting if a student passes based on hours studied, attendance, and grades.
Multivariate Analysis of Variance (MANOVA) | Compare group means on multiple dependent variables simultaneously. | Evaluate the effect of teaching method on student scores in math and science.
Factor Analysis (FA) | Reduces many variables into a few underlying factors. | Combining diet, exercise, and sleep into a "health" factor.
Principal Component Analysis (PCA) | Transforms many variables into a few uncorrelated components. | Reducing height, weight, and age into a "body size" component.
Cluster Analysis | Groups similar observations together based on several variables. | Segmenting customers by spending, age, and purchase frequency.
Discriminant Analysis (DA) | Classify observations into groups based on predictor variables. | Classify loan applicants as risky or safe.
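As a sketch of one technique from the table, here is PCA on two made-up correlated variables (height and weight). The data are standardized, their 2×2 covariance matrix is built, and its eigenvalues are solved analytically (avoiding external libraries); the larger eigenvalue's share of the total is the variance explained by the first component.

```python
import math

# Illustrative data: two correlated variables (height in cm, weight in kg).
x = [150, 160, 170, 180, 190]
y = [50, 58, 65, 74, 80]

def standardize(v):
    """Center to mean 0 and scale to sample standard deviation 1."""
    n = len(v)
    mean = sum(v) / n
    sd = math.sqrt(sum((a - mean) ** 2 for a in v) / (n - 1))
    return [(a - mean) / sd for a in v]

xs, ys = standardize(x), standardize(y)
n = len(xs)

# 2x2 covariance matrix of standardized data (= correlation matrix).
sxx = sum(a * a for a in xs) / (n - 1)
syy = sum(b * b for b in ys) / (n - 1)
sxy = sum(a * b for a, b in zip(xs, ys)) / (n - 1)

# Eigenvalues of [[sxx, sxy], [sxy, syy]], solved analytically.
mean_diag = (sxx + syy) / 2
delta = math.sqrt(((sxx - syy) / 2) ** 2 + sxy ** 2)
lam1, lam2 = mean_diag + delta, mean_diag - delta  # lam1 >= lam2

explained = lam1 / (lam1 + lam2)
print(f"first component explains {explained:.1%} of the variance")
```

Because height and weight here are almost perfectly correlated, one "body size" component captures nearly all of the variance, which is exactly the dimensionality reduction the table describes.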
Regression Analysis / Modelling :
Regression analysis is a statistical method used to examine the relationship between a dependent variable and one or more independent variables (predictors).
Regression modelling is a statistical technique used to estimate and model the relationships between a dependent (outcome) variable and one or more independent (predictor) variables.
Purpose:
1. To predict the value of a dependent variable based on
independent variables.
2. To understand the strength and direction of relationships
between dependent & independent variables.
Component | Meaning
Dependent Variable (Y) | The outcome variable being predicted.
Independent Variable (X) | The predictor(s) used to predict the dependent variable (Y).
Intercept (β₀) | The value of the dependent variable when all independent variables are zero.
Slope (β₁, β₂, ...) | Amount Y changes for a one-unit change in X.
Error Term (ε) | Difference between observed and predicted values (residuals) of the dependent variable.
These components combine in the model Y = β₀ + β₁X₁ + β₂X₂ + … + ε.
Assumptions of Linear Regression:
1. Linearity – Relationship between X and Y is linear.
2. Independence – Observations are independent of each other.
3. Homoscedasticity – Constant variance of errors.
4. Normality – Residuals (observed minus predicted values) are normally distributed.
5. No Multicollinearity – Predictors (independent variables) are not highly correlated with each other.
Type | Description
Simple Linear Regression | One independent variable predicts the dependent variable; the relationship is modeled with a straight line.
Multiple Linear Regression | More than one independent variable predicts the dependent variable.
Logistic Regression | Used when the dependent variable is categorical (usually binary); predicts a categorical (yes/no or 0/1) outcome.
Polynomial Regression | Models a nonlinear relationship between X and Y using polynomial terms.
Ridge/Lasso Regression | Regularized regression to prevent overfitting in high-dimensional data; used in machine learning.
Applications:
Predicting prices (e.g., real estate, stock).
Medical studies (e.g., effect of lifestyle on health).
Economics (e.g., impact of interest rates on GDP).
Business forecasting (e.g., sales prediction).