Least-Squares Regression
ChEn 2450
Concept: Given N data points (x_i, y_i), find parameters in the function f(x) that minimize the error between f(x_i) and y_i.
[Figure: data points with a fitted curve f(x) plotted versus x.]
Hoffman 4.10.3
Linear Least-Squares Regression
Concept: Given N data points (x_i, y_i), find parameters in the function f(x) that minimize the error between f(x_i) and y_i.
S = \sum_{i=1}^{N} \left[ y_i - f(x_i) \right]^2
S is the sum of the squared errors.
To minimize S (the error between function and data), we take partial derivatives with respect to the function parameters and set them to zero. ASSUME f(x) is an n_p-th order polynomial:
f(x) = \sum_{k=0}^{n_p} a_k x^k
S = \sum_{i=1}^{N} \left[ y_i - \sum_{k=0}^{n_p} a_k x_i^k \right]^2
i - data point index; k - polynomial coefficient index
\frac{\partial S}{\partial a_k} = \sum_{i=1}^{N} -2 x_i^k \left[ y_i - \sum_{j=0}^{n_p} a_j x_i^j \right] = 0, \quad k = 0 \ldots n_p
Normal equations (only for polynomials).
i - data point index; j - dummy index; k - polynomial coefficient index
This gives n_p+1 equations for the n_p+1 unknowns a_k. The equations are linear with respect to the a_k.
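These normal equations translate directly into code. A minimal MATLAB sketch (not from the original slides; the function name normal_eqn_fit and the variable names are illustrative) that assembles the (n_p+1) x (n_p+1) system from the formula above and solves it:

% Build and solve the normal equations for an np-th order polynomial fit.
% x, y: column vectors of the N data points; np: polynomial order.
function a = normal_eqn_fit(x, y, np)
    M   = zeros(np+1, np+1);           % coefficient matrix
    rhs = zeros(np+1, 1);              % right-hand side vector
    for k = 0:np
        for j = 0:np
            M(k+1, j+1) = sum(x.^(k+j));   % sum_i x_i^k * x_i^j
        end
        rhs(k+1) = sum(x.^k .* y);         % sum_i x_i^k * y_i
    end
    a = M \ rhs;                       % a(1) = a0, ..., a(np+1) = a_np
end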
Example: np=1 (Linear Polynomial)
\frac{\partial S}{\partial a_k} = \sum_{i=1}^{N} x_i^k \left[ y_i - \sum_{j=0}^{n_p} a_j x_i^j \right] = 0, \quad k = 0 \ldots n_p
Here we divided the entire equation by -2. (Why?)
Linear polynomial, n_p = 1:
f(x) = a_0 + a_1 x
S = \sum_{i=1}^{N} \left[ y_i - a_0 - a_1 x_i \right]^2 = \sum_{i=1}^{N} g_i^2, \qquad g_i = y_i - a_0 - a_1 x_i
Apply chain rule.
\frac{\partial S}{\partial a_0} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial a_0} = \sum_{i=1}^{N} (2 g_i)(-1) = \sum_{i=1}^{N} -2 \left( y_i - a_0 - a_1 x_i \right) = 0
\frac{\partial S}{\partial a_1} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial a_1} = \sum_{i=1}^{N} (2 g_i)(-x_i) = \sum_{i=1}^{N} -2 x_i \left( y_i - a_0 - a_1 x_i \right) = 0
Rearranging:
\sum_{i=1}^{N} y_i = \sum_{i=1}^{N} (a_0 + a_1 x_i)
\sum_{i=1}^{N} y_i x_i = \sum_{i=1}^{N} (a_0 + a_1 x_i)\, x_i
2 equations, 2 unknowns (a0, a1)
Example (contd.)
\sum_{i=1}^{N} y_i = \sum_{i=1}^{N} (a_0 + a_1 x_i)
\sum_{i=1}^{N} y_i x_i = \sum_{i=1}^{N} (a_0 + a_1 x_i)\, x_i
Two equations, two unknowns. Let's put these in matrix form.
For N = 4 points:
y_1 + y_2 + y_3 + y_4 = (a_0 + a_1 x_1) + (a_0 + a_1 x_2) + (a_0 + a_1 x_3) + (a_0 + a_1 x_4)
y_1 x_1 + y_2 x_2 + y_3 x_3 + y_4 x_4 = (a_0 + a_1 x_1)\, x_1 + (a_0 + a_1 x_2)\, x_2 + (a_0 + a_1 x_3)\, x_3 + (a_0 + a_1 x_4)\, x_4
Step 1: define the solution variable vector:
\mathbf{a} = \begin{bmatrix} a_0 \\ a_1 \end{bmatrix}
Step 2: define the matrix and RHS vector:
\begin{bmatrix} N & \sum_{i=1}^{N} x_i \\ \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \end{bmatrix} = \begin{bmatrix} \sum_{i=1}^{N} y_i \\ \sum_{i=1}^{N} x_i y_i \end{bmatrix}
Linear least-squares regression for a linear polynomial.
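For the linear-polynomial case this is only a few lines of MATLAB. A minimal sketch (my own; x and y are assumed to be column vectors of the data):

% Linear least squares for f(x) = a0 + a1*x via the 2x2 normal equations.
N = length(x);
M = [N,      sum(x);
     sum(x), sum(x.^2)];
b = [sum(y); sum(x.*y)];
a = M \ b;                 % a(1) = a0, a(2) = a1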
Example - Reaction Rate Constant
Benzene diazonium chloride decomposes to chlorobenzene and nitrogen:
C6H5N=NCl -> C6H5Cl + N2
The rate constant k follows the Arrhenius expression
k = A \exp\left( -\frac{E_a}{RT} \right)
where A is the pre-exponential factor, E_a the activation energy, T the temperature, and R = 8.314 J/(mol K) the gas constant. Taking logarithms (recall ln(ab) = ln(a) + ln(b)):
\ln(k) = \ln(A) - \frac{E_a}{RT}
Data:
T (K):    313      319      323      328      333
k (1/s):  0.00043  0.00103  0.00180  0.00355  0.00717
[Figure: ln(k) (1/s) versus 1/RT (mol/J), scale ~3.6-3.9 x 10^-4, showing the data and the best fit: ln(A) = 38.9246, E_a = 121481.5094 J/mol.]
This is a linear regression problem, y = a_0 + a_1 x, with
y = \ln(k), \quad x = \frac{1}{RT}, \quad a_0 = \ln(A), \quad a_1 = -E_a
"
PN
N xi
i=1
# PN a0 b0 i=1 xi PN 2 = a1 b1 i=1 xi
Note: we need to calculate A (pre-exponential factor) from a0.
Now we are ready to go to the computer to determine a0 and a1.
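Here is one way the whole example might look in MATLAB (a sketch assembled from the table above; the variable names are mine):

% Arrhenius fit: ln(k) = ln(A) - Ea/(R*T), i.e. y = a0 + a1*x.
R = 8.314;                                          % J/(mol K)
T = [313; 319; 323; 328; 333];                      % K
k = [0.00043; 0.00103; 0.00180; 0.00355; 0.00717];  % 1/s
x = 1 ./ (R*T);                                     % mol/J
y = log(k);                                         % log() is the natural log
N = length(x);
M = [N, sum(x); sum(x), sum(x.^2)];
b = [sum(y); sum(x.*y)];
a = M \ b;
A  = exp(a(1));     % pre-exponential factor, recovered from a0 = ln(A)
Ea = -a(2);         % activation energy in J/mol, since a1 = -Ea

This should reproduce the fitted values shown in the figure (ln(A) ≈ 38.92, E_a ≈ 1.215e5 J/mol).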
Polynomial Regression & Normal Equations
(Alternative Formulation for Linear Least Squares)
p(x) = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \cdots + a_{n_p} x^{n_p}
Given N > n_p observations (x_i, y_i) and an n_p-th order polynomial, find the a_j.
One equation for each observation (N equations):
\underbrace{\begin{bmatrix} 1 & x_1 & x_1^2 & \cdots & x_1^{n_p} \\ 1 & x_2 & x_2^2 & \cdots & x_2^{n_p} \\ \vdots & \vdots & \vdots & & \vdots \\ 1 & x_N & x_N^2 & \cdots & x_N^{n_p} \end{bmatrix}}_{A} \begin{bmatrix} a_0 \\ a_1 \\ \vdots \\ a_{n_p} \end{bmatrix} = \underbrace{\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_N \end{bmatrix}}_{\mathbf{b}}
NOTE: this is an overdetermined (more equations than unknowns) linear problem for the coefficients a_i.
A^T A \mathbf{a} = A^T \mathbf{b} \qquad \text{(Normal equations)}
Another form of linear least-squares regression.
Example - linear polynomial, p(x) = a_0 + a_1 x:
\begin{bmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_N \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \end{bmatrix} = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_N \end{bmatrix}
and the normal equations become
\begin{bmatrix} 1 & 1 & \cdots & 1 \\ x_1 & x_2 & \cdots & x_N \end{bmatrix} \begin{bmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_N \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \end{bmatrix} = \begin{bmatrix} 1 & 1 & \cdots & 1 \\ x_1 & x_2 & \cdots & x_N \end{bmatrix} \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_N \end{bmatrix}
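In MATLAB this formulation is a few lines (a sketch; x and y are data column vectors and np is the polynomial order). MATLAB's backslash operator returns the least-squares solution of an overdetermined system directly, so forming A^T A explicitly is optional:

% Overdetermined polynomial fit A*a = b, solved in the least-squares sense.
A = ones(length(x), np+1);
for j = 1:np
    A(:, j+1) = x.^j;          % columns: 1, x, x^2, ..., x^np
end
a_backslash = A \ y;           % least-squares solution via backslash
a_normal    = (A'*A) \ (A'*y); % explicit normal equations, same result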
The Two are One...
Consider each of the previous approaches for a first-order polynomial.
Direct least squares:
\frac{\partial S}{\partial a_k} = \sum_{i=1}^{N} x_i^k \left[ y_i - \sum_{j=0}^{n_p} a_j x_i^j \right] = 0, \quad k = 0 \ldots n_p
k = 0: \quad \sum_{i=1}^{N} \left( y_i - a_0 - a_1 x_i \right) = 0
k = 1: \quad \sum_{i=1}^{N} x_i \left( y_i - a_0 - a_1 x_i \right) = 0
Rearranging,
a_0 \sum_{i=1}^{N} 1 + a_1 \sum_{i=1}^{N} x_i = \sum_{i=1}^{N} y_i
a_0 \sum_{i=1}^{N} x_i + a_1 \sum_{i=1}^{N} x_i^2 = \sum_{i=1}^{N} x_i y_i
\begin{bmatrix} N & \sum_{i=1}^{N} x_i \\ \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \end{bmatrix} = \begin{bmatrix} \sum_{i=1}^{N} y_i \\ \sum_{i=1}^{N} x_i y_i \end{bmatrix}
Matrix transpose approach: A^T A \mathbf{a} = A^T \mathbf{b}, with
A = \begin{bmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_N \end{bmatrix}, \qquad \mathbf{b} = \begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ \vdots \\ y_N \end{bmatrix}
A^T A = \begin{bmatrix} N & \sum_{i=1}^{N} x_i \\ \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 \end{bmatrix}, \qquad A^T \mathbf{b} = \begin{bmatrix} \sum_{i=1}^{N} y_i \\ \sum_{i=1}^{N} x_i y_i \end{bmatrix}
Both approaches yield exactly the same system. The matrix transpose approach is typically most convenient for linear regression problems.
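A quick numerical check of this equivalence in MATLAB (my own sketch, for the first-order polynomial):

% Verify that A'*A and A'*b reproduce the 2x2 normal-equations system.
A = [ones(length(x),1), x];   % N x 2: one row [1, x_i] per data point
M = A'*A;                     % equals [N, sum(x); sum(x), sum(x.^2)]
b = A'*y;                     % equals [sum(y); sum(x.*y)]
a = M \ b;                    % same a0, a1 as the direct approach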
Example - Reaction Rate Constant
Same problem as before: the Arrhenius expression k = A \exp\left(-\frac{E_a}{RT}\right) linearizes to \ln(k) = \ln(A) - \frac{E_a}{RT}, i.e. y = a_0 + a_1 x with y = \ln(k), x = \frac{1}{RT}, a_0 = \ln(A), a_1 = -E_a.
With the matrix transpose approach, write one equation per data point:
\begin{bmatrix} 1 & \frac{1}{RT_1} \\ 1 & \frac{1}{RT_2} \\ \vdots & \vdots \\ 1 & \frac{1}{RT_N} \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \end{bmatrix} = \begin{bmatrix} \ln(k_1) \\ \ln(k_2) \\ \vdots \\ \ln(k_N) \end{bmatrix}, \qquad \text{i.e.} \quad A \mathbf{a} = \mathbf{b},
then solve A^T A \mathbf{a} = A^T \mathbf{b}.
Note: we need to calculate A (the pre-exponential factor) from a_0.
Linear Least Squares Regression in MATLAB
BEFORE you go to the computer, formulate the problem as a polynomial regression problem!
Do it manually - the way that we just showed.
See the MATLAB code for the previous example posted on the class website. This is my favored method, and it provides maximum flexibility!
Polynomial regression: p = polyfit(x,y,n)
- gives the best fit for a polynomial of order n through the data.
- if n == (length(x)-1) then you get an interpolant.
- if n < (length(x)-1) then you get a least-squares fit.
- You still must get the problem into polynomial form.
polyval(p,xi) evaluates the resulting polynomial at xi.
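For example (a short sketch; the data x, y and the evaluation points xi are assumed given):

% First-order least-squares fit with the built-in polyfit.
p  = polyfit(x, y, 1);    % NOTE: p lists coefficients highest power first,
                          % so p(1) = a1 and p(2) = a0 here.
yi = polyval(p, xi);      % evaluate the fitted polynomial at xi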
The R^2 Value
How well does the regressed line fit the data?
R^2 = 1 - \frac{\sum_{i=1}^{N} \left( y_i - f(x_i) \right)^2}{\sum_{i=1}^{N} \left( y_i - \bar{y} \right)^2}
(x_i, y_i) - data points that we are regressing; f(x) - function we are regressing to; f(x_i) - regression function value at x_i.
\bar{y} = \frac{1}{N} \sum_{i=1}^{N} y_i \quad \text{(average value of the } y_i \text{)}
R^2 = 1 is a perfect fit.
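Computing R^2 is one line of MATLAB once the fit is in hand (a sketch; here p is a polyfit-style coefficient vector for the regressed polynomial):

% Coefficient of determination for a polynomial fit p at data (x, y).
R2 = 1 - sum((y - polyval(p, x)).^2) / sum((y - mean(y)).^2);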
Nonlinear Least Squares Regression
Assume f(x) has n parameters ak that we want to determine via regression.
S = \sum_{i=1}^{N} \left[ y_i - f(x_i) \right]^2, \qquad \frac{\partial S}{\partial a_k} = 0, \quad k = 1 \ldots n
Same approach as before, but now the parameters of f(x) may enter nonlinearly! Example:
f(x) = a x^b
S = \sum_{i=1}^{N} \left[ y_i - a x_i^b \right]^2 = \sum_{i=1}^{N} g_i^2, \qquad g_i = y_i - a x_i^b
\frac{\partial S}{\partial a} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial a} = \sum_{i=1}^{N} (2 g_i)\left( -x_i^b \right) = \sum_{i=1}^{N} -2 x_i^b \left( y_i - a x_i^b \right)
\frac{\partial S}{\partial b} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial b} = \sum_{i=1}^{N} (2 g_i)\left( -a x_i^b \ln(x_i) \right) = \sum_{i=1}^{N} -2 a x_i^b \ln(x_i) \left( y_i - a x_i^b \right)
Setting these to zero and dropping the constant factors -2 and -2a:
0 = \sum_{i=1}^{N} x_i^b \left( y_i - a x_i^b \right)
0 = \sum_{i=1}^{N} x_i^b \ln(x_i) \left( y_i - a x_i^b \right)
Two nonlinear equations to solve for a and b.
Can we reformulate this as a linear problem?
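One answer, when all x_i and y_i are positive: take logarithms, ln(y) = ln(a) + b ln(x), which is linear in ln(a) and b. A MATLAB sketch (my own):

% Linearize y = a*x^b by fitting log(y) = log(a) + b*log(x).
p = polyfit(log(x), log(y), 1);   % requires x > 0 and y > 0
b = p(1);                         % exponent
a = exp(p(2));                    % prefactor

Note that this minimizes the squared error in ln(y), not the original S, so in general it gives a close but different answer from the true nonlinear fit.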
Kinetics Example Revisited
Fit the rate-constant data directly, without linearizing. Minimize the sum of squared errors S with respect to A and E_a, where
k = A \exp\left( -\frac{E_a}{RT} \right), \qquad R = 8.314 \text{ J/(mol K)}
S = \sum_{i=1}^{N} \left[ y_i - A \exp\left( -\frac{E_a}{RT_i} \right) \right]^2 = \sum_{i=1}^{N} g_i^2, \qquad g_i = y_i - A \exp\left( -\frac{E_a}{RT_i} \right)
Apply the chain rule:
\frac{\partial S}{\partial A} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial A} = \sum_{i=1}^{N} -2 g_i \exp\left( -\frac{E_a}{RT_i} \right) = 0
\frac{\partial S}{\partial E_a} = \sum_{i=1}^{N} \frac{\partial S}{\partial g_i} \frac{\partial g_i}{\partial E_a} = \sum_{i=1}^{N} 2 g_i \frac{A}{RT_i} \exp\left( -\frac{E_a}{RT_i} \right) = 0
Dropping the constant factors, these become
0 = \sum_{i=1}^{N} \exp\left( -\frac{E_a}{RT_i} \right) \left[ y_i - A \exp\left( -\frac{E_a}{RT_i} \right) \right]
0 = \sum_{i=1}^{N} \frac{1}{T_i} \exp\left( -\frac{E_a}{RT_i} \right) \left[ y_i - A \exp\left( -\frac{E_a}{RT_i} \right) \right]
Two nonlinear equations with two unknowns, A and E_a.
We will show how to solve this soon!
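For reference only (not the method the slides develop next), one hedged sketch: hand the two residual equations to a nonlinear solver such as fsolve (MATLAB Optimization Toolbox), starting from the linearized fit. The constant factors are dropped since they do not move the roots:

% Solve dS/dA = 0 and dS/dEa = 0 for p = [A; Ea]; T, k are data vectors.
R = 8.314;
E = @(p) exp(-p(2)./(R*T));                    % exp(-Ea/(R*T_i))
res = @(p) [sum(          E(p) .* (k - p(1)*E(p)));
            sum((1./T) .* E(p) .* (k - p(1)*E(p)))];
p0 = [exp(38.9246); 121481.5];                 % guess from the linear fit
p  = fsolve(res, p0);                          % p(1) = A, p(2) = Ea

Because A and E_a differ by many orders of magnitude, scaling the unknowns may be needed in practice.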