10.8 Combination of Steepest Descent and Newton's Method
Analysis of Quadratic Case
Since the method is not a full Newton method, we can conclude that it possesses only
linear convergence and that the dominating aspects of convergence will be revealed
by an analysis of the method as applied to a quadratic function. Furthermore, as
might be intuitively anticipated, the associated rate of convergence is governed
by the steepest descent part of algorithm (57), and that rate is determined by a
Kantorovich-like ratio defined over the subspace orthogonal to N.
Theorem. (Combined method). Let Q be an n × n symmetric positive definite
matrix, and let $x^* \in E^n$. Define the function

$$E(x) = \tfrac{1}{2}(x - x^*)^T Q (x - x^*)$$

and let $b = Qx^*$. Let B be an n × m matrix of rank m. Starting at an arbitrary
point $x_0$, define the iterative process

a) $u_k = -(B^T Q B)^{-1} B^T g_k$, where $g_k = Qx_k - b$.

b) $z_k = x_k + Bu_k$.

c) $p_k = b - Qz_k$.

d) $x_{k+1} = z_k + \alpha_k p_k$, where $\alpha_k = \dfrac{p_k^T p_k}{p_k^T Q p_k}$.

This process converges to $x^*$, and satisfies

$$E(x_{k+1}) \leq (1 - \delta)E(x_k) \tag{58}$$

where $\delta$, $0 \leq \delta \leq 1$, is the minimum of

$$\frac{(p^T p)^2}{(p^T Q p)(p^T Q^{-1} p)}$$

over all vectors p in the nullspace of $B^T$.
Proof. The algorithm given in the theorem statement is exactly the general
combined algorithm specialized to the quadratic situation. Next we note that

$$
\begin{aligned}
B^T p_k &= B^T Q(x^* - z_k) = B^T Q(x^* - x_k) - B^T Q B u_k \\
&= -B^T g_k + B^T Q B (B^T Q B)^{-1} B^T g_k = 0
\end{aligned} \tag{59}
$$

which merely proves that the gradient at $z_k$ is orthogonal to N. Next we calculate

$$
\begin{aligned}
2E(x_k) - 2E(z_k) &= (x_k - x^*)^T Q (x_k - x^*) - (z_k - x^*)^T Q (z_k - x^*) \\
&= -2u_k^T B^T Q(x_k - x^*) - u_k^T B^T Q B u_k \\
&= -2u_k^T B^T g_k + u_k^T B^T Q B (B^T Q B)^{-1} B^T g_k \\
&= -u_k^T B^T g_k = g_k^T B (B^T Q B)^{-1} B^T g_k
\end{aligned} \tag{60}
$$
Then we compute

$$
\begin{aligned}
2E(z_k) - 2E(x_{k+1}) &= (z_k - x^*)^T Q (z_k - x^*) - (x_{k+1} - x^*)^T Q (x_{k+1} - x^*) \\
&= -2\alpha_k p_k^T Q(z_k - x^*) - \alpha_k^2 p_k^T Q p_k \\
&= 2\alpha_k p_k^T p_k - \alpha_k^2 p_k^T Q p_k \\
&= \alpha_k p_k^T p_k = \frac{(p_k^T p_k)^2}{p_k^T Q p_k}
\end{aligned} \tag{61}
$$
Now using (59) and $p_k = -g_k - QBu_k$ we have

$$
\begin{aligned}
2E(x_k) &= (x_k - x^*)^T Q (x_k - x^*) = g_k^T Q^{-1} g_k \\
&= (p_k^T + u_k^T B^T Q)\, Q^{-1} (p_k + Q B u_k) \\
&= p_k^T Q^{-1} p_k + u_k^T B^T Q B u_k \\
&= p_k^T Q^{-1} p_k + g_k^T B (B^T Q B)^{-1} B^T g_k
\end{aligned} \tag{62}
$$
Adding (60) and (61), and dividing by (62), there results

$$
\frac{E(x_k) - E(x_{k+1})}{E(x_k)}
= \frac{g_k^T B (B^T Q B)^{-1} B^T g_k + (p_k^T p_k)^2 / (p_k^T Q p_k)}{p_k^T Q^{-1} p_k + g_k^T B (B^T Q B)^{-1} B^T g_k}
= \frac{q + p_k^T p_k / (p_k^T Q p_k)}{q + p_k^T Q^{-1} p_k / (p_k^T p_k)}
$$

where $q = g_k^T B (B^T Q B)^{-1} B^T g_k / (p_k^T p_k) \geq 0$. This has the form $(q + a)/(q + b)$ with

$$a = \frac{p_k^T p_k}{p_k^T Q p_k}, \qquad b = \frac{p_k^T Q^{-1} p_k}{p_k^T p_k}.$$

But for any $p_k$ it follows that $a \leq b$, since by the Cauchy–Schwarz inequality
$(p_k^T p_k)^2 \leq (p_k^T Q p_k)(p_k^T Q^{-1} p_k)$. Hence

$$\frac{q + a}{q + b} \geq \frac{a}{b}$$
and thus

$$\frac{E(x_k) - E(x_{k+1})}{E(x_k)} \geq \frac{(p_k^T p_k)^2}{(p_k^T Q p_k)(p_k^T Q^{-1} p_k)}$$

Finally,

$$E(x_{k+1}) \leq \left[1 - \frac{(p_k^T p_k)^2}{(p_k^T Q p_k)(p_k^T Q^{-1} p_k)}\right] E(x_k) \leq (1 - \delta)E(x_k)$$

since $B^T p_k = 0$.
The value $\delta$ associated with the above theorem is related to the eigenvalue
structure of Q. If p were allowed to vary over the whole space, then the Kantorovich
inequality

$$\frac{(p^T p)^2}{(p^T Q p)(p^T Q^{-1} p)} \geq \frac{4aA}{(a + A)^2} \tag{63}$$

where a and A are, respectively, the smallest and largest eigenvalues of Q, gives
explicitly

$$\delta = \frac{4aA}{(a + A)^2}.$$
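The Kantorovich bound is tight: equality holds for an equal mixture of the eigenvectors belonging to a and A. A small numerical check, assuming NumPy, with an illustrative diagonal Q:

```python
import numpy as np

# Check of the Kantorovich inequality (63) on an illustrative matrix.
Q = np.diag([1.0, 2.0, 5.0, 10.0])        # eigenvalues a = 1, ..., A = 10
a, A = 1.0, 10.0
bound = 4 * a * A / (a + A) ** 2          # 4aA/(a+A)^2 = 40/121

Q_inv = np.linalg.inv(Q)

def ratio(p):
    return (p @ p) ** 2 / ((p @ Q @ p) * (p @ Q_inv @ p))

# The inequality holds for every p ...
rng = np.random.default_rng(1)
assert all(ratio(rng.standard_normal(4)) >= bound - 1e-12 for _ in range(1000))

# ... with equality for an equal mix of the extreme eigenvectors.
p_worst = np.array([1.0, 0.0, 0.0, 1.0])
print(ratio(p_worst), bound)
```

For `p_worst` the ratio is $4 / (11 \cdot 1.1) = 40/121$, exactly the bound.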
When p is restricted to the nullspace of $B^T$, the corresponding value of $\delta$ is larger.
In some special cases it is possible to obtain a fairly explicit estimate of $\delta$. Suppose,
for example, that the subspace N were the subspace spanned by m eigenvectors of
Q. Then the subspace in which p is allowed to vary is the space orthogonal to N
and is thus, in this case, the space generated by the other n − m eigenvectors of Q.
In this case, since for p in $N^\perp$ (the space orthogonal to N) both $Qp$ and $Q^{-1}p$ are
also in $N^\perp$, the ratio $\delta$ satisfies

$$\delta = \min_{p \in N^\perp} \frac{(p^T p)^2}{(p^T Q p)(p^T Q^{-1} p)} = \frac{4aA}{(a + A)^2}$$
where now a and A are, respectively, the smallest and largest of the n − m eigen-
values of Q corresponding to $N^\perp$. Thus the convergence ratio (58) reduces to the
familiar form

$$E(x_{k+1}) \leq \left(\frac{A - a}{A + a}\right)^2 E(x_k)$$

where a and A are these special eigenvalues.
where a and A are these special eigenvalues. Thus, if B, or equivalently N , is chosen
to include the eigenvectors corresponding to the most undesirable eigenvalues of
Q, the convergence rate of the combined method will be quite attractive.
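This effect is easy to observe numerically. In the sketch below (NumPy assumed; the data and names are illustrative), Q has eigenvalues 1, 2, 100, 1000 and B spans the eigenvectors of the two worst eigenvalues, so the rate is governed by a = 1, A = 2 alone and the per-step ratio should never exceed $((2-1)/(2+1))^2 = 1/9$:

```python
import numpy as np

# N = span of the eigenvectors for the two worst eigenvalues of Q.
Q = np.diag([1.0, 2.0, 100.0, 1000.0])
b = np.zeros(4)                          # take x* = 0, so E(x) = 1/2 x^T Q x
B = np.eye(4)[:, 2:]                     # eigenvectors for 100 and 1000

def E(x):
    return 0.5 * x @ Q @ x

def step(x):
    g = Q @ x - b
    u = -np.linalg.solve(B.T @ Q @ B, B.T @ g)
    z = x + B @ u
    p = b - Q @ z
    return z + (p @ p) / (p @ Q @ p) * p

x = np.ones(4)
rates = []
for _ in range(10):
    x_next = step(x)
    rates.append(E(x_next) / E(x))       # per-step ratio E(x_{k+1}) / E(x_k)
    x = x_next
print(max(rates))                        # stays below ((A - a)/(A + a))^2 = 1/9
```

Plain steepest descent on the same Q would face the full condition number, with a worst-case ratio of roughly $((1000-1)/(1000+1))^2 \approx 0.996$ per step.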
Applications
The combination of steepest descent and Newton’s method can be applied usefully
in a number of important situations. Suppose, for example, we are faced with a
problem of the form
$$\text{minimize} \quad f(x, y)$$

where $x \in E^n$, $y \in E^m$, and where the second partial derivatives with respect to x
are easily computable but those with respect to y are not. We may then employ
Newton steps with respect to x and steepest descent with respect to y.
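A minimal sketch of this splitting, assuming NumPy; the objective f, the fixed step size s, and all names here are illustrative assumptions (in practice the steepest-descent step in y would use a line search):

```python
import numpy as np

# Hypothetical smooth convex objective with a cheap Hessian in x only:
#   f(x, y) = 1/2 x^T Qx x + x^T y + sum(cosh(y) - 1)
Qx = np.array([[4.0, 1.0], [1.0, 3.0]])  # Hessian of f with respect to x

def f(x, y):
    return 0.5 * x @ Qx @ x + x @ y + np.sum(np.cosh(y) - 1.0)

def grad_x(x, y):
    return Qx @ x + y                    # partial gradient in x

def grad_y(x, y):
    return x + np.sinh(y)                # partial gradient in y

def hess_x(x, y):
    return Qx                            # exact and easy to compute here

x, y = np.ones(2), np.ones(2)
s = 0.2                                  # fixed steepest-descent step in y
for _ in range(200):
    x = x - np.linalg.solve(hess_x(x, y), grad_x(x, y))  # Newton step in x
    y = y - s * grad_y(x, y)                             # gradient step in y
print(np.linalg.norm(grad_x(x, y)), np.linalg.norm(grad_y(x, y)))
```

Since f here is quadratic in x for fixed y, each Newton step minimizes over x exactly, and the slow steepest-descent dynamics are confined to the y variables.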