Optimization: an Overview
Moritz Diehl,
Optimization in Engineering Center (OPTEC) & ESAT
K.U. Leuven, Belgium
(some slide material was provided by W. Bangerth, K. Mombaur,
and P. Riede)
Overview of presentation
Optimization: basic definitions and concepts
Introduction to classes of optimization problems
Introduction to Newton type optimization algorithms
What is optimization?
Optimization = search for the best solution
in mathematical terms:
minimization or maximization of an objective function f(x)
depending on variables x, subject to constraints
Equivalence of maximization and minimization problems:
(from now on only minimization)
max f(x) = −min (−f(x)), attained at the same point x*
[Figure: graph of f(x) with maximum at x*, and of −f(x) with minimum at the same x*]
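The equivalence can be checked numerically: maximizing f over a grid returns the same point as minimizing −f. A small sketch (the concave function f(x) = −(x − 1)² is an arbitrary illustrative choice):

```python
# Maximizing f is the same as minimizing -f: same optimizer x*.
def f(x):
    return -(x - 1.0) ** 2  # concave, maximum at x* = 1

xs = [i * 0.01 for i in range(-200, 201)]     # grid on [-2, 2]
x_max = max(xs, key=f)                        # argmax of f
x_min = min(xs, key=lambda x: -f(x))          # argmin of -f
print(x_max == x_min)  # True: both pick the same grid point near 1
```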
Constrained optimization
Often the variable x must satisfy certain constraints, e.g.:
x ≥ 0
x1² + x2² = C
General formulation:
min f(x)  subject to  h(x) = 0,  g(x) ≥ 0
f: objective function / cost function
h: equality constraints
g: inequality constraints
Simple example: Ball hanging on a spring
To find position at rest,
minimize potential energy!
min_{x∈R²}  x1² + x2² + m x2
(spring: x1² + x2²,  gravity: m x2)
subject to  1 + x1 + x2 ≥ 0
and  3 − x1 + x2 ≥ 0
Feasible set
Feasible set = collection of all
points that satisfy all constraints:
Example feasible set is intersection
of grey and blue area
x2 ≥ 0
1 − x1² − x2² ≥ 0
i.e. the inequality constraints
g1(x) := x2 ≥ 0
g2(x) := 1 − x1² − x2² ≥ 0
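A point is feasible exactly when it satisfies every constraint; a small check for the two inequalities above (the example set is the upper half of the unit disc):

```python
# Feasibility test for the example set: x2 >= 0 and 1 - x1^2 - x2^2 >= 0.
def is_feasible(x1, x2, tol=1e-12):
    return x2 >= -tol and 1.0 - x1 ** 2 - x2 ** 2 >= -tol

print(is_feasible(0.0, 0.5))   # True: inside the half-disc
print(is_feasible(0.0, -0.5))  # False: violates x2 >= 0
print(is_feasible(2.0, 0.0))   # False: outside the unit disc
```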
Local and global optima
[Figure: graph of f(x) over x with two local minima, one of which is the global minimum]
Derivatives
First and second derivatives of the objective function or the
constraints play an important role in optimization
The first order derivatives are called the gradient (of the respective function),
and the second order derivatives are called the Hessian matrix.
Optimality conditions (unconstrained)
min f(x),  x ∈ Rⁿ
Assume that f is twice differentiable.
We want to test a point x* for local
optimality.
necessary condition:
∇f(x*) = 0 (stationarity)
sufficient condition:
x* stationary and ∇²f(x*) positive definite
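Both conditions can be checked numerically. A sketch with finite-difference gradients for f(x) = x1² + x2² at its minimizer x* = (0, 0) (the function and point are illustrative choices, not from the slides):

```python
# Check stationarity (grad = 0) at a candidate point via central
# finite differences, for f(x) = x1^2 + x2^2 with minimizer (0, 0).
def f(x):
    return x[0] ** 2 + x[1] ** 2

def grad(f, x, h=1e-6):
    g = []
    for i in range(len(x)):
        xp, xm = list(x), list(x)
        xp[i] += h
        xm[i] -= h
        g.append((f(xp) - f(xm)) / (2 * h))
    return g

x_star = [0.0, 0.0]
stationary = all(abs(gi) < 1e-6 for gi in grad(f, x_star))

# The Hessian of this f is 2*I; for a symmetric 2x2 matrix, positive
# definiteness <=> H[0][0] > 0 and det(H) > 0 (Sylvester's criterion).
H = [[2.0, 0.0], [0.0, 2.0]]
pos_def = H[0][0] > 0 and H[0][0] * H[1][1] - H[0][1] * H[1][0] > 0
print(stationary, pos_def)  # True True: x* is a local minimum
```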
Types of stationary points
(a)-(c): x* is stationary: ∇f(x*) = 0
∇²f(x*) positive definite: local minimum
∇²f(x*) negative definite: local maximum
∇²f(x*) indefinite: saddle point
Ball on a spring without constraints
min_{x∈R²}  x1² + x2² + m x2
[Figure: contour lines of f(x) with the gradient vector]
∇f(x) = (2x1, 2x2 + m)
unconstrained minimum:
0 = ∇f(x*)  ⇒  (x1*, x2*) = (0, −m/2)
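Following the negative gradient downhill reproduces the analytic minimum (0, −m/2). A minimal sketch with m = 1 (an arbitrary choice of mass) and a fixed step length:

```python
# Gradient descent on f(x) = x1^2 + x2^2 + m*x2, with gradient
# (2*x1, 2*x2 + m); the unconstrained minimum is (0, -m/2).
m = 1.0
x = [3.0, 2.0]      # arbitrary start value
alpha = 0.1         # fixed step length
for _ in range(200):
    g = [2 * x[0], 2 * x[1] + m]
    x = [x[0] - alpha * g[0], x[1] - alpha * g[1]]
print(x)  # close to [0.0, -0.5], i.e. (0, -m/2)
```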
Sometimes there are many local minima
e.g. potential energy
of macromolecule
Global optimization is a very hard issue - most algorithms find only
the next local minimum. But there is a favourable special case...
Convex functions
Convex: all connecting lines lie on or above the graph
Non-convex: some connecting lines lie below the graph
Convex feasible sets
Convex: all connecting lines between feasible points lie inside the feasible set
Non-convex: some connecting line between two feasible points leaves the feasible set
Convex problems
A problem is convex if
f(x) is convex and the feasible set is convex
One can show:
For convex problems, every local minimum is also a global minimum.
It is sufficient to find local minima!
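The "connecting lines above the graph" definition can be tested by sampling (a heuristic check, not a proof; the functions x² and sin are illustrative choices):

```python
import math

# Sampled convexity test: f(t*a + (1-t)*b) <= t*f(a) + (1-t)*f(b)
# for many pairs (a, b) and weights t.  Passing all samples does not
# prove convexity, but a single failure disproves it.
def looks_convex(f, lo, hi, n=50):
    pts = [lo + (hi - lo) * i / n for i in range(n + 1)]
    for a in pts:
        for b in pts:
            for t in (0.25, 0.5, 0.75):
                xm = t * a + (1 - t) * b
                if f(xm) > t * f(a) + (1 - t) * f(b) + 1e-12:
                    return False
    return True

print(looks_convex(lambda x: x * x, -2, 2))  # True: x^2 is convex
print(looks_convex(math.sin, -2, 2))         # False: sin is not
```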
Characteristics of optimization problems 1
size / dimension of problem n ,
i.e. number of free variables
continuous or discrete search space
number of minima
Characteristics of optimization problems 2
Properties of the objective function:
type: linear, nonlinear, quadratic ...
smoothness: continuity, differentiability
Existence of constraints
Properties of constraints:
equalities / inequalities
type: simple bounds, linear, nonlinear,
dynamic equations (optimal control)
smoothness
Overview of presentation
Optimization: basic definitions and concepts
Introduction to classes of optimization problems
Introduction to Newton type optimization algorithms
Problem Class 1: Linear Programming (LP)
Linear objective,
linear constraints:
Linear Optimization Problem
(convex)
Example: Logistics Problem
shipment of quantities a1, a2, ..., am of a product from m locations,
to be received at n destinations in quantities b1, b2, ..., bn
shipping costs cij
determine the amounts xij
(origin of linear programming in the Second World War)
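The logistics (transportation) problem can be written as an LP; a sketch of the standard formulation, using the quantities named above:

```latex
\min_{x \ge 0} \; \sum_{i=1}^{m}\sum_{j=1}^{n} c_{ij}\, x_{ij}
\quad \text{s.t.} \quad
\sum_{j=1}^{n} x_{ij} = a_i \;\; (i = 1,\dots,m), \qquad
\sum_{i=1}^{m} x_{ij} = b_j \;\; (j = 1,\dots,n)
```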
Problem Class 2: Quadratic Programming (QP)
Quadratic objective and linear constraints:
min_x  ½ xᵀQ x + cᵀx  subject to linear constraints
Quadratic Optimization Problem
(convex, if Q pos. def.)
Example: Markowitz mean-variance portfolio optimization
quadratic objective: portfolio variance (sum of the variances and
covariances of the individual securities)
linear constraints specify a lower bound for the portfolio return
QPs play an important role as subproblems in nonlinear optimization
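One reason QPs make good subproblems: with only equality constraints, the KKT optimality conditions reduce to a single linear system [Q Aᵀ; A 0][x; λ] = [−c; b]. A sketch with made-up numbers (min ½xᵀQx + cᵀx s.t. x1 + x2 = 1) and a hand-rolled dense solver:

```python
# Equality-constrained QP:  min 1/2 x^T Q x + c^T x  s.t.  A x = b.
# The KKT conditions form one linear system in (x, lambda).
def solve_linear(M, rhs):
    """Gaussian elimination with partial pivoting (small dense systems)."""
    n = len(M)
    M = [row[:] + [rhs[i]] for i, row in enumerate(M)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            fac = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= fac * M[col][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

Q = [[2.0, 0.0], [0.0, 2.0]]   # positive definite -> convex QP
c = [-2.0, 0.0]
A = [[1.0, 1.0]]               # one constraint: x1 + x2 = 1
b = [1.0]

KKT = [Q[0] + [A[0][0]],
       Q[1] + [A[0][1]],
       A[0] + [0.0]]
sol = solve_linear(KKT, [-c[0], -c[1], b[0]])
x, lam = sol[:2], sol[2]
print(x, lam)  # x close to [1, 0], multiplier lam close to 0
```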
Problem Class 3: Nonlinear Programming (NLP)
Nonlinear Optimization Problem
(in general nonconvex)
TODAY'S MAIN TOPIC!
E.g. the famous nonlinear Rosenbrock
function
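The Rosenbrock function in its common textbook form (the 100/1 coefficients are the standard choice, assumed here since the slide shows only a picture) has its minimum at (1, 1) at the end of a curved valley:

```python
# Rosenbrock "banana" function: f(x, y) = 100*(y - x^2)^2 + (1 - x)^2.
# Global minimum f = 0 at (x, y) = (1, 1).
def rosenbrock(x, y):
    return 100.0 * (y - x * x) ** 2 + (1.0 - x) ** 2

print(rosenbrock(1.0, 1.0))  # 0.0 at the minimizer
print(rosenbrock(0.0, 0.0))  # 1.0
```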
Problem Class 4: Non-smooth optimization
the objective function or the constraints are
non-differentiable or not continuous
Problem Class 5: Integer Programming (IP)
Some or all variables are integer
(e.g. linear integer problems)
Special case: combinatorial optimization
problems -- feasible set is finite
Example: traveling salesman problem
determine fastest/shortest round
trip through n locations
Problem Class 6: Optimal Control
Optimization problems including dynamics in the form of
differential equations; the variables are partly
infinite-dimensional (functions of time), so the problem
itself is infinite dimensional
THIS COURSE'S MAIN TOPIC!
Overview of presentation
Optimization: basic definitions and concepts
Introduction to classes of optimization problems
Introduction to Newton type optimization algorithms
Aim of Newton type optimization algorithms
min f(x),  x ∈ Rⁿ
Find a local minimizer x* of f(x), i.e. a point satisfying
∇f(x*) = 0
Derivative based algorithms
Fundamental underlying structure of most algorithms:
choose start value x0
for i = 0, 1, 2, ...:
determine a search (descent) direction p_i
determine a step length α_i
new iterate: x_{i+1} = x_i + α_i p_i
check convergence
Optimization algorithms differ in the choice of p_i and α_i
Basic algorithm:
Search direction:
choose a descent direction p
(f should decrease along p)
Step length:
solve the 1-d minimization approximately,
e.g. satisfy the Armijo condition
Computation of step length
Dream:
exact line search: α_k = argmin_α f(x_k + α p_k)
In practice:
inexact line search: α_k ≈ argmin_α f(x_k + α p_k)
ensure sufficient decrease, e.g. via the Armijo condition
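A minimal backtracking line search enforcing the Armijo condition f(x + αp) ≤ f(x) + c1·α·∇f(x)ᵀp (the constants c1 = 1e-4 and the halving factor are conventional choices, not from the slides):

```python
# Backtracking line search: shrink alpha until the Armijo condition
#   f(x + alpha*p) <= f(x) + c1 * alpha * grad^T p
# guarantees sufficient decrease along the descent direction p.
def armijo_step(f, x, fx, grad, p, alpha0=1.0, c1=1e-4, shrink=0.5):
    slope = sum(g * pi for g, pi in zip(grad, p))  # directional derivative
    assert slope < 0, "p must be a descent direction"
    alpha = alpha0
    while f([xi + alpha * pi for xi, pi in zip(x, p)]) > fx + c1 * alpha * slope:
        alpha *= shrink
    return alpha

# Example: f(x) = x1^2 + x2^2 from x = (1, 1) along p = -grad.
f = lambda x: x[0] ** 2 + x[1] ** 2
x = [1.0, 1.0]
grad = [2.0, 2.0]
p = [-2.0, -2.0]
alpha = armijo_step(f, x, f(x), grad, p)
print(alpha)  # 0.5: the full step overshoots, one halving suffices
```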
How to compute search direction?
We discuss three algorithms:
Steepest descent method
Newton's method
Newton type methods
Algorithm 1: Steepest descent method
Based on a first order Taylor series approximation of the objective function:
f(x_k + p) ≈ f(x_k) + ∇f(x_k)ᵀ p
maximum descent (for a fixed length of p), if p = −∇f(x_k)
Steepest descent method
Choose the steepest descent search direction and perform an (exact) line search:
p_k = −∇f(x_k),   x_{k+1} = x_k − α_k ∇f(x_k)
the search direction is perpendicular to the level sets of f(x)
[Figure: contour lines with gradient directions]
Convergence of steepest descent method
the steepest descent method has linear convergence,
i.e. ‖x_k − x*‖ ≤ C ‖x_{k−1} − x*‖
the gain is a fixed factor C < 1
convergence can be very slow if C is close to 1
If f(x) = xᵀA x with A positive definite and λmax, λmin the largest and
smallest eigenvalues of A, one can show that
C = (λmax − λmin) / (λmax + λmin)
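The bound can be observed numerically. A sketch on the quadratic f(x) = ½xᵀAx with A = diag(1, 10) (an arbitrary test matrix), where the exact line-search step has the closed form α = gᵀg / gᵀAg and the bound gives C = (10 − 1)/(10 + 1) ≈ 0.818 in the A-norm:

```python
# Steepest descent with exact line search on f(x) = 1/2 x^T A x,
# A = diag(1, 10).  For quadratics the exact step is
# alpha = (g^T g)/(g^T A g), and the error in the A-norm contracts by
# at most C = (l_max - l_min)/(l_max + l_min) = 9/11 per iteration.
a = [1.0, 10.0]                    # eigenvalues of the diagonal A
C = (a[1] - a[0]) / (a[1] + a[0])  # = 9/11 ~ 0.818

def a_norm(x):
    return (a[0] * x[0] ** 2 + a[1] * x[1] ** 2) ** 0.5

x = [10.0, 1.0]                    # start value; the minimizer is x* = 0
ratios = []
for _ in range(20):
    g = [a[0] * x[0], a[1] * x[1]]            # gradient A x
    gg = g[0] ** 2 + g[1] ** 2
    gAg = a[0] * g[0] ** 2 + a[1] * g[1] ** 2
    alpha = gg / gAg                           # exact line search step
    err_before = a_norm(x)
    x = [x[0] - alpha * g[0], x[1] - alpha * g[1]]
    ratios.append(a_norm(x) / err_before)

print(max(ratios) <= C + 1e-9)  # True: every step respects the bound
```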
Example - steepest descent method
f(x, y) = 4 (x − y²)² + (1/100) x² + (1/100) y²
"banana valley" function,
global minimum at x = y = 0
Example - steepest descent method
[Plot: error ‖x_k − x*‖ over iterations k]
Convergence of the steepest descent method:
needs almost 35,000 iterations to come closer than 0.1 to the solution
mean value of the convergence constant C: 0.99995
at (x=4, y=2): λ1 = 0.1, λ2 = 268, hence C = (268 − 0.1)/(268 + 0.1) ≈ 0.9993
Algorithm 2: Newton's Method
Based on a second order Taylor series approximation of f(x):
f(x_k + p) ≈ f(x_k) + ∇f(x_k)ᵀ p + ½ pᵀ ∇²f(x_k) p
Newton direction: p_k = −∇²f(x_k)⁻¹ ∇f(x_k)
Visualization of Newton's method
p_k minimizes the quadratic approximation of the objective:
Q_k(p) = f(x_k) + ∇f(x_k)ᵀ p + ½ pᵀ ∇²f(x_k) p
[Figure: gradient direction vs. Newton direction on the contour lines]
if the quadratic model is good, then take the full step with α_k = 1
Convergence of Newton's method
Newton's method has quadratic convergence,
i.e. ‖x_{k+1} − x*‖ ≤ C ‖x_k − x*‖²
This is very fast close to a solution:
correct digits double in each iteration!
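Quadratic convergence is easy to observe in one dimension. A sketch minimizing f(x) = x² + eˣ (an arbitrary smooth strictly convex function), where the Newton step is x − f′(x)/f″(x):

```python
import math

# Newton's method in 1-D: iterate x <- x - f'(x)/f''(x) to find the
# stationary point of f(x) = x^2 + exp(x), with f' = 2x + e^x and
# f'' = 2 + e^x > 0 (strictly convex, unique minimizer).
x = 1.0
grads = []
for _ in range(8):
    g = 2 * x + math.exp(x)   # f'(x)
    h = 2 + math.exp(x)       # f''(x)
    x = x - g / h             # full Newton step (alpha = 1)
    grads.append(abs(2 * x + math.exp(x)))

print(x)         # ~ -0.3517, the unique minimizer
print(grads[-1]) # ~ 0: gradient vanishes to machine precision
```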
Example - Newtons method
f(x, y) = 4 (x − y²)² + (1/100) x² + (1/100) y²
"banana valley" function,
global minimum at x = y = 0
Example - Newtons method
[Plot: error ‖x_k − x*‖ over iterations k]
Convergence of Newton's method:
fewer than 25 iterations for an accuracy better than 10^-7!
convergence roughly linear for the first 15-20 iterations, since step length α_k < 1
convergence roughly quadratic for the last iterations, with full step length α_k = 1
Comparison of steepest descent and Newton
[Plot: errors ‖x_k − x*‖ for both methods]
For the banana valley example:
Newton's method is much faster than the steepest descent method (by a
factor of about 1000)
Newton's method is superior due to its higher order of convergence
the steepest descent method converges too slowly for practical
applications
Algorithm 3: Newton type methods
In practice, the evaluation of second derivatives
for the Hessian can be difficult!
Idea: approximate the Hessian matrix ∇²f(x_k) by a matrix B_k:
x_{k+1} = x_k − B_k⁻¹ ∇f(x_k),   B_k ≈ ∇²f(x_k)
often the methods ensure that the approximation B_k is positive
definite
these methods are collectively known as Newton type methods
Newton type variants
Notation: x_{k+1} = x_k − B_k⁻¹ ∇f(x_k)
Steepest Descent: B_k = I
Convergence rate: linear
Newton Method: B_k = ∇²f(x_k)
Convergence rate: quadratic
Newton type variants (continued)
BFGS update (Broyden, Fletcher, Goldfarb, Shanno)
Convergence rate: super-linear
For Least-Squares Problems: Gauss-Newton Method
Convergence rate: linear
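A sketch of Gauss-Newton on a one-parameter least-squares fit: residuals r_i(a) = exp(a·t_i) − y_i, Jacobian entries J_i = t_i·exp(a·t_i), and update a ← a − (JᵀJ)⁻¹Jᵀr. The data are synthetic, generated from a_true = 0.5 (an arbitrary choice):

```python
import math

# Gauss-Newton for min_a 1/2 * sum_i r_i(a)^2, r_i(a) = exp(a*t_i) - y_i.
# The Hessian is approximated by J^T J (second-derivative terms dropped),
# so each step is  a <- a - (J^T J)^{-1} J^T r.
a_true = 0.5
ts = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [math.exp(a_true * t) for t in ts]   # noise-free synthetic data

a = 0.0                                    # start value
for _ in range(20):
    r = [math.exp(a * t) - y for t, y in zip(ts, ys)]
    J = [t * math.exp(a * t) for t in ts]
    JTJ = sum(j * j for j in J)
    JTr = sum(j * ri for j, ri in zip(J, r))
    a = a - JTr / JTJ

print(a)  # ~ 0.5: recovers a_true (zero residuals -> fast convergence)
```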
Summary: Optimization Overview
Optimization problems can be
(un)constrained, (non)convex, (non)linear, (non)smooth,
continuous/integer,(in)finite dimensional, ...
Aim: find local minima of smooth nonlinear problems: ∇f(x*) = 0
Derivative based methods start at an initial guess x0 and iterate
x_{i+1} = x_i + α_i p_i
with search direction p_i and step length α_i.
Three methods:
steepest descent: intuitive, but slow (linear convergence)
Newton's method: very fast (quadratic convergence)
Newton type methods: cheap, fast, and popular, e.g.
BFGS (superlinear)
Gauss-Newton (fast linear convergence)
Attention: Nonlinear vs. Convex Optimization
For nonlinear problems, Newton type algorithms only find local minima, and
"optimal solution" depends on initialization!
Important exception: convex problems
"The great watershed in optimization isn't between linearity and
nonlinearity, but convexity and nonconvexity - R. Tyrrell Rockafellar
Literature
J. Nocedal, S. Wright: Numerical Optimization, Springer, 1999/2006
P. E. Gill, W. Murray, M. H. Wright: Practical Optimization, Academic
Press, 1981
R. Fletcher, Practical Methods of Optimization, Wiley, 1987
D. G. Luenberger: Linear and Nonlinear Programming, Addison-Wesley, 1984