3. Unconstrained Optimization
a. Introduction to Unconstrained Optimization Problems
An unconstrained optimization problem is the fundamental building block of optimization. It involves
finding the minimum or maximum of a function without any restrictions on the values of the decision
variables. The standard form is:
Minimize f(x) for x ∈ Rⁿ
or
Maximize f(x) for x ∈ Rⁿ
where f: Rⁿ → R is the objective function and x is the vector of decision variables.
Why is it important?
1. Theoretical Foundation: The optimality conditions (gradient = 0, Hessian positive definite)
are derived for unconstrained problems. These conditions form the basis for understanding
more complex constrained problems.
2. Algorithmic Core: The algorithms developed for unconstrained problems (like Gradient
Descent and Newton's Method) are often adapted and extended to handle constraints via
penalty methods, barrier methods, or Lagrange multipliers.
3. Direct Applications: Many problems can be naturally formulated as unconstrained, such as:
o Least Squares Regression: Minimizing the sum of squared errors.
o Maximum Likelihood Estimation: Maximizing the likelihood function to find model
parameters.
o Neural Network Training: Minimizing a loss function (e.g., Mean Squared Error,
Cross-Entropy) with respect to the network's weights and biases.
The goal is to find a local minimizer x*, a point such that f(x*) ≤ f(x) for all x in a neighborhood
around x*. If this holds for all x ∈ Rⁿ, then x* is a global minimizer.
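To make the least-squares application above concrete, here is a minimal sketch (with made-up data and illustrative names) that minimizes a sum of squared errors over R² and inspects the gradient at the solution:
python
import numpy as np
from scipy.optimize import minimize

# Hypothetical data: points scattered around the line y = 2x + 1
rng = np.random.default_rng(0)
x_data = np.linspace(0.0, 1.0, 20)
y_data = 2.0 * x_data + 1.0 + 0.05 * rng.standard_normal(20)

def sse(w):
    # Objective f(w): sum of squared errors of the line y = w[0]*x + w[1]
    return np.sum((w[0] * x_data + w[1] - y_data) ** 2)

# Unconstrained: w ranges over all of R^2
res = minimize(sse, x0=np.array([0.0, 0.0]))
print(res.x)    # close to [2, 1]
print(res.jac)  # gradient at the minimizer, approximately zero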
b. One-Dimensional Optimization Algorithms (Line Search Methods)
These algorithms find the minimum of a function of a single variable, f(α). They are crucial as a sub-
problem in multidimensional optimization, where we often need to find the best step size α to take
in a given direction p (i.e., minimize f(xₖ + αpₖ)).
1. Golden Section Search
Purpose: A robust, derivative-free method for finding a minimum of a unimodal function (a
function with a single minimum in the interval) within a specified interval [a, b].
Intuition: It works by recursively narrowing the interval that contains the minimum by
comparing function values at interior points. The key is to choose these interior points in
such a way that one of them can be reused in the next iteration, minimizing the number of
function evaluations.
The Golden Ratio: The algorithm uses the golden ratio φ = (1 + √5)/2 ≈ 1.618 to determine
the interior points.
o c = b - (b - a)/φ
o d = a + (b - a)/φ
Algorithm Steps:
1. Start with an interval [a, b] known to contain a minimum.
2. Calculate two interior points: c and d.
3. If f(c) < f(d), the minimum must lie in [a, d]. Set b = d.
4. Else, the minimum lies in [c, b]. Set a = c.
5. Repeat until the interval length is sufficiently small.
Diagram: Golden Section Search Iteration
Pros & Cons:
o ✅ Pros: Very reliable; always converges for unimodal functions; doesn't require
derivatives.
o ❌ Cons: Convergence is linear (slow); only for 1D problems; requires a bracketing
interval.
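A minimal sketch of the steps above, assuming f is unimodal on [a, b] (function names and tolerance are illustrative):
python
import math

def golden_section_search(f, a, b, tol=1e-6):
    # Repeatedly narrow [a, b]; one interior point (and its value) is reused each iteration
    phi = (1 + math.sqrt(5)) / 2
    c, d = b - (b - a) / phi, a + (b - a) / phi
    fc, fd = f(c), f(d)
    while (b - a) > tol:
        if fc < fd:              # minimum lies in [a, d]
            b, d, fd = d, c, fc  # old c becomes the new d
            c = b - (b - a) / phi
            fc = f(c)            # only one new function evaluation
        else:                    # minimum lies in [c, b]
            a, c, fc = c, d, fd  # old d becomes the new c
            d = a + (b - a) / phi
            fd = f(d)
    return (a + b) / 2

# Example: minimum of (x - 2)^2 on [0, 5]
print(golden_section_search(lambda x: (x - 2) ** 2, 0.0, 5.0))  # ≈ 2.0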
2. Brent's Method
Purpose: A more sophisticated 1D minimization algorithm that combines the reliability of
bracketing (like Golden Section) with the speed of interpolation-based steps, without
requiring derivatives.
Intuition: For minimization, Brent's method combines two techniques:
1. Successive Parabolic Interpolation: Fits a parabola through the three best points and
jumps to its minimum. Very fast when the function is smooth. (Inverse quadratic
interpolation plays the analogous role in Brent's root-finding method.)
2. Golden Section Search: Used as a reliable fallback when the interpolation step would be
slow or unreliable (e.g., when the points are nearly collinear).
How it works: The algorithm tries the fast interpolation step first. If the predicted minimum
is within the current bracketing interval and promises a good convergence rate, it uses that
prediction. Otherwise, it takes a Golden Section step to ensure progress.
Pros & Cons:
o ✅ Pros: Superfast convergence (superlinear) for well-behaved functions; reliable due
to the fallback mechanism; considered one of the best general-purpose 1D
minimizers.
o ❌ Cons: More complex to implement; logic for switching methods is heuristic.
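In practice Brent's method is rarely hand-coded; SciPy's minimize_scalar exposes it directly. A small usage sketch (the function and bracket below are illustrative):
python
from scipy.optimize import minimize_scalar

f = lambda x: (x - 2.0) ** 2 + 1.0

# method='brent': parabolic interpolation with golden-section fallback steps
res = minimize_scalar(f, bracket=(0, 1, 4), method='brent')
print(res.x, res.fun, res.nfev)  # ≈ 2.0, 1.0, and the number of function evaluations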
c. Multidimensional Optimization Algorithms
These are the workhorses for most practical problems. They iteratively generate a sequence of
points x₀, x₁, x₂, ... that (hopefully) converges to a local minimizer x*.
General Iterative Scheme:
xₖ₊₁ = xₖ + αₖ pₖ
where:
pₖ is the search direction.
αₖ is the step length or learning rate, found via a 1D line search (e.g., Golden Section, Brent)
along the direction pₖ.
The choice of the search direction pₖ defines the algorithm.
1. Method of Steepest Descent
Search Direction: pₖ = -∇f(xₖ)
o The most obvious choice: move in the direction where the function decreases most
rapidly locally.
Algorithm: At each iteration, calculate the gradient and take a step in the negative gradient
direction: xₖ₊₁ = xₖ - αₖ ∇f(xₖ).
Diagram: Path of Steepest Descent
The path often exhibits a "zig-zag" pattern, especially in narrow valleys, leading to slow convergence.
Pros & Cons:
o ✅ Pros: Very simple to understand and implement; guaranteed convergence under
mild conditions; only requires first derivatives (gradient).
o ❌ Cons: Convergence is very slow (linear convergence rate); performance is poor for
ill-conditioned problems (where the Hessian has eigenvalues of very different
magnitudes); zig-zag behavior is inefficient.
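A minimal steepest-descent loop with a 1D line search for αₖ (the test function, gradient, and tolerances below are illustrative):
python
import numpy as np
from scipy.optimize import minimize_scalar

def steepest_descent(f, grad, x0, tol=1e-6, max_iter=1000):
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:   # stop when the gradient is nearly zero
            break
        p = -g                         # search direction: negative gradient
        # Line search: minimize phi(alpha) = f(x + alpha * p) along p
        alpha = minimize_scalar(lambda a: f(x + a * p)).x
        x = x + alpha * p
    return x

# Example: an elongated quadratic bowl, where the zig-zag behaviour shows up
f = lambda x: x[0] ** 2 + 10.0 * x[1] ** 2
grad = lambda x: np.array([2.0 * x[0], 20.0 * x[1]])
print(steepest_descent(f, grad, [3.0, 1.0]))  # ≈ [0, 0]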
2. Newton's Method
Search Direction: pₖ = - (H(xₖ))⁻¹ ∇f(xₖ)
o This direction is obtained by minimizing a quadratic approximation (second-order
Taylor expansion) of f at xₖ.
Algorithm: xₖ₊₁ = xₖ - (H(xₖ))⁻¹ ∇f(xₖ)
Intuition: Newton's method uses both gradient (slope) and Hessian (curvature) information.
It doesn't just go downhill; it assumes the function looks like a bowl and jumps directly to the
bottom of that bowl.
Diagram: Newton's Method Step
Pros & Cons:
o ✅ Pros: Extremely fast convergence (quadratic convergence rate) near the optimum
if the initial guess is good.
o ❌ Cons: Requires calculation and inversion of the Hessian matrix H, which is
computationally expensive O(n³) for n variables; requires second derivatives; if the
Hessian is not positive definite, the direction pₖ might not be a descent direction;
can diverge if started far from a minimum.
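A minimal Newton iteration, solving H p = -∇f rather than explicitly inverting H (the 2D Rosenbrock derivatives below are written out by hand; no damping or line search is applied):
python
import numpy as np

def newton_method(grad, hess, x0, tol=1e-8, max_iter=50):
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            break
        p = np.linalg.solve(hess(x), -g)  # solve H p = -g instead of forming H^{-1}
        x = x + p                          # full Newton step (can misbehave far from x*)
    return x

# Example: 2D Rosenbrock function, minimizer at (1, 1)
grad = lambda x: np.array([-400.0 * x[0] * (x[1] - x[0] ** 2) - 2.0 * (1 - x[0]),
                           200.0 * (x[1] - x[0] ** 2)])
hess = lambda x: np.array([[1200.0 * x[0] ** 2 - 400.0 * x[1] + 2.0, -400.0 * x[0]],
                           [-400.0 * x[0], 200.0]])
print(newton_method(grad, hess, [-1.2, 1.0]))  # ≈ [1, 1]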
3. Conjugate Gradient Method
Purpose: A method designed to accelerate convergence for large problems where storing or
inverting the Hessian (like in Newton's method) is too expensive. It's often used for very
high-dimensional problems (e.g., n > 10,000).
Intuition: Instead of using the negative gradient (which can lead to zig-zagging), it chooses a
search direction that is conjugate to the previous search direction. This means that each new
direction is chosen to not spoil the minimization achieved by the previous direction.
Search Direction: The direction is updated iteratively:
pₖ = -∇f(xₖ) + βₖ pₖ₋₁
where βₖ is a scalar (e.g., computed by the Fletcher-Reeves or Polak-Ribière formula) that
ensures conjugacy.
Pros & Cons:
o ✅ Pros: Faster convergence than Steepest Descent; doesn't require storing a matrix
(like the Hessian), only the previous gradient and direction; more memory efficient
than Newton's method.
o ❌ Cons: Convergence is slower than Newton's method (superlinear at best for general
functions; on a quadratic it terminates in at most n steps with exact line search); can be more
sensitive to the accuracy of the line search.
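A minimal nonlinear CG sketch using the Fletcher-Reeves formula for βₖ (restarts and line-search safeguards are omitted; the test function is illustrative):
python
import numpy as np
from scipy.optimize import minimize_scalar

def fletcher_reeves_cg(f, grad, x0, tol=1e-6, max_iter=1000):
    x = np.asarray(x0, dtype=float)
    g = grad(x)
    p = -g                                # first direction: steepest descent
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        alpha = minimize_scalar(lambda a: f(x + a * p)).x  # 1D line search
        x = x + alpha * p
        g_new = grad(x)
        beta = (g_new @ g_new) / (g @ g)  # Fletcher-Reeves: ||g_{k+1}||^2 / ||g_k||^2
        p = -g_new + beta * p             # new direction, conjugate to the previous one
        g = g_new
    return x

# Example: a 2D quadratic; with exact line searches CG finishes in at most 2 iterations
f = lambda x: x[0] ** 2 + 10.0 * x[1] ** 2 + x[0] * x[1]
grad = lambda x: np.array([2.0 * x[0] + x[1], 20.0 * x[1] + x[0]])
print(fletcher_reeves_cg(f, grad, [3.0, 1.0]))  # ≈ [0, 0]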
d. Implementation using NumPy and SciPy
The scipy.optimize module provides implementations for all these algorithms, saving you from
writing them from scratch.
Key Function: scipy.optimize.minimize(fun, x0, method, jac, hess, ...)
fun: The objective function to minimize.
x0: Initial guess (ndarray).
method: The algorithm to use.
jac: (Optional) Function to compute the Jacobian (gradient).
hess: (Optional) Function to compute the Hessian matrix.
Code Examples:
python
import numpy as np
from scipy.optimize import minimize

# Define the Rosenbrock function (a classic test problem)
def rosen(x):
    return np.sum(100.0 * (x[1:] - x[:-1]**2.0)**2.0 + (1 - x[:-1])**2.0)

x0 = np.array([-1.2, 1.0])  # Initial guess

# 1. Using Nelder-Mead (Simplex, derivative-free)
result_nm = minimize(rosen, x0, method='Nelder-Mead')
print("Nelder-Mead result:", result_nm.x)

# 2. Using BFGS (a Quasi-Newton method that approximates the Hessian)
result_bfgs = minimize(rosen, x0, method='BFGS')
print("BFGS result:", result_bfgs.x)

# 3. Using Newton-CG (uses Newton's direction with CG to solve H p = -∇f)
# Newton-CG requires the gradient; the Hessian (or Hessian-vector products) is optional
def rosen_der(x):
    # Analytical gradient of the Rosenbrock function
    xm = x[1:-1]
    xm_m1 = x[:-2]
    xm_p1 = x[2:]
    der = np.zeros_like(x)
    der[1:-1] = 200*(xm - xm_m1**2) - 400*(xm_p1 - xm**2)*xm - 2*(1 - xm)
    der[0] = -400*x[0]*(x[1] - x[0]**2) - 2*(1 - x[0])
    der[-1] = 200*(x[-1] - x[-2]**2)
    return der

result_ncg = minimize(rosen, x0, method='Newton-CG', jac=rosen_der)
print("Newton-CG result:", result_ncg.x)
PART 2: HIGH-MARK ESSAY QUESTIONS & ANSWERS
Essay Question 1 (25 Marks)
"Compare and contrast the Method of Steepest Descent, Newton's Method, and the Conjugate
Gradient method for multidimensional unconstrained optimization. Your answer must detail the
mathematical formulation of the search direction for each, their convergence properties,
computational requirements, and practical suitability. Use diagrams where appropriate to illustrate
the behavior of each algorithm."
Answer:
The choice of algorithm for unconstrained optimization is a trade-off between computational cost,
convergence speed, and robustness. The Method of Steepest Descent, Newton's Method, and the
Conjugate Gradient method represent key points on this spectrum, each with distinct advantages
and limitations.
I. Method of Steepest Descent: The Simple Workhorse
The search direction for Steepest Descent is intuitively derived: at each point xₖ, it moves in the
direction of the negative gradient pₖ = -∇f(xₖ). This is the direction of the locally steepest decrease in
the function value.
Mathematically, the update is xₖ₊₁ = xₖ - αₖ ∇f(xₖ), where αₖ is found via a line search. Its
convergence is guaranteed under mild conditions but is linear. This rate is often unacceptably slow,
especially for ill-conditioned problems where the Hessian matrix has a high condition number (a
large ratio of its largest to smallest eigenvalue). In such landscapes, like the Rosenbrock banana
function, the path of Steepest Descent exhibits a characteristic zig-zag pattern (as shown in Diagram
1), as each new gradient direction is orthogonal to the previous one, leading to inefficient progress.
Computationally, it is cheap per iteration, requiring only the calculation of the gradient vector O(n). It
does not require second derivatives or matrix storage. Its practicality is limited to small, simple
problems or as a component in more complex algorithms where a cheap, rough step is needed.
II. Newton's Method: The Second-Order Benchmark
Newton's Method uses a more sophisticated search direction: pₖ = - (H(xₖ))⁻¹ ∇f(xₖ). This direction is
derived by minimizing the second-order Taylor series approximation of f at xₖ. It doesn't just go
downhill; it assumes the function is quadratic and jumps directly to the estimated minimum of that
quadratic.
The convergence rate of Newton's Method is quadratic near a local minimizer, meaning the number
of correct digits roughly doubles with each iteration. This is dramatically faster than the linear rate of
Steepest Descent. However, this speed comes at a high computational cost. Each iteration requires:
1. Computation of the Hessian matrix H (O(n²) operations).
2. Solving the linear system H p = -∇f (O(n³) operations for factorization).
This makes it prohibitively expensive for high-dimensional problems. Furthermore, it is less
robust. If the initial guess is poor or the Hessian is not positive definite, the method may not
converge. It requires careful implementation to handle indefinite Hessians.
III. Conjugate Gradient Method: The Smart Middle Ground
The Conjugate Gradient (CG) method seeks to overcome the limitations of the previous two. Its
search direction is updated as pₖ = -∇f(xₖ) + βₖ pₖ₋₁. The scalar βₖ (calculated by, e.g., the Fletcher-
Reeves formula) ensures that the new direction is conjugate to the previous one, meaning it
preserves the minimization achieved in previous steps and prevents the zig-zagging of Steepest
Descent.
For quadratic functions in n dimensions, CG converges in at most n iterations with exact line
searches (in exact arithmetic it behaves like a direct method). For
general non-linear functions, its convergence is superlinear. While slower than Newton's quadratic
rate, it is significantly faster than linear convergence.
Computationally, CG is very efficient. It requires only O(n) storage (it needs to store the previous
gradient and direction vectors, not the Hessian) and O(n) operations per iteration. This makes it the
algorithm of choice for very large-scale problems (e.g., n > 10,000) where storing an n x n Hessian is
impossible. Its main drawback is that it can be more sensitive to the accuracy of the line search
compared to other methods.
IV. Summary and Practical Suitability
Method | Search Direction pₖ | Convergence Rate | Cost per Iteration | Storage | Best For
Steepest Descent | -∇f(xₖ) | Linear | O(n) | O(n) | Simple, small problems; educational purposes.
Newton's Method | -H⁻¹(xₖ) ∇f(xₖ) | Quadratic | O(n³) | O(n²) | Medium-sized problems (n < 1000) with good Hessians.
Conjugate Gradient | -∇f(xₖ) + βₖ pₖ₋₁ | Superlinear | O(n) | O(n) | Very large-scale problems (n very large).
In practice, Quasi-Newton methods like BFGS and L-BFGS, which build an approximation of the
Hessian to achieve a superlinear convergence rate without the cost of calculating the exact Hessian,
are often the most popular choice for general-purpose, medium-sized optimization. They effectively
bridge the gap between the robustness of Steepest Descent and the speed of Newton's Method.
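As a quick illustration of that quasi-Newton route, switching SciPy's solver is a one-line change (Rosenbrock example, using SciPy's built-in rosen helpers):
python
import numpy as np
from scipy.optimize import minimize, rosen, rosen_der

x0 = np.array([-1.2, 1.0])

# L-BFGS-B stores only a few recent vectors instead of a full n x n Hessian
# approximation, which keeps memory at O(n) and scales to very large problems.
result = minimize(rosen, x0, method='L-BFGS-B', jac=rosen_der)
print(result.x)  # ≈ [1, 1]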
Essay Question 2 (25 Marks)
"The line search is a critical sub-problem in many multidimensional optimization algorithms.
Explain the role of line search, and critically evaluate the Golden Section Search and Brent's
algorithms for solving it. Discuss the circumstances under which one would be preferred over the
other, and how their performance impacts the overall efficiency of the multidimensional
optimizer."
Answer:
Multidimensional optimization algorithms like Steepest Descent and Conjugate Gradient follow a
two-step iterative process: 1) choose a search direction pₖ, and 2) determine how far to move in that
direction. This second step, finding the step length αₖ that (approximately) minimizes φ(α) = f(xₖ +
αpₖ), is known as the line search. Its effective solution is paramount to the overall performance and
robustness of the optimization algorithm.
I. The Role of Line Search
The purpose of the line search is not necessarily to find the exact minimizer along the ray, but to find
a step length α that provides a sufficient decrease in the objective function. A good line search
ensures global convergence (convergence from any starting point) and prevents the algorithm from
taking overly large steps that cause divergence or overly small steps that lead to glacial progress. It
transforms a direction-finding algorithm into a complete, convergent minimization routine.
II. Golden Section Search: The Reliable Bracketer
Golden Section Search (GSS) is a derivative-free, bracketing method designed to find the minimum of
a unimodal function within an interval [a, b]. Its core principle is to recursively narrow the interval of
uncertainty by comparing function values at two interior points, c and d, chosen based on the golden
ratio.
GSS is celebrated for its robustness and guaranteed convergence. It will always find a minimum
within the bracketing interval, and its performance is predictable. It does not require any derivative
information, making it applicable to non-smooth functions.
However, this robustness comes at the cost of speed. GSS has a linear convergence rate, meaning
the interval of uncertainty shrinks by a constant factor (1/φ ≈ 0.618) per iteration, so roughly 30
iterations are needed to reduce its length by a factor of 10⁻⁶. Each iteration requires only one new
function evaluation. For expensive objective functions f(x), this can be a significant bottleneck for the overall
optimizer.
III. Brent's Method: The Hybrid Accelerator
Brent's method is a sophisticated hybrid algorithm that aims to achieve the reliability of bracketing
with the speed of polynomial interpolation. For minimization it combines two techniques:
1. Successive Parabolic Interpolation: Fits a parabola through the three best points to predict the minimum.
2. Golden Section Search: Used as a reliable fallback to guarantee progress.
(Inverse quadratic interpolation plays the analogous role in Brent's root-finding method.)
Brent's method intelligently switches between these techniques. It uses the parabolic interpolation
step when the predicted point is within the current bracket and promises fast convergence. If the
interpolation step is deemed unreliable (e.g., the points are nearly collinear, or the prediction is
outside the bracket), it defaults to a step of Golden Section Search.
This strategy gives Brent's method a superlinear convergence rate for well-behaved functions, often
requiring far fewer function evaluations than GSS to achieve the same accuracy. It is widely regarded
as one of the best general-purpose 1D minimizers in practice.
The main drawback is its complexity. The logic for switching between methods is heuristic and more
complicated to implement correctly. However, for the end-user, this complexity is hidden within
robust implementations like scipy.optimize.minimize_scalar(method='brent').
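To make the comparison concrete, both methods are available through minimize_scalar and report their function-evaluation counts; a small illustrative check (the test function is made up):
python
from scipy.optimize import minimize_scalar

phi = lambda a: (a - 2.0) ** 2 + 1.0  # a smooth, unimodal stand-in for f(xk + a*pk)

res_golden = minimize_scalar(phi, bracket=(0, 1, 4), method='golden')
res_brent = minimize_scalar(phi, bracket=(0, 1, 4), method='brent')

# For smooth functions, Brent's method typically needs far fewer evaluations
print('golden:', res_golden.x, 'nfev =', res_golden.nfev)
print('brent :', res_brent.x, 'nfev =', res_brent.nfev)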
IV. Impact on Multidimensional Optimization and Preference
The choice of line search algorithm directly impacts the efficiency of the multidimensional optimizer:
Number of Function Evaluations: This is often the dominant computational cost. A faster line
search like Brent's reduces the total number of f(x) calls dramatically.
Quality of the Step: A more accurate line search (finding a better α) can lead to better search
directions in subsequent iterations, especially for algorithms like Conjugate Gradient which
are sensitive to the line search accuracy.
Circumstances for Preference:
Use Golden Section Search when the objective function is noisy, non-differentiable, or has
discontinuities. Its reliance only on function values and its robustness make it a safe, albeit
slower, choice. It is also useful for validating results from more complex methods.
Use Brent's Method for the vast majority of smooth, well-behaved objective functions. Its
superior speed makes it the default choice in most scientific libraries
(scipy.optimize.minimize_scalar, for example, uses Brent's method by default).
The reduction in function evaluations leads to a significantly more efficient overall
optimization process.
In conclusion, while the multidimensional search direction (e.g., gradient, Newton, conjugate) gets
more attention, the line search is the silent workhorse that ensures progress. The choice between a
robust but slow method like Golden Section and a fast, intelligent hybrid like Brent's is a classic trade-
off between reliability and efficiency, with Brent's method being the preferred choice for most
modern scientific computing applications involving smooth functions.