The Kuhn-Tucker conditions
Hao Zhang
1. General Case
The general form of a constrained optimization problem is:
minimize f(x),   x ∈ R^n
such that g_j(x) ≥ 0,   j = 1, ..., n_g
          h_j(x) = 0,   j = 1, ..., n_e
In general, this problem may have several local minima. Only under
special circumstances are we sure of the existence of a single global minimum.
The necessary conditions for a minimum of the constrained problem are
obtained by using the Lagrange multiplier method.
We start by considering the special case of equality constraints only.
Using the Lagrange multiplier technique, we define the Lagrangian
function
L(x, λ) = f(x) − ∑_{j=1}^{n_e} λ_j h_j(x)
where the λ_j are the unknown Lagrange multipliers.
The necessary conditions for a stationary point are
∂L/∂x_i = ∂f/∂x_i − ∑_{j=1}^{n_e} λ_j ∂h_j/∂x_i = 0,   i = 1, ..., n
∂L/∂λ_j = −h_j(x) = 0,   j = 1, ..., n_e
These conditions, however, apply only at a regular point, that is, at a
point where the gradients of the constraints are linearly independent. If the
constraint gradients are linearly dependent, some constraints can be
removed without affecting the solution. At a regular point, these two sets of
equations provide n + n_e equations for the n_e Lagrange multipliers and the
n coordinates of the stationary point.
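As an illustration of these stationarity conditions, the following sketch solves the system symbolically with SymPy for a small equality-constrained problem of my own choosing (minimize x1² + x2² subject to x1 + x2 = 1); it is not a problem taken from the text:

import sympy as sp

x1, x2, lam = sp.symbols('x1 x2 lam', real=True)
f = x1**2 + x2**2            # illustrative objective
h = x1 + x2 - 1              # single equality constraint h(x) = 0
L = f - lam*h                # Lagrangian L = f - sum_j lam_j*h_j

# the n + n_e stationarity equations dL/dx_i = 0 and dL/dlam_j = 0
eqs = [sp.diff(L, v) for v in (x1, x2, lam)]
print(sp.solve(eqs, [x1, x2, lam], dict=True))   # x1 = x2 = 1/2, lam = 1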
The situation is somewhat more complicated when inequality
constraints are present. To be able to apply the Lagrange multiplier
method we first transform the inequality constraints to equality constraints
by adding slack variables. That is, the inequality constraints are written as
g_j(x) − t_j² = 0,   j = 1, ..., n_g
where tj is a slack variable which measures how far the jth constraint
is from being critical. We can now form a Lagrangian function
L(x, t, λ) = f(x) − ∑_{j=1}^{n_g} λ_j (g_j(x) − t_j²)
Differentiating the Lagrangian function with respect to x, λ, and t, we
obtain
∂L/∂x_i = ∂f/∂x_i − ∑_{j=1}^{n_g} λ_j ∂g_j/∂x_i = 0,   i = 1, ..., n
∂L/∂λ_j = −g_j(x) + t_j² = 0,   j = 1, ..., n_g
∂L/∂t_j = 2 λ_j t_j = 0,   j = 1, ..., n_g
The last two equations imply that when an inequality constraint is
not critical (so that the corresponding slack variable is non-zero) then the
Lagrange multiplier associated with the constraint is zero.
These three equations are the necessary conditions for a stationary
regular point. Note that for inequality constraints a regular point is one
where the gradients of the active constraints are linearly independent.
These conditions are modified slightly to yield the necessary
conditions for a minimum, which are known as the Kuhn-Tucker conditions.
The Kuhn-Tucker conditions may be summarized as follows:
A point x is a local minimum of an inequality-constrained problem
only if a set of nonnegative λj's can be found such that:
1. The stationarity condition
∂L/∂x_i = ∂f/∂x_i − ∑_{j=1}^{n_g} λ_j ∂g_j/∂x_i = 0,   i = 1, ..., n
is satisfied.
2. The corresponding λj is zero if a constraint is not active.
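These two conditions can be checked numerically at a candidate design. The sketch below is a rough helper of my own (finite-difference gradients, ad hoc tolerances), not a routine from the text:

import numpy as np

def num_grad(fun, x, eps=1e-6):
    # central-difference gradient of a scalar function of a vector x
    g = np.zeros(len(x))
    for i in range(len(x)):
        e = np.zeros(len(x)); e[i] = eps
        g[i] = (fun(x + e) - fun(x - e)) / (2*eps)
    return g

def is_kt_point(f, gs, lams, x, tol=1e-4):
    # condition 1: grad f - sum_j lam_j * grad g_j = 0
    resid = num_grad(f, x) - sum(l*num_grad(g, x) for l, g in zip(lams, gs))
    # condition 2: lam_j = 0 whenever g_j is inactive, and all lam_j >= 0
    slack = all(abs(l*g(x)) < tol for l, g in zip(lams, gs))
    return bool(np.all(np.abs(resid) < tol)) and slack and all(l >= -tol for l in lams)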
Figure 1: A geometrical interpretation of the Kuhn-Tucker conditions
for the case of two constraints.
A geometrical interpretation of the Kuhn-Tucker conditions is
illustrated in Fig 1 for the case of two constraints. ∇g1 and ∇g 2 denote
the gradients of the two constraints which are orthogonal to the respective
constraint surfaces. The vector s shows a typical feasible direction that
does not lead immediately to any constraint violation. For the
two-constraint case we get
−∇f = −(λ1 ∇g1 + λ2 ∇g2)
Assume that we want to determine whether point A is a minimum or
not. To improve the design we proceed from A in a direction s that is
usable and feasible.
To be usable, a small move along the direction should decrease the
objective function, so it must form an acute angle with −∇f . To be feasible,
s should form an obtuse angle with −∇g1 and −∇g 2 .
Clearly from Fig. 1, any vector that forms an acute angle with
−∇f will also form an acute angle with either −∇g1 or −∇g2.
Thus, the Kuhn-Tucker conditions mean that no feasible design with a
reduced objective function can be found in the neighborhood of A.
Mathematically, the condition that a direction s be feasible is written
as
s^T ∇g_j ≥ 0,   j ∈ I_A
where I_A is the set of active constraints. Equality is permitted only for linear
or concave constraints. The condition for a usable direction (one that
decreases the objective function) is
s^T ∇f < 0
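Both direction conditions are straightforward to test numerically at a given design; the gradients and the trial direction below are illustrative numbers only:

import numpy as np

grad_f  = np.array([1.0, 2.0])               # gradient of the objective (illustrative)
grad_gs = [np.array([1.0, 0.0]),             # gradients of the active constraints
           np.array([0.0, 1.0])]
s = np.array([-1.0, 0.2])                    # trial direction

feasible = all(s @ g >= 0 for g in grad_gs)  # s^T grad(g_j) >= 0 for j in I_A
usable   = (s @ grad_f) < 0                  # s^T grad(f) < 0
print(feasible, usable)                      # False True for these numbers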
Multiplying the first Kuhn-Tucker condition by s_i and summing over i, we
obtain
s^T ∇f = ∑_{j=1}^{n_g} λ_j s^T ∇g_j
Since every term on the right-hand side is then nonnegative, s^T ∇f < 0 is
impossible if the λ_j's are nonnegative.
If the Kuhn-Tucker conditions are satisfied at a point, it is impossible
to find a direction with a negative slope of the objective function that does
not violate the constraints. In some cases, though, it is possible to move in
a direction that is tangent to the active constraints and perpendicular to
the gradient of the objective, that is,
s^T ∇f = s^T ∇g_j = 0,   j ∈ I_A
The effect of such a move on the objective function and constraints
can be determined only from higher derivatives. In some cases, a move in
this direction could reduce the objective function without violating the
constraints even though the Kuhn-Tucker conditions are met. Therefore,
the Kuhn-Tucker conditions are necessary but not sufficient for optimality.
The Kuhn-Tucker conditions are sufficient when the number of
active constraints is equal to the number of design variables. In this case
the above equation cannot be satisfied with s ≠ 0, because the gradients
∇g_j of the active constraints then span n linearly independent directions.
When the number of active constraints is not equal to the number of
design variables, sufficient conditions for optimality require the second
derivatives of the objective function and constraints. A sufficient condition
for optimality is that the Hessian matrix of the Lagrangian function is
positive definite in the subspace tangent to the active constraints. For
example, if we take the case of equality constraints, the Hessian matrix of
the Lagrangian is
∇²L = ∇²f − ∑_{j=1}^{n_e} λ_j ∇²h_j
The sufficient condition for optimality is that
s^T (∇²L) s > 0
for all s for which s^T ∇h_j = 0,   j = 1, ..., n_e
When inequality constraints are present, the vector s also needs to
be orthogonal to the gradients of the active constraints with positive
Lagrange multipliers. For active constraints with zero Lagrange
multipliers, s must satisfy
s^T ∇g_j ≥ 0   when g_j = 0 and λ_j = 0
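Numerically, the tangent-subspace test can be carried out by restricting the Hessian of the Lagrangian to a basis of the subspace orthogonal to the active-constraint gradients. The helper below is my own sketch; for simplicity it treats all active constraints as having positive multipliers:

import numpy as np

def sufficient_condition(hess_L, active_grads, tol=1e-10):
    # rows of A are the gradients of the active constraints
    A = np.array(active_grads, dtype=float)
    _, sv, vt = np.linalg.svd(A)
    Z = vt[np.sum(sv > tol):].T          # columns of Z span {s : A s = 0}
    if Z.size == 0:                      # as many independent active constraints as variables
        return True
    reduced = Z.T @ hess_L @ Z           # Hessian of L restricted to the tangent subspace
    return bool(np.all(np.linalg.eigvalsh(reduced) > 0))

For active constraints with zero multipliers, the weaker requirement s^T ∇g_j ≥ 0 stated above would have to be checked separately.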
2. Example
Find the minimum of
f = −x1³ − 2x2² + 10x1 − 6 − 2x2³
subject to
g1 = 10 − x1x2 ≥ 0,
g2 = x1 ≥ 0,
g3 = 10 − x2 ≥ 0.
The Kuhn-Tucker conditions are
−3x1² + 10 + λ1x2 − λ2 = 0
−4x2 − 6x2² + λ1x1 + λ3 = 0
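These two equations can be reproduced symbolically; a minimal SymPy sketch:

import sympy as sp

x1, x2, l1, l2, l3 = sp.symbols('x1 x2 lam1 lam2 lam3', real=True)
f = -x1**3 - 2*x2**2 + 10*x1 - 6 - 2*x2**3
g = [10 - x1*x2, x1, 10 - x2]
L = f - (l1*g[0] + l2*g[1] + l3*g[2])
print(sp.diff(L, x1))   # = -3*x1**2 + 10 + lam1*x2 - lam2
print(sp.diff(L, x2))   # = -4*x2 - 6*x2**2 + lam1*x1 + lam3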
We have to check for all possibilities of active constraints.
The simplest case is when no constraints are active, λ1 = λ2 = λ3 = 0.
We get x1 = 1.826, x2 = 0, f = 6.17. The Hessian matrix of the Lagrangian,
∇²L = [[−6x1, λ1], [λ1, −4 − 12x2]],
is clearly negative definite, so this point is a maximum.
We next assume that the first constraint is active, x1x2 = 10, so that x1 ≠ 0;
then g2 is inactive and therefore λ2 = 0. We have two possibilities for the
third constraint. If it is active, we get x1 = 1, x2 = 10, λ1 = −0.7, λ3 = 639.3;
since λ1 is negative, this point is neither a minimum nor a maximum.
If the third constraint is not active, λ3 = 0, and we obtain the following
equations:
−3x1² + 10 + λ1x2 = 0
−4x2 − 6x2² + λ1x1 = 0
x1x2 = 10
The only solution for these equations that satisfies the constraints on
x1 and x2 is
x1=3.847, x2=2.599, λ1=13.24, f=-73.08.
This point satisfies the KT conditions for a minimum. However, the
Hessian of the Lagrangian at that point,
∇²L = [[−23.08, 13.24], [13.24, −35.19]],
is negative definite, so that it cannot satisfy the sufficiency
condition. In fact, an examination of the function f at neighboring points
along x1 x2 = 10 reveals that the point is not a minimum.
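As a numerical check of this case, the three equations above can be solved directly and the Hessian inspected; the starting guess below is my own choice:

import sympy as sp

x1, x2, lam1 = sp.symbols('x1 x2 lam1', real=True)
eqs = [-3*x1**2 + 10 + lam1*x2,      # dL/dx1 = 0
       -4*x2 - 6*x2**2 + lam1*x1,    # dL/dx2 = 0
       x1*x2 - 10]                   # g1 active
sol = sp.nsolve(eqs, [x1, x2, lam1], [4, 3, 12])
print(sol)                           # approx (3.847, 2.599, 13.24)

hessL = sp.Matrix([[-6*x1, lam1], [lam1, -4 - 12*x2]])
print(hessL.subs(list(zip([x1, x2, lam1], sol))).eigenvals())   # both eigenvalues negative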
Next we consider the possibility that g1 is not active, so that λ1=0, and
−3x1² + 10 − λ2 = 0
−4x2 − 6x2² + λ3 = 0
We have already considered the possibility of both λ's being zero, so
we only need to consider the three remaining possibilities: one of these
Lagrange multipliers nonzero, or both nonzero. The first case is
λ2 ≠ 0, λ3 = 0, so that g2 = 0, and we get
x1=0, x2=0, λ2=10, f=-6 or x1=0, x2=-2/3, λ2=10, f=-6.99.
Both points satisfy the KT conditions for a minimum, but not the
sufficiency conditions. In fact, the vectors tangent to the active constraint
(x1 = 0 is the only one) have the form s^T = (0, a), and it is easy to check that
s^T(∇²L)s < 0. It is also easy to check that these points are indeed not minima
by reducing x2 slightly.
The next case is λ2 = 0, λ3 ≠ 0 , so that g3=0. We get
x1=1.826, x2=10, λ3=640, f=-2194
This point satisfies the KT conditions, but it is not a minimum either.
It is easy to check that ∇²L is negative definite in this case, so the
sufficiency condition cannot be satisfied.
Finally, we consider the case in which both g2 and g3 are active, x1 = 0,
x2 = 10, which gives λ2 = 10, λ3 = 640, f = −2206. Now the KT conditions are
satisfied, and the number of active constraints is equal to the number of
design variables, so this point is the minimum.
3. Convex Problems
There is a class of problems, namely convex problems, for which the
Kuhn-Tucker conditions are not only necessary but also sufficient for a
global minimum.
A set of points S is convex whenever the entire line segment
connecting two points that are in S is also in S. That is
If x1, x2 ∈ S, then αx1 + (1 − α)x2 ∈ S,   0 < α < 1
A function is convex if
f[αx2 + (1 − α)x1] ≤ αf(x2) + (1 − α)f(x1),   0 < α < 1
It can be shown that a function of n variables is convex if its matrix
of second derivatives is positive semi-definite.
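For a twice-differentiable function this is easy to check numerically; the quadratic below, f = x1² + x1x2 + x2², is an illustrative choice with a constant Hessian:

import numpy as np

H = np.array([[2.0, 1.0],    # Hessian of x1**2 + x1*x2 + x2**2
              [1.0, 2.0]])
print(np.all(np.linalg.eigvalsh(H) >= 0))   # True, so the function is convex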
A convex optimization problem has a convex objective function and a
convex feasible domain. It can be shown that the feasible domain is
convex if all the inequality constraints gj are concave and the equality
constraints are linear. A convex optimization problem has only one
minimum, and the Kuhn-Tucker conditions are sufficient to establish it.
Most optimization problems encountered in practice cannot be shown
to be convex. However, the theory of convex programming is still very
important in structural optimization, as we often approximate optimization
problems by a series of convex approximations.