Topics in Convex Optimisation (Lent 2023) Lecturer: Hamza Fawzi

3 Smoothness and strong convexity


3.1 Dual norms
Recall that if $\|\cdot\|$ is a norm on $\mathbb{R}^n$, then the dual norm is defined by

\[ \|y\|_* = \sup_{\|x\| = 1} \langle y, x \rangle. \]

In particular we have the generalized Cauchy-Schwarz inequality

\[ \langle x, y \rangle \le \|x\| \, \|y\|_* \quad \forall x, y \in \mathbb{R}^n. \]


Exercise: Show that the dual norm of the Euclidean norm $\|x\|_2 = \sqrt{\langle x, x \rangle}$ is the Euclidean norm itself. More generally, show that the dual of the $p$-norm $\|x\|_p = (\sum_i |x_i|^p)^{1/p}$ is the $q$-norm, where $1/p + 1/q = 1$.
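A quick numerical sanity check of this exercise (a Python sketch, not part of the original notes): for $y \ne 0$, the supremum defining the dual of the $p$-norm is attained at $x_i = \operatorname{sign}(y_i)\,|y_i|^{q-1}/\|y\|_q^{q-1}$, which has unit $p$-norm and achieves $\langle y, x \rangle = \|y\|_q$.

import numpy as np

# Check that the dual of the p-norm is the q-norm (1/p + 1/q = 1) by
# evaluating <y, x> at the closed-form maximizer of the dual-norm sup.
rng = np.random.default_rng(0)
y = rng.standard_normal(5)
p, q = 3.0, 1.5                    # conjugate exponents: 1/3 + 2/3 = 1

yq = np.linalg.norm(y, q)
x = np.sign(y) * np.abs(y) ** (q - 1) / yq ** (q - 1)

print(np.linalg.norm(x, p))        # ~1.0, so x is feasible for the sup
print(y @ x, yq)                   # both print ||y||_q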

3.2 L-smoothness
We say that a differentiable function $f : \mathbb{R}^n \to \mathbb{R}$ is $L$-smooth with respect to a norm $\|\cdot\|$ if, for any $x, y \in \operatorname{int} \operatorname{dom}(f)$,

\[ \|\nabla f(x) - \nabla f(y)\|_* \le L \|x - y\|, \tag{1} \]

where $\|\cdot\|_*$ is the dual norm to $\|\cdot\|$. (We will sometimes omit the reference to the norm, in which case we work with the Euclidean norm.) The following lemma will be important in the analysis of optimization algorithms.

Lemma 1 (Descent lemma). If $f$ is $L$-smooth, then for any $x \in \operatorname{int} \operatorname{dom}(f)$ and $y \in \operatorname{dom}(f)$,

\[ f(y) \le f(x) + \langle \nabla f(x), y - x \rangle + \frac{L}{2} \|y - x\|^2. \tag{2} \]
Remark 1. To appreciate the implication of the inequality above, assume $\|\cdot\| = \|\cdot\|_2$ is the Euclidean norm, and consider taking $y = x - (1/L)\nabla f(x)$, i.e., one step of the gradient method with step size $t = 1/L$. Then we get $f(y) \le f(x) - \frac{1}{2L} \|\nabla f(x)\|_2^2 < f(x)$ (provided $\nabla f(x) \ne 0$), i.e., the function value decreases at each iteration.
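To see Remark 1 in action, here is a minimal sketch (assuming $f$ is a positive definite quadratic, for which $L$ is the largest Hessian eigenvalue, per Remark 3 below): one gradient step with step size $1/L$ decreases $f$ by at least $\frac{1}{2L}\|\nabla f(x)\|_2^2$.

import numpy as np

rng = np.random.default_rng(1)
M = rng.standard_normal((5, 5))
A = M @ M.T + np.eye(5)                  # positive definite Hessian
b = rng.standard_normal(5)

f = lambda x: 0.5 * x @ A @ x - b @ x    # L-smooth quadratic
grad = lambda x: A @ x - b
L = np.linalg.eigvalsh(A).max()          # smoothness constant

x = rng.standard_normal(5)
y = x - grad(x) / L                      # one gradient step with t = 1/L
decrease = f(x) - f(y)
print(decrease >= np.linalg.norm(grad(x)) ** 2 / (2 * L) - 1e-12)  # True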

Proof of Lemma 1. Let $h = y - x$ and $\varphi(t) = f(x + th) - (f(x) + t \langle \nabla f(x), h \rangle)$. Then $\varphi$ is differentiable with $\varphi(0) = 0$, and

\[ \varphi'(t) = \langle \nabla f(x + th) - \nabla f(x), h \rangle \le \|\nabla f(x + th) - \nabla f(x)\|_* \|h\| \le L t \|h\|^2, \]

where we used the generalized Cauchy-Schwarz inequality and the Lipschitz assumption (1). Thus it follows that $\varphi(1) = \varphi(0) + \int_0^1 \varphi'(t) \, dt \le \frac{L}{2} \|h\|^2$, which gives precisely the desired inequality (2), since $\varphi(1) = f(y) - f(x) - \langle \nabla f(x), y - x \rangle$.

A simple way to check $L$-smoothness is by analyzing the Hessian matrix. One can show the following proposition:

Proposition 3.1. Assume $f : \mathbb{R}^n \to \mathbb{R}$ is such that $\operatorname{dom}(f)$ is open, and $f$ is twice continuously differentiable on its domain. Then $f$ is $L$-smooth if, and only if, for all $x \in \operatorname{dom}(f)$,

\[ \forall u, v \in \mathbb{R}^n, \quad \langle \nabla^2 f(x) u, v \rangle \le L \|u\| \|v\|. \tag{3} \]

Remark 2. Condition (3) can be equivalently written as $\|\nabla^2 f(x) u\|_* \le L \|u\|$ for all $u \in \mathbb{R}^n$. Equivalently, this is saying that the $(\mathbb{R}^n, \|\cdot\|) \to (\mathbb{R}^n, \|\cdot\|_*)$ induced norm of the linear map $\nabla^2 f(x)$ is at most $L$.

Proof. (⇐) Let $x, y \in \operatorname{dom}(f)$. The fundamental theorem of calculus applied to the function $t \mapsto \nabla f(x + th)$ with $h = y - x$ tells us that $\nabla f(y) - \nabla f(x) = \int_0^1 \nabla^2 f(x + th) h \, dt$. Thus we can write

\[ \|\nabla f(y) - \nabla f(x)\|_* \le \int_0^1 \|\nabla^2 f(x + th) h\|_* \, dt \le \int_0^1 L \|h\| \, dt = L \|h\| \]

as desired, where the second inequality uses condition (3) in the form of Remark 2.
(⇒) Assume $f$ is $L$-smooth. Let $u, v$ be arbitrary vectors, and define $\psi(t) = \langle \nabla f(x + tu) - \nabla f(x), v \rangle$. Then by generalized Cauchy-Schwarz and $L$-smoothness, $\psi(t) \le \|\nabla f(x + tu) - \nabla f(x)\|_* \|v\| \le L t \|u\| \|v\|$, and so $\psi'(0) = \lim_{t \to 0} (\psi(t) - \psi(0))/t \le L \|u\| \|v\|$. But $\psi'(0) = \langle \nabla^2 f(x) u, v \rangle$.

Remark 3. If $\|\cdot\| = \|\cdot\|_2$ is the Euclidean norm, then condition (3) is equivalent to saying that the eigenvalues of $\nabla^2 f(x)$ are all in $[-L, L]$.
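As an illustration of Proposition 3.1 and Remark 3, consider the logistic-type function $f(x) = \log(1 + e^{\langle w, x \rangle})$ (an example chosen for this sketch, not taken from the notes). Its Hessian is $\sigma'(\langle w, x \rangle) \, w w^\top$, whose only nonzero eigenvalue is at most $\|w\|_2^2 / 4$, so this value of $L$ should make (1) hold; the sketch below spot-checks this.

import numpy as np

rng = np.random.default_rng(2)
w = rng.standard_normal(4)
sigmoid = lambda t: 1.0 / (1.0 + np.exp(-t))
grad = lambda x: sigmoid(w @ x) * w      # gradient of log(1 + exp(<w, x>))
L = np.linalg.norm(w) ** 2 / 4           # bound on the largest Hessian eigenvalue

for _ in range(1000):                    # spot-check the Lipschitz bound (1)
    x, y = rng.standard_normal((2, 4))
    assert np.linalg.norm(grad(x) - grad(y)) <= L * np.linalg.norm(x - y) + 1e-12
print("Lipschitz bound (1) holds on all sampled pairs")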

3.3 Strong convexity


We say that $f$ is $m$-strongly convex (with respect to the norm $\|\cdot\|$) if for any $x, y \in \operatorname{dom}(f)$ and $t \in [0, 1]$,

\[ f(tx + (1-t)y) \le t f(x) + (1-t) f(y) - \frac{m}{2} t(1-t) \|x - y\|^2. \tag{4} \]
• If $f$ is $m$-strongly convex and differentiable at $x$, then for any $y \in \operatorname{dom}(f)$ we have

\[ f(y) \ge f(x) + \langle \nabla f(x), y - x \rangle + \frac{m}{2} \|y - x\|^2. \tag{5} \]

(This can be proved simply by subtracting $f(x)$ from both sides of (4), dividing by $t$, and letting $t \to 0$.) The converse is also true, i.e., if $\operatorname{dom}(f)$ is open, $f$ is differentiable everywhere on its domain, and (5) holds for all $x, y \in \operatorname{dom}(f)$, then $f$ is $m$-strongly convex. (Exercise)

• If $f$ is twice continuously differentiable on its domain (assumed open), then strong convexity is equivalent to $\langle \nabla^2 f(x) h, h \rangle \ge m \|h\|^2$ for all $x \in \operatorname{dom}(f)$ and $h \in \mathbb{R}^n$. (Proof left as an exercise; see the sketch after this list for a numerical illustration.)
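A minimal numerical sketch of the two characterizations above (assuming $f$ is a positive definite quadratic, for which $m$ is the smallest Hessian eigenvalue): the lower bound (5) holds at randomly sampled pairs of points.

import numpy as np

rng = np.random.default_rng(3)
M = rng.standard_normal((4, 4))
A = M @ M.T + 0.5 * np.eye(4)            # positive definite Hessian
f = lambda x: 0.5 * x @ A @ x
grad = lambda x: A @ x
m = np.linalg.eigvalsh(A).min()          # strong convexity constant

for _ in range(1000):                    # spot-check the lower bound (5)
    x, y = rng.standard_normal((2, 4))
    lower = f(x) + grad(x) @ (y - x) + 0.5 * m * np.linalg.norm(y - x) ** 2
    assert f(y) >= lower - 1e-10
print("lower bound (5) holds on all sampled pairs")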

Remark 4. When considering the Euclidean norm, we see that a convex function $f$ is $L$-smooth if, and only if, $\nabla^2 f(x) \preceq L I$, i.e., $L I - \nabla^2 f(x)$ is positive semidefinite (where $I$ is the identity matrix), i.e., all the eigenvalues of $\nabla^2 f(x)$ are $\le L$. Similarly, a function $f$ is $m$-strongly convex if, and only if, $\nabla^2 f(x) \succeq m I$, i.e., all the eigenvalues of $\nabla^2 f(x)$ are $\ge m$.

To summarize, if a function $f$ is $L$-smooth and $m$-strongly convex, then we can find, at any point $x \in \operatorname{int} \operatorname{dom}(f)$, global quadratic lower and upper bounds on $f$:

\[ \underbrace{f(x) + \langle \nabla f(x), y - x \rangle + \frac{m}{2} \|y - x\|^2}_{\text{strong convexity}} \le f(y) \le \underbrace{f(x) + \langle \nabla f(x), y - x \rangle + \frac{L}{2} \|y - x\|^2}_{\text{$L$-smoothness}}. \tag{6} \]

The ratio $\kappa = L/m$ can be interpreted as a condition number of $f$. This quantity will play a prominent role in the convergence analysis of optimization algorithms for strongly convex functions.
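To preview why $\kappa$ matters, here is a sketch on an ill-conditioned diagonal quadratic (an illustrative example, not from the notes): gradient descent with step size $1/L$ on $f(x) = \frac{1}{2} x^\top A x$ contracts the error along the eigendirection of $m$ by a factor $1 - m/L = 1 - 1/\kappa$ per iteration, so on the order of $\kappa$ iterations are needed to reduce the error by a constant factor.

import numpy as np

L, m = 100.0, 1.0
A = np.diag([L, m])                      # kappa = L/m = 100
x = np.array([1.0, 1.0])
for k in range(1, 6):
    x = x - (A @ x) / L                  # gradient step with t = 1/L; minimizer is 0
    print(k, np.linalg.norm(x))          # slow axis shrinks by 1 - 1/kappa per step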

The inequalities (6) can be expressed more concisely if we introduce the so-called Bregman divergence of $f$, defined as the gap between $f$ and its linear approximation:

\[ D_f(y|x) = f(y) - (f(x) + \langle \nabla f(x), y - x \rangle). \]

The inequalities above can then be written as:

\[ \frac{m}{2} \|y - x\|^2 \le D_f(y|x) \le \frac{L}{2} \|y - x\|^2. \]
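For a quadratic $f(x) = \frac{1}{2} x^\top A x - \langle b, x \rangle$ the Bregman divergence is exactly $D_f(y|x) = \frac{1}{2}(y - x)^\top A (y - x)$, which makes the sandwich bound easy to check numerically (a sketch, not part of the original notes):

import numpy as np

rng = np.random.default_rng(4)
M = rng.standard_normal((3, 3))
A = M @ M.T + np.eye(3)                  # m, L = extreme eigenvalues of A
b = rng.standard_normal(3)
f = lambda x: 0.5 * x @ A @ x - b @ x
grad = lambda x: A @ x - b
m, L = np.linalg.eigvalsh(A)[[0, -1]]

x, y = rng.standard_normal((2, 3))
D = f(y) - f(x) - grad(x) @ (y - x)      # Bregman divergence D_f(y|x)
d2 = np.linalg.norm(y - x) ** 2
print(0.5 * m * d2 <= D <= 0.5 * L * d2) # True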
