Optimization for Machine Learning
Lecture Notes CS-439, Spring 2025
Bernd Gärtner, ETH
Martin Jaggi, EPFL
February 17, 2025
Contents
1 Theory of Convex Functions
2 Gradient Descent
3 Projected and Proximal Gradient Descent
4 Subgradient Descent
5 Stochastic Gradient Descent
6 Nonconvex Functions
7 Newton’s Method
8 Quasi-Newton Methods
9 Coordinate Descent
10 The Frank-Wolfe Algorithm
Chapter 1
Theory of Convex Functions
Contents
1.1 Mathematical Background
    1.1.1 Notation
    1.1.2 The Cauchy-Schwarz inequality
    1.1.3 The spectral norm
    1.1.4 The mean value theorem
    1.1.5 The fundamental theorem of calculus
    1.1.6 Differentiability
1.2 Convex sets
    1.2.1 The mean value inequality
1.3 Convex functions
    1.3.1 First-order characterization of convexity
    1.3.2 Second-order characterization of convexity
    1.3.3 Operations that preserve convexity
1.4 Minimizing convex functions
    1.4.1 Strictly convex functions
    1.4.2 Example: Least squares
    1.4.3 Constrained minimization
1.5 Existence of a minimizer
    1.5.1 Sublevel sets and the Weierstrass Theorem
1.6 Examples
    1.6.1 Handwritten digit recognition
    1.6.2 Master’s Admission