ℓ1-norm Methods for
Convex-Cardinality Problems
Part II
• total variation
• iterated weighted ℓ1 heuristic
• matrix rank constraints
EE364b, Stanford University
Total variation reconstruction
• fit xcor with a piecewise constant x̂, with no more than k jumps
• convex-cardinality problem: minimize ‖x̂ − xcor‖₂ subject to
card(Dx̂) ≤ k (D is the first-order difference matrix)
• heuristic: minimize ‖x̂ − xcor‖₂ + γ‖Dx̂‖₁; vary γ to adjust the number
of jumps (a CVXPY sketch follows below)
• ‖Dx̂‖₁ is the total variation of the signal x̂
• method is called total variation reconstruction
• unlike ℓ2-based reconstruction, TVR filters out high-frequency noise
while preserving sharp jumps
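
A minimal CVXPY sketch of this heuristic; the corrupted signal xcor, the
random seed, and the weight γ below are illustrative stand-ins, not the
data used in the example that follows:

    import numpy as np
    import cvxpy as cp

    n = 2000
    rng = np.random.default_rng(0)
    # hypothetical piecewise-constant signal plus noise, standing in for xcor
    xcor = np.repeat(rng.standard_normal(10), n // 10) + 0.1 * rng.standard_normal(n)

    gamma = 5.0                      # trades off fit versus number of jumps
    xhat = cp.Variable(n)
    fit = cp.norm(xhat - xcor, 2)    # ||xhat - xcor||_2
    tv = cp.norm1(cp.diff(xhat))     # ||D xhat||_1, the total variation
    cp.Problem(cp.Minimize(fit + gamma * tv)).solve()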
Example (§6.3.3 in BV book)
signal x ∈ R^2000 and corrupted signal xcor ∈ R^2000
[figure: the signal x (top) and the corrupted signal xcor (bottom) versus t, 0 ≤ t ≤ 2000]
Total variation reconstruction
for three values of γ
[figure: TV reconstructions x̂ versus t for three values of γ]
ℓ2 reconstruction
for three values of γ
[figure: ℓ2 reconstructions x̂ versus t for three values of γ]
Example: 2D total variation reconstruction
• x ∈ R^n are values of pixels on an N × N grid (N = 31, so n = 961)
• assumption: x has relatively few big changes in value (i.e., boundaries)
• we have m = 120 linear measurements, y = F x (F_ij ∼ N(0, 1))
• as a convex-cardinality problem:

    minimize    Σ_{i,j} card(x_{i,j} − x_{i+1,j}) + Σ_{i,j} card(x_{i,j} − x_{i,j+1})
    subject to  y = F x

• ℓ1 heuristic (the objective is a 2D version of total variation; see the sketch below):

    minimize    Σ_{i,j} |x_{i,j} − x_{i+1,j}| + Σ_{i,j} |x_{i,j} − x_{i,j+1}|
    subject to  y = F x
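
A minimal CVXPY sketch of the 2D heuristic with the slide's dimensions
(m = 120, N = 31); the random F and the blocky test image are assumptions
for illustration:

    import numpy as np
    import cvxpy as cp

    N, m = 31, 120
    rng = np.random.default_rng(0)
    F = rng.standard_normal((m, N * N))
    x_true = np.zeros((N, N))
    x_true[8:20, 8:20] = 1.0                     # hypothetical blocky image
    y = F @ x_true.ravel(order="F")              # m = 120 linear measurements

    x = cp.Variable(N * N)
    X = cp.reshape(x, (N, N), order="F")         # view x as the N x N pixel grid
    tv2d = cp.sum(cp.abs(X[1:, :] - X[:-1, :])) + cp.sum(cp.abs(X[:, 1:] - X[:, :-1]))
    cp.Problem(cp.Minimize(tv2d), [F @ x == y]).solve()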
TV reconstruction
[figure: original image (left) and TV reconstruction (right), pixel values on the 31 × 31 grid]
. . . not bad for 8× more variables than measurements!
ℓ2 reconstruction
[figure: original image (left) and ℓ2 reconstruction (right), pixel values on the 31 × 31 grid]
. . . this is what you’d expect with 8× more variables than measurements
Iterated weighted ℓ1 heuristic
• to minimize card(x) over x ∈ C
    w := 1
    repeat
        minimize ‖diag(w) x‖₁ over x ∈ C
        w_i := 1/(ε + |x_i|)
• first iteration is the basic ℓ1 heuristic
• increases the relative weight on small x_i
• typically converges in 5 or fewer steps
• often gives a modest improvement (i.e., reduction in card(x)) over the
basic ℓ1 heuristic (a CVXPY sketch follows below)
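
A minimal CVXPY sketch of the iteration, for the concrete case
C = {x | Ax ⪯ b} used in the example two slides below; the random A and b
are placeholders:

    import numpy as np
    import cvxpy as cp

    m, n, eps = 100, 50, 1e-3
    rng = np.random.default_rng(0)
    A = rng.standard_normal((m, n))
    b = A @ rng.standard_normal(n) + 0.1         # guarantees a feasible point

    x = cp.Variable(n)
    w = np.ones(n)                               # first pass: basic l1 heuristic
    for _ in range(5):                           # typically <= 5 steps suffice
        cp.Problem(cp.Minimize(cp.norm1(cp.multiply(w, x))), [A @ x <= b]).solve()
        w = 1.0 / (eps + np.abs(x.value))        # up-weight the small entries

    print("card(x) =", int(np.sum(np.abs(x.value) > 1e-5)))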
Interpretation
• wlog we can take x ⪰ 0 (by writing x = x₊ − x₋, x₊, x₋ ⪰ 0, and
replacing card(x) with card(x₊) + card(x₋))
• we'll use the approximation card(z) ≈ log(1 + z/ε), where ε > 0, z ∈ R₊
• using this approximation, we get the (nonconvex) problem

    minimize    Σ_{i=1}^n log(1 + x_i/ε)
    subject to  x ∈ C, x ⪰ 0

• we'll find a local solution by linearizing the objective at the current point x^(k),

    Σ_{i=1}^n log(1 + x_i/ε) ≈ Σ_{i=1}^n log(1 + x_i^(k)/ε) + Σ_{i=1}^n (x_i − x_i^(k))/(ε + x_i^(k))
and solving the resulting convex problem

    minimize    Σ_{i=1}^n w_i x_i
    subject to  x ∈ C, x ⪰ 0

with w_i = 1/(ε + x_i^(k)), to get the next iterate
• repeat until convergence to get a local solution
Sparse solution of linear inequalities
• minimize card(x) over the polyhedron {x | Ax ⪯ b}, A ∈ R^{100×50}
• ℓ1 heuristic finds x ∈ R^50 with card(x) = 44
• iterated weighted ℓ1 heuristic finds x with card(x) = 36
(global solution, via branch & bound, is card(x) = 32)
[figure: card(x) versus iteration for the ℓ1 and iterated ℓ1 heuristics]
Detecting changes in time series model
• AR(2) scalar time-series model

    y(t + 2) = a(t)y(t + 1) + b(t)y(t) + v(t),    v(t) IID N(0, 0.5²)

• assumption: a(t) and b(t) are piecewise constant and change infrequently
• given y(t), t = 1, . . . , T, estimate a(t), b(t), t = 1, . . . , T − 2
• heuristic: minimize, over the variables a(t), b(t), t = 1, . . . , T − 2,

    Σ_{t=1}^{T−2} (y(t + 2) − a(t)y(t + 1) − b(t)y(t))²
        + γ Σ_{t=1}^{T−3} (|a(t + 1) − a(t)| + |b(t + 1) − b(t)|)

(a CVXPY sketch follows below)
• vary γ to trade off fit versus the number of changes in a, b
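
A minimal CVXPY sketch of this heuristic; the random series y below is a
stand-in for data generated by the AR(2) model:

    import numpy as np
    import cvxpy as cp

    T, gamma = 300, 10.0
    rng = np.random.default_rng(0)
    y = rng.standard_normal(T)                   # stand-in for the observed series

    a = cp.Variable(T - 2)
    b = cp.Variable(T - 2)
    fit = cp.sum_squares(y[2:] - cp.multiply(a, y[1:-1]) - cp.multiply(b, y[:-2]))
    tv = cp.norm1(cp.diff(a)) + cp.norm1(cp.diff(b))
    cp.Problem(cp.Minimize(fit + gamma * tv)).solve()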
Time series and true coefficients
[figure: time series y(t) (left) and true coefficients a(t), b(t) (right), 0 ≤ t ≤ 300]
TV heuristic and iterated TV heuristic
left: TV with γ = 10; right: iterated TV, 5 iterations, ε = 0.005
[figure: estimated coefficients a(t), b(t) versus t; TV heuristic (left), iterated TV heuristic (right)]
Extension to matrices
• Rank is the natural analog of card for matrices
• convex-rank problem: convex, except for Rank in the objective or
constraints
• the rank problem reduces to the card problem when the matrices are diagonal:
Rank(diag(x)) = card(x)
• analog of the ℓ1 heuristic: use the nuclear norm, ‖X‖∗ = Σᵢ σᵢ(X)
(sum of the singular values; dual of the spectral norm)
• for X ⪰ 0, ‖X‖∗ reduces to Tr X (just as, for x ⪰ 0, ‖x‖₁ reduces to 1ᵀx);
a sketch follows below
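
A minimal CVXPY sketch of the nuclear norm heuristic on a generic
convex-rank problem: recover a low-rank matrix from a few linear
measurements. The dimensions, the rank-2 ground truth, and the random
measurement matrices A_i are assumptions for illustration:

    import numpy as np
    import cvxpy as cp

    p, q, m = 10, 10, 60
    rng = np.random.default_rng(0)
    X_true = rng.standard_normal((p, 2)) @ rng.standard_normal((2, q))  # rank 2
    As = [rng.standard_normal((p, q)) for _ in range(m)]
    y = np.array([np.sum(Ai * X_true) for Ai in As])  # y_i = Tr(Ai^T X_true)

    X = cp.Variable((p, q))
    constraints = [cp.trace(Ai.T @ X) == yi for Ai, yi in zip(As, y)]
    cp.Problem(cp.Minimize(cp.normNuc(X)), constraints).solve()
    print("rank =", int(np.sum(np.linalg.svd(X.value, compute_uv=False) > 1e-5)))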
Factor modeling
• given a matrix Σ ∈ S^n_+, find an approximation of the form Σ̂ = F Fᵀ + D, where
F ∈ R^{n×r} and D is diagonal and nonnegative
• gives an underlying factor model (with r factors)

    x = F z + v,    v ∼ N(0, D),    z ∼ N(0, I)

• model with the fewest factors:

    minimize    Rank X
    subject to  X ⪰ 0, D ⪰ 0 diagonal
                X + D ∈ C

with variables D, X ∈ S^n;
C is a convex set of acceptable approximations to Σ
• e.g., via KL divergence

    C = {Σ̂ | − log det(Σ^{−1/2} Σ̂ Σ^{−1/2}) + Tr(Σ^{−1/2} Σ̂ Σ^{−1/2}) − n ≤ ε}

• trace heuristic:

    minimize    Tr X
    subject to  X ⪰ 0, d ⪰ 0
                X + diag(d) ∈ C

with variables d ∈ R^n, X ∈ S^n
Example
• x = F z + v, z ∼ N(0, I), v ∼ N(0, D), D diagonal; F ∈ R^{20×3}
• Σ is the empirical covariance matrix from N = 3000 samples
• set of acceptable approximations

    C = {Σ̂ | ‖Σ^{−1/2}(Σ̂ − Σ)Σ^{−1/2}‖ ≤ β}

• trace heuristic (a CVXPY sketch follows below):

    minimize    Tr X
    subject to  X ⪰ 0, d ⪰ 0
                ‖Σ^{−1/2}(X + diag(d) − Σ)Σ^{−1/2}‖ ≤ β
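
A minimal CVXPY sketch of this trace heuristic, with Σ built from a
hypothetical 3-factor model matching the slide's dimensions (n = 20, r = 3,
N = 3000) and β = 0.1357; the random F and noise variances are assumptions:

    import numpy as np
    import cvxpy as cp

    n, r, Nsamp, beta = 20, 3, 3000, 0.1357
    rng = np.random.default_rng(0)
    F = rng.standard_normal((n, r))
    Dtrue = rng.uniform(0.1, 1.0, n)
    samples = rng.multivariate_normal(np.zeros(n), F @ F.T + np.diag(Dtrue), Nsamp)
    Sigma = samples.T @ samples / Nsamp               # empirical covariance

    evals, evecs = np.linalg.eigh(Sigma)
    S_ih = evecs @ np.diag(evals ** -0.5) @ evecs.T   # Sigma^{-1/2}

    X = cp.Variable((n, n), PSD=True)
    d = cp.Variable(n, nonneg=True)
    resid = S_ih @ (X + cp.diag(d) - Sigma) @ S_ih
    cp.Problem(cp.Minimize(cp.trace(X)), [cp.sigma_max(resid) <= beta]).solve()
    print("rank =", int(np.sum(np.linalg.eigvalsh(X.value) > 1e-4)))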
Trace approximation results
[figure: Rank(X) (left) and eigenvalues λᵢ(X) (right) versus β, for β between 10^{−2} and 10^0]
• for β = 0.1357 (the knee of the tradeoff curve) we find
    – ∠(range(X), range(F Fᵀ)) = 6.8°
    – ‖d − diag(D)‖/‖diag(D)‖ = 0.07
• i.e., we have recovered the factor model from the empirical covariance