An introduction to
simulation-based inference
51st SLAC Summer Institute
August 16, 2023
Gilles Louppe
[email protected]
1 / 36
2 / 36
v_x = v cos(α),  v_y = v sin(α),
dx/dt = v_x,  dy/dt = v_y,  dv_y/dt = −G.
3 / 36
import numpy as np
from numpy import random

G = 9.81  # acceleration due to gravity (m/s^2)

def simulate(v, alpha, dt=0.001):
    v_x = v * np.cos(alpha)            # x velocity (m/s)
    v_y = v * np.sin(alpha)            # y velocity (m/s)
    y = 1.1 + 0.3 * random.normal()    # noisy initial height (m)
    x = 0.0
    while y > 0:                       # simulate until ball hits floor
        v_y += dt * -G                 # acceleration due to gravity
        x += dt * v_x
        y += dt * v_y
    return x + 0.25 * random.normal()  # noisy measurement of the landing position (m)
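For example, repeated calls with the same parameters return different landing positions, since both the initial height and the final measurement are noisy:

for _ in range(3):
    print(simulate(v=10.0, alpha=np.pi / 4))  # a different landing point each call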
4 / 36
5 / 36
What parameter values θ are the most plausible?
6 / 36
7 / 36
Outline
1. Simulation-based inference
2. Algorithms
Neural ratio estimation
Neural posterior estimation
Neural score estimation
3. Diagnostics
8 / 36
Simulation-based inference
8 / 36
Scientific simulators
9 / 36
θ, z, x ∼ p(θ, z, x)
10 / 36
θ, z ∼ p(θ, z∣x)
11 / 36
12 / 36
p(x∣θ) = ∭ p(z_p∣θ) p(z_s∣z_p) p(z_d∣z_s) p(x∣z_d) dz_p dz_s dz_d
yikes!
13 / 36
Bayesian inference
Start with
a simulator that can generate N samples x_i ∼ p(x_i∣θ_i),
a prior model p(θ),
observed data x_obs ∼ p(x_obs∣θ_true).
Then, estimate the posterior
p(θ∣x_obs) = p(x_obs∣θ) p(θ) / p(x_obs).
14 / 36
15 / 36
Algorithms
15 / 36
―
Credits: Cranmer, Brehmer and Louppe, 2020. 16 / 36
Approximate Bayesian Computation (ABC)
Issues:
How should one choose x′? The tolerance ϵ? The distance ∣∣ ⋅ ∣∣?
No tractable posterior.
Need to run new simulations for new data or new prior.
―
Credits: Johann Brehmer. 17 / 36
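A minimal rejection-ABC sketch for the toy projectile simulator above; the prior, the tolerance eps, the observation x_obs, and the absolute-difference distance are illustrative assumptions, not prescriptions:

def abc_rejection(x_obs, prior_sample, eps=0.5, n_trials=10_000):
    accepted = []
    for _ in range(n_trials):
        v, alpha = prior_sample()        # theta' ~ p(theta)
        x = simulate(v, alpha)           # x' ~ p(x | theta')
        if abs(x - x_obs) < eps:         # keep theta' if ||x' - x_obs|| < eps
            accepted.append((v, alpha))
    return accepted                      # approximate posterior samples

# e.g. with a uniform prior over speed and angle
posterior_samples = abc_rejection(
    x_obs=9.5,
    prior_sample=lambda: (random.uniform(5, 15), random.uniform(0, np.pi / 2)),
)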
―
Credits: Cranmer, Brehmer and Louppe, 2020. 18 / 36
Neural ratio estimation
The likelihood-to-evidence ratio r(x∣θ) = p(x∣θ)/p(x) = p(x, θ)/(p(x)p(θ)) can be learned, even if neither the likelihood nor the evidence can be evaluated:
[Figure: a classifier is trained to distinguish joint samples x, θ ∼ p(x, θ) from independent samples x, θ ∼ p(x)p(θ); its output yields the estimator r̂(x∣θ).]
―
Credits: Cranmer et al, 2015; Hermans et al, 2020. 19 / 36
The solution d found after training approximates the optimal classifier
d(x, θ) ≈ d*(x, θ) = p(x, θ) / (p(x, θ) + p(x)p(θ)).
Therefore,
r(x∣θ) = p(x∣θ)/p(x) = p(x, θ)/(p(x)p(θ)) ≈ d(x, θ)/(1 − d(x, θ)) = r̂(x∣θ).
20 / 36
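A minimal training sketch of such a classifier in PyTorch; the network architecture, the batch shapes, and the in-batch shuffling used to approximate p(θ)p(x) are illustrative assumptions:

import torch
import torch.nn as nn

class RatioClassifier(nn.Module):
    def __init__(self, dim_theta, dim_x, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim_theta + dim_x, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, theta, x):
        return self.net(torch.cat([theta, x], dim=-1))  # logit of d(x, theta)

def nre_loss(classifier, theta, x):
    # pairs from the joint p(theta, x) are labelled 1; shuffling theta within
    # the batch approximates samples from p(theta)p(x), labelled 0
    logit_joint = classifier(theta, x)
    logit_marginal = classifier(theta[torch.randperm(len(theta))], x)
    bce = nn.functional.binary_cross_entropy_with_logits
    return (bce(logit_joint, torch.ones_like(logit_joint))
            + bce(logit_marginal, torch.zeros_like(logit_marginal)))

# after training, log r̂(x|theta) is simply the classifier logit, since d/(1 - d) = exp(logit)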
p(θ∣x) ≈ r̂(x∣θ) p(θ)
21 / 36
Constraining dark matter with stellar streams
Interaction of Pal 5 with two …
―
Image credits: C. Bickel/Science; D. Erkal. 22 / 36
―
Credits: Hermans et al, 2021. 23 / 36
Preliminary results for GD-1 suggest a preference for CDM over WDM.
24 / 36
Neural Posterior Estimation
min_{q_ϕ}  E_{p(x)} [ KL( p(θ∣x) ∣∣ q_ϕ(θ∣x) ) ]
25 / 36
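Since KL(p(θ∣x) ∣∣ q_ϕ(θ∣x)) = E_{p(θ∣x)}[log p(θ∣x) − log q_ϕ(θ∣x)] and the first term does not depend on ϕ, minimizing this objective amounts to maximizing E_{p(θ,x)}[log q_ϕ(θ∣x)], which can be estimated from simulated pairs (θ, x) alone. A minimal training-step sketch, where the conditional density estimator q (with a log_prob method), the optimizer, and the tensors theta, x are all assumed:

loss = -q.log_prob(theta, x).mean()  # Monte Carlo estimate of -E[log q_phi(theta | x)]
loss.backward()                      # gradient with respect to phi
optimizer.step()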
Normalizing flows
A normalizing flow is a sequence of invertible transformations f_k that map a simple distribution p_0 to a more complex distribution p_K.
By the change of variables formula, the log-likelihood of a sample x is given by
log p(x) = log p(z_0) − ∑_{k=1}^{K} log ∣det J_{f_k}(z_{k−1})∣.
26 / 36
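A one-dimensional sketch of this formula, assuming K hypothetical invertible affine steps f_k(z) = a_k z + b_k with a_k > 0:

import torch

base = torch.distributions.Normal(0.0, 1.0)             # p(z_0)
a = torch.tensor([1.5, 0.7, 2.0])                        # scales of f_1 ... f_K
b = torch.tensor([0.1, -0.3, 0.5])                       # shifts of f_1 ... f_K

def log_prob(x):
    z, log_det = x, 0.0
    for k in reversed(range(len(a))):                    # invert x = f_K(... f_1(z_0) ...)
        z = (z - b[k]) / a[k]
        log_det = log_det + torch.log(torch.abs(a[k]))   # accumulates sum_k log|det J_{f_k}|
    return base.log_prob(z) - log_det                    # log p(x) = log p(z_0) - sum_k log|det J_{f_k}(z_{k-1})|

In neural posterior estimation, the transformations are additionally conditioned on the observation x, so that the flow models q_ϕ(θ∣x) directly.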
Exoplanet atmosphere characterization
―
Credits: NASA/JPL-Caltech, 2010. 27 / 36
―
Credits: Vasist et al, 2023. 28 / 36
Diagnostics
28 / 36
p̂(θ∣x) = sbi(p(x∣θ), p(θ), x)
We must make sure that our approximate simulation-based inference algorithms can (at least) produce faithful inferences on the (expected) observations.
How do we know this is good enough?
29 / 36
Mode convergence
The maximum a posteriori estimate converges towards the nominal value θ* for an increasing number of independent and identically distributed observables x_i ∼ p(x∣θ*):
lim_{N→∞} arg max_θ p(θ∣{x_i}_{i=1}^N) = lim_{N→∞} arg max_θ p(θ) ∏_{x_i} r(x_i∣θ) = θ*.
―
Credits: Brehmer et al, 2019. 30 / 36
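A grid-based sketch of this pooling for a scalar parameter; the helpers log_r(x, thetas) (evaluating log r̂(x∣θ) on a grid), prior(thetas), and the list observations are all assumed:

import numpy as np

thetas = np.linspace(0.0, 1.0, 1000)        # parameter grid (assumed 1D)
log_post = np.log(prior(thetas))            # log p(theta), up to a constant
for x_i in observations:                    # i.i.d. observables x_i ~ p(x | theta*)
    log_post += log_r(x_i, thetas)          # add log r̂(x_i | theta)
theta_map = thetas[np.argmax(log_post)]     # approaches theta* as the number of observables grows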
Coverage diagnostic
For x, θ ∼ p(x, θ), compute the 1 − α credible interval based on p̂(θ∣x).
If the fraction of samples for which θ is contained within the interval is larger than the nominal coverage probability 1 − α, then the approximate posterior p̂(θ∣x) has coverage.
―
Credits: Hermans et al, 2021; Siddharth Mishra-Sharma, 2021. 31 / 36
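A sketch of this diagnostic for a scalar parameter, assuming hypothetical helpers prior_sample(), simulate(theta), and posterior_sample(x, n) that draws from p̂(θ∣x); the central credible interval is one possible choice of interval:

import numpy as np

def empirical_coverage(prior_sample, simulate, posterior_sample, alpha=0.05, n_trials=1000):
    hits = 0
    for _ in range(n_trials):
        theta = prior_sample()                                     # theta ~ p(theta)
        x = simulate(theta)                                        # x ~ p(x | theta)
        samples = posterior_sample(x, n=1000)                      # draws from p̂(theta | x)
        lo, hi = np.quantile(samples, [alpha / 2, 1 - alpha / 2])  # central 1 - alpha credible interval
        hits += lo <= theta <= hi
    return hits / n_trials                                         # should be >= 1 - alpha if conservative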
―
Credits: Hermans et al, 2021. 32 / 36
What if diagnostics fail?
33 / 36
Balanced NRE
Enforce neural ratio estimation to be conservative by using binary classifiers d̂ that are balanced, i.e. such that
E_{p(θ,x)}[ d̂(θ, x) ] = E_{p(θ)p(x)}[ 1 − d̂(θ, x) ].
―
Credits: Delaunoy et al, 2022. 34 / 36
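One way to encourage this balance, sketched on top of the NRE loss above (reusing its classifier and imports); the squared penalty and the weight lam are assumptions of the sketch:

def bnre_loss(classifier, theta, x, lam=100.0):
    logit_joint = classifier(theta, x)
    logit_marginal = classifier(theta[torch.randperm(len(theta))], x)
    bce = nn.functional.binary_cross_entropy_with_logits
    loss = (bce(logit_joint, torch.ones_like(logit_joint))
            + bce(logit_marginal, torch.zeros_like(logit_marginal)))
    # the balance condition above is equivalent to E_joint[d] + E_marginal[d] = 1
    balance = torch.sigmoid(logit_joint).mean() + torch.sigmoid(logit_marginal).mean()
    return loss + lam * (balance - 1.0) ** 2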
―
Credits: Delaunoy et al, 2022. 35 / 36
Summary
Advances in deep learning have enabled new approaches to statistical
inference.
This is a major evolution in the statistical capabilities for science, as it enables the analysis of complex models and data without simplifying assumptions.
Inference remains approximate and requires careful validation.
Obstacles remain to be overcome, such as the curse of dimensionality and
the need for large amounts of data.
36 / 36
The end.