CGS698C, Lectures 11-14: Bayesian regression modeling
Himanshu Yadav
2024-03-07
Contents
1 Parameter estimation using brms/stan
1.1 Implementation
1.2 Generating posterior predictions
2 Bayesian regression models
2.1 Inferences based on posterior estimates
2.2 Prior sensitivity analysis
2.3 Another way to write the regression equation
2.4 The lognormal likelihood
3 Logistic regression models
3.1 Implementation
See chapters 3 and 4 of the book “An Introduction to Bayesian Data Analysis for Cognitive Science”
([Link]) for reference.
1 Parameter estimation using brms/stan
• There are packages in R and Python that can estimate model parameters using one of the posterior
simulation algorithms introduced in the previous lectures.
• You only need to define your likelihood and priors in a given syntax; the algorithm in the background will start drawing samples from the posterior.
• Popular choices are the RStan/PyStan/brms packages, which use a Hamiltonian Monte Carlo algorithm for sampling.
1.1 Implementation
Example. A normal model with unknown mean and unknown variance
Suppose you are given 100 independent and identically distributed data points that are assumed to
come from a Normal distribution with mean µ and standard deviation σ. Let yi be the ith data point,
Likelihood:
yi ∼ Normal (µ, σ )
Priors:
µ ∼ Normal (50, 10)
σ ∼ Normal+ (0, 5)
# Assuming true parameter values mu = 60, sigma = 3
# Observed (fake) data
y <- rnorm(100,60,3)
hist(y)
[Figure: Histogram of y.]
# Estimate the parameters mu and sigma
# Prepare the data for brms; it needs a dataframe
dat <- data.frame(y=y)
head(dat)
## y
## 1 59.62189
## 2 57.69213
## 3 56.86780
## 4 64.54228
## 5 58.42823
## 6 60.25551
# Define priors
priors <- c(prior(normal(50, 10), class = Intercept),
prior(normal(0, 5), class = sigma))
# Fit the model (estimate parameters)
mfit <-
brm(formula = y ~ 1,
data=dat,
family = gaussian(),
prior = priors,
chains = 4,cores = 4,
iter = 2000,warmup = 1000)
save(mfit,file="FittedModels/[Link]")
# You can use pickle in python for saving large model fits
summary(mfit)
## Family: gaussian
## Links: mu = identity; sigma = identity
## Formula: y ~ 1
## Data: dat (Number of observations: 100)
## Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
## total post-warmup draws = 4000
##
## Population-Level Effects:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## Intercept 59.43 0.35 58.75 60.12 1.00 3490 2713
##
## Family Specific Parameters:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## sigma 3.49 0.25 3.05 4.03 1.00 3281 2790
##
## Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
## and Tail_ESS are effective sample size measures, and Rhat is the potential
## scale reduction factor on split chains (at convergence, Rhat = 1).
A summary of the parameter estimates:
1. There is 95% certainty (0.95 probability) that the true value of parameter mu lies between 58.75
and 60.12
2. The true value of sigma lies between 3.05 and 4.03 with 95% certainty
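These interval summaries can also be computed directly from the posterior draws. Below is a minimal sketch using as_draws_df(), a recommended alternative to the deprecated posterior_samples():
# Extract the posterior draws as a data frame
draws <- as_draws_df(mfit)
# Posterior mean and 95% credible interval for mu (the intercept)
c(mean = mean(draws$b_Intercept),
quantile(draws$b_Intercept, probs = c(.025, .975)))
# Posterior mean and 95% credible interval for sigma
c(mean = mean(draws$sigma),
quantile(draws$sigma, probs = c(.025, .975)))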
mcmc_trace(mfit)
[Figure: trace plots of b_Intercept, sigma, and lp__ for the four chains over the post-warmup iterations.]
mcmc_hist_by_chain(mfit,pars = c("b_Intercept","sigma"))
## Warning: The ‘facets‘ argument of ‘facet_grid()‘ is deprecated as of ggplot2 2.2.0.
## i Please use the ‘rows‘ argument instead.
## i The deprecated feature was likely used in the bayesplot package.
## Please report the issue at <[Link]
## This warning is displayed once every 8 hours.
## Call ‘lifecycle::last_lifecycle_warnings()‘ to see where this warning was
## generated.
## ‘stat_bin()‘ using ‘bins = 30‘. Pick better value with ‘binwidth‘.
[Figure: histograms of the posterior samples of b_Intercept and sigma, separated by chain.]
mcmc_hist(mfit,pars = c("b_Intercept","sigma"))
## ‘stat_bin()‘ using ‘bins = 30‘. Pick better value with ‘binwidth‘.
[Figure: histograms of the pooled posterior samples of b_Intercept and sigma.]
1.2 Generating posterior predictions
# You can directly use the pp_check function, which builds on the bayesplot package
pp_check(mfit,ndraws = 100, type = "dens_overlay")
[Figure: posterior predictive density overlay; y = observed data, y_rep = posterior predictive draws.]
Figure description: each density curve represents the distribution of a dataset. The “dark blue” curve
represents the observed data used to fit the model, and the “light blue” curves represent posterior
predictive data, i.e., data predicted by the model after it was fitted to the observed data.
# You can generate posterior predictions manually
# First, extract posterior samples from the model fit
post_samples <- posterior_samples(mfit)
## Warning: Method ’posterior_samples’ is deprecated. Please see ?as_draws for
## recommended alternatives.
# Second, create a dataframe containing posterior samples
ysim <- as.data.frame(matrix(nrow=4000*length(y),ncol=4))
colnames(ysim) <- c("sample_id","mu","sigma","ypred")
ysim$sample_id <- rep(1:4000,each=length(y))
ysim$mu <- rep(post_samples$b_Intercept,each=length(y))
ysim$sigma <- rep(post_samples$sigma,each=length(y))
# Third, generate data from the model
# conditional on each set of parameter values
for(k in 1:4000){ # one simulated dataset per posterior draw
iter_range <- (length(y)*(k-1) + 1):(length(y)*k)
ysim$ypred[iter_range] <-
rnorm(length(y),post_samples$b_Intercept[k],post_samples$sigma[k])
}
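Since the mu and sigma columns of ysim already repeat each posterior draw once per observation, the same predictions can also be generated without a loop; a minimal sketch of a vectorized alternative:
# Vectorized alternative: one predictive draw per row of ysim
ysim$ypred <- rnorm(nrow(ysim), mean = ysim$mu, sd = ysim$sigma)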
dat$sample_id <- 0
ggplot(subset(ysim,sample_id<100),aes(x=ypred,group=sample_id))+
geom_density(alpha=0.05,color="lightblue")+
geom_density(data=dat,aes(x=y),color="black",size=1)
## Warning: Using ‘size‘ aesthetic for lines was deprecated in ggplot2 3.4.0.
## i Please use ‘linewidth‘ instead.
## This warning is displayed once every 8 hours.
## Call ‘lifecycle::last_lifecycle_warnings()‘ to see where this warning was
## generated.
[Figure: density curves of ypred for the first posterior draws (light blue) overlaid with the density of the observed data (black).]
# Summary statistics of your posterior predictions
ysim.m <- ysim %>% group_by(sample_id) %>%
summarise(mean_pred=mean(ypred),
var_pred=var(ypred))
# Distribution of predicted means
hist(ysim.m$mean_pred)
[Figure: histogram of the predicted means, ysim.m$mean_pred.]
# Distribution of predicted variance
hist(ysim.m$var_pred)
[Figure: histogram of the predicted variances, ysim.m$var_pred.]
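These posterior predictive distributions can be compared against the corresponding statistics of the observed data; a minimal sketch:
# Observed summary statistics
mean(y)
var(y)
# Proportion of predictive datasets whose statistic exceeds the observed value
mean(ysim.m$mean_pred > mean(y), na.rm = TRUE)
mean(ysim.m$var_pred > var(y), na.rm = TRUE)
If these proportions are close to 0 or 1, the model systematically under- or over-predicts the corresponding statistic.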
2 Bayesian regression models
Suppose, in an experiment, you are asked to identify a triangle among many other shapes shown on
a screen. Your response time is being recorded.
The experimenter hypothesizes that the background color of the screen (“black” or “blue”) affects
your response time.
We do not have any further information about how exactly the background color affects the response time.
We can assume a linear relationship between the background color and the response time, such that the mean response time changes as a linear function of the background color:
µ_rt = α + βX (1)
where µ_rt is the mean response time, X is the background color and can take values 0 (for “black”)
or 1 (for “blue”); α is the intercept of the straight line, and β is the slope of the straight line.
The slope β represents the extent to which the background color affects the mean response time,
and the intercept α represents the mean response time when X = 0, i.e., when the background is
black.
Now, suppose you collect n repeated observations in your experiment such that around half of
them have a black background and the other half a blue background. The mean response time in trial
i would then depend on the background color of the stimulus.
µi = α + βXi (2)
If you assume that the response times are normally distributed, you can write
rti ∼ Normal (µi , σ ) (3)
In other words,
rti ∼ Normal (α + βXi , σ ) (4)
The above equation represents a linear regression model, where rti is the dependent variable, Xi is
an independent (or predictor) variable, and α, β, σ are the parameters of the model.
You can set priors on α, β, and σ.
α ∼ Normal (300, 50)
β ∼ Normal (0, 20)
σ ∼ Normal+ (0, 10)
You can estimate the parameters α, β, and σ using brms; we are primarily interested in the estimates of β because we want to test the experimenter’s hypothesis that β ≠ 0.
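Before fitting the model, it is useful to check what these priors imply about the data (a prior predictive check). Below is a minimal sketch that simulates response times from the priors alone, assuming the two background colors are equally likely:
# Prior predictive simulation for the response-time regression
n_sim <- 1000
rt_prior_pred <- numeric(n_sim)
for(s in 1:n_sim){
alpha_s <- rnorm(1, 300, 50)
beta_s <- rnorm(1, 0, 20)
sigma_s <- abs(rnorm(1, 0, 10)) # Normal+(0, 10), truncated at zero
X_s <- rbinom(1, 1, 0.5) # 0 = black background, 1 = blue background
rt_prior_pred[s] <- rnorm(1, alpha_s + beta_s * X_s, sigma_s)
}
hist(rt_prior_pred)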
2.1 Inferences based on posterior estimates
# Data
alpha <- 250
beta <- 20
sigma <- 10
X <- rep(0:1,50)
rt <- rep(NA,100)
dat <- [Link](X=X,rt=rt)
for(i in 1:nrow(dat)){
dat$rt[i] <- rnorm(1,alpha + beta*X[i],sigma)
}
head(dat)
## X rt
## 1 0 247.5418
## 2 1 264.3281
## 3 0 261.5266
## 4 1 279.1902
## 5 0 271.1362
## 6 1 263.2989
# Define priors
priors <- c(prior(normal(300, 50), class = Intercept),
prior(normal(0, 20), class = b, coef=X),
prior(normal(0, 10), class = sigma))
# Fit the model (estimate parameters)
mfit <-
brm(formula = rt ~ 1+X,
data=dat,
family = gaussian(),
prior = priors,
chains = 4,cores = 4,
iter = 2000,warmup = 1000)
save(mfit,file="FittedModels/[Link]")
summary(mfit)
## Family: gaussian
## Links: mu = identity; sigma = identity
## Formula: rt ~ 1 + X
## Data: dat (Number of observations: 100)
## Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
## total post-warmup draws = 4000
##
## Population-Level Effects:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## Intercept 251.48 1.58 248.46 254.64 1.00 4074 3082
## X 20.39 2.21 16.00 24.64 1.00 3941 3125
##
## Family Specific Parameters:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## sigma 11.18 0.81 9.74 12.89 1.00 3564 2680
##
## Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
## and Tail_ESS are effective sample size measures, and Rhat is the potential
## scale reduction factor on split chains (at convergence, Rhat = 1).
mcmc_trace(mfit,pars = c("b_Intercept","b_X","sigma"))
[Figure: trace plots of b_Intercept, b_X, and sigma for the four chains over the post-warmup iterations.]
mcmc_hist(mfit,pars = c("b_Intercept","b_X"))
## ‘stat_bin()‘ using ‘bins = 30‘. Pick better value with ‘binwidth‘.
[Figure: histograms of the posterior samples of b_Intercept and b_X.]
Based on the above posterior estimates, you can say the following:
1. The data are consistent with the experimenter’s hypothesis that background color affects the response times, because the 95% credible interval for β is entirely positive and does not cross zero.
2. The data suggest that β > 0, i.e., the mean response time is higher when the background color is
blue.
However, you cannot say that there is evidence for the experimenter’s hypothesis, because evidence
for any model assumption is always computed with respect to a baseline model assumption.
No model is absolutely correct; a model can only be relatively better than another.
All you can say, given the above results, is that the data are consistent with what the experimenter
predicted.
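One way to quantify how strongly the posterior supports the predicted direction is the posterior probability that β > 0; a minimal sketch using the posterior draws of the fitted model:
draws <- as_draws_df(mfit)
# Posterior probability that the slope is positive
mean(draws$b_X > 0)
Note that this probability only summarizes the direction of the effect under the fitted model; it is not a measure of evidence relative to a baseline model.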
2.2 Prior sensitivity analysis
What if another experimenter challenges your results, saying that your prior assumptions are unreasonable for the β parameter?
You can illustrate how the posterior estimate of β changes under different prior assumptions.
Let us choose 7 different priors on β:
1. β ∼ Normal (0, 2)
2. β ∼ Normal (0, 5)
3. β ∼ Normal (0, 10)
4. β ∼ Normal (0, 15)
5. β ∼ Normal (0, 20)
6. β ∼ Normal (0, 25)
7. β ∼ Normal (0, 30)
# Define priors
priors <- c(prior(normal(300, 50), class = Intercept),
prior(normal(0, 20), class = b, coef=X),
prior(normal(0, 10), class = sigma))
prior_sd <- c(2,5,10,15,20,25,30)
beta.estimates <- as.data.frame(matrix(nrow=0,ncol=4))
colnames(beta.estimates) <- c("prior","beta.mean","beta.lower","beta.upper")
# Fit the model (estimate parameters)
for(p in prior_sd){
priors[2,] <- set_prior(paste("normal(0, ",p,")",sep=""), class = "b", coef="X")
mfit <-
brm(formula = rt ~ 1+X,
data=dat,
family = gaussian(),
prior = priors,
chains = 4,cores = 4,
iter = 2000,warmup = 1000)
save(mfit,file=paste("FittedModels/Simple-linear-regression-prior-sd-",as.character(p),".Rda",sep=""))
post_samples <- posterior_samples(mfit)
beta.estimates[nrow(beta.estimates)+1,] <-
c(paste("normal(0, ",p,")",sep=""),
mean(post_samples$b_X),
unname(quantile(post_samples$b_X,probs=c(.025,.975))))
}
save(beta.estimates,file="FittedModels/[Link]")
load("FittedModels/[Link]")
beta.estimates
## prior beta.mean beta.lower beta.upper
## 8 normal(0, 2) 7.80440654632153 4.35792141388259 11.1519352077392
## 9 normal(0, 5) 17.1524443112993 12.9185487822437 21.2308021883041
## 10 normal(0, 10) 19.6630358513428 15.2200820790493 23.9755812727125
## 11 normal(0, 15) 20.2361781846795 15.8609839600651 24.4945166632383
## 12 normal(0, 20) 20.3728699565605 15.7195366832652 24.9504187637311
## 13 normal(0, 25) 20.4577454028757 16.0581165460414 24.8583639200569
## 14 normal(0, 30) 20.5156150048918 16.1419497687793 24.8608227487119
The posterior estimates are stable across the priors Normal (0, 10), Normal (0, 15), Normal (0, 20),
Normal (0, 25), and Normal (0, 30).
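The sensitivity analysis can also be visualized by plotting the posterior mean and 95% interval of β under each prior; a minimal sketch (the columns of beta.estimates were filled as character strings above, so they are converted back to numeric first):
library(ggplot2)
sens <- beta.estimates
sens$beta.mean <- as.numeric(sens$beta.mean)
sens$beta.lower <- as.numeric(sens$beta.lower)
sens$beta.upper <- as.numeric(sens$beta.upper)
sens$prior <- factor(sens$prior, levels = sens$prior) # preserve the ordering of the priors
ggplot(sens, aes(x = prior, y = beta.mean)) +
geom_point() +
geom_errorbar(aes(ymin = beta.lower, ymax = beta.upper), width = 0.2) +
labs(x = "Prior on beta", y = "Posterior mean and 95% CI of beta")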
2.3 Another way to write the regression equation
The regression model rti ∼ Normal (α + βXi , σ ) can be rewritten as
rti = α + βXi + ϵi where ϵi ∼ Normal (0, σ)
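The two formulations describe the same generative process; a minimal sketch, reusing the true parameter values and design from Section 2.1, that simulates response times by adding normally distributed errors to the linear predictor:
# Same generative model, written with an explicit error term
eps <- rnorm(nrow(dat), mean = 0, sd = sigma)
rt_alt <- alpha + beta * dat$X + eps
head(rt_alt)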
2.4 The lognormal likelihood
We have observed previously that response times are typically lognormally distributed.
We can revise our regression model as
rti ∼ Lognormal (α + βXi , σ )
or
log rti = α + βXi + ϵi where ϵi ∼ Normal (0, σ )
An important thing to note is that the parameters in the above model are in the log space. Thus,
we need to update our priors accordingly,
α ∼ Normal (6, 2)
β ∼ Normal (0, 3)
σ ∼ Normal+ (0, 2)
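Before fitting, one can check whether a prior like α ∼ Normal(6, 2) is reasonable by looking at what it implies on the millisecond scale; a minimal sketch that exponentiates draws from the prior:
# Median response times (in ms) implied by the prior on alpha
alpha_prior <- rnorm(1000, mean = 6, sd = 2)
summary(exp(alpha_prior))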
# Define priors
priors <- c(prior(normal(6, 2), class = Intercept),
prior(normal(0, 3), class = b, coef=X),
prior(normal(0, 2), class = sigma))
# Fit the model (estimate parameters)
mfit <-
brm(formula = rt ~ 1+X,
data=dat,
family = lognormal(),
prior = priors,
chains = 4,cores = 4,
iter = 2000,warmup = 1000)
save(mfit,file="FittedModels/[Link]")
summary(mfit)
## Family: lognormal
## Links: mu = identity; sigma = identity
## Formula: rt ~ 1 + X
## Data: dat (Number of observations: 100)
## Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
## total post-warmup draws = 4000
##
## Population-Level Effects:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## Intercept 5.53 0.01 5.51 5.54 1.00 3600 3006
## X 0.08 0.01 0.06 0.10 1.00 3274 2570
##
## Family Specific Parameters:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## sigma 0.04 0.00 0.04 0.05 1.00 2695 2306
##
## Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
## and Tail_ESS are effective sample size measures, and Rhat is the potential
## scale reduction factor on split chains (at convergence, Rhat = 1).
What is the effect size?
The above posterior estimates give you effect sizes on the log-milliseconds scale, but we typically need
them on the raw milliseconds scale.
You can back-transform the effect of the predictor X to the milliseconds scale.
The effect size on the milliseconds scale is the difference in predicted response times when X = 1 compared to X = 0:
Effect = rt_pred(X=1) − rt_pred(X=0)
We know that
log rt_pred = α + βX
When X = 0,
log rt_pred = α, hence rt_pred(X=0) = e^α
When X = 1,
log rt_pred = α + β, hence rt_pred(X=1) = e^(α+β)
The effect size on the milliseconds scale is therefore rt_pred(X=1) − rt_pred(X=0) = e^(α+β) − e^α.
post_samples <- posterior_samples(mfit)
## Warning: Method ’posterior_samples’ is deprecated. Please see ?as_draws for
## recommended alternatives.
beta_raw <- exp(post_samples$b_Intercept + post_samples$b_X)-exp(post_samples$b_Intercept)
c(mean=mean(beta_raw),quantile(beta_raw,probs = c(.025,.975)))
## mean 2.5% 97.5%
## 20.71092 16.39950 25.16856
3 Logistic regression models
Consider another dependent variable in the same experiment: the response accuracy. If the participant was able to correctly identify the target shape, the response is recorded as 1, otherwise as 0. The experimenter hypothesizes that the responses are on average more accurate when the background color is black.
Suppose correct is the vector of responses, with correcti = 1 for a correct response in trial i and 0 for an incorrect one.
The responses can be assumed to be generated by a Bernoulli distribution.
correcti ∼ Bernoulli (θi )
where θi represents the probability of producing a correct response in trial i. The linear regression is defined on the log-odds of the parameter θi:
logit θi = α + βXi
or, equivalently,
log(θi / (1 − θi)) = α + βXi
You can directly write,
correcti ∼ Bernoulli (inverse-logit (α + βXi ))
The above model is called a logistic regression model.
The parameters α and β are in the log-odds (logit) space, so we need to define the priors accordingly:
α ∼ Normal (0, 1.5)
β ∼ Normal (0, 0.5)
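In R, the logit and inverse-logit functions are qlogis() and plogis(), respectively. To see what the Normal(0, 1.5) prior on α implies on the probability scale, one can pass prior draws through the inverse-logit; a minimal sketch:
# Prior draws for alpha, mapped to the probability scale
alpha_prior <- rnorm(1000, mean = 0, sd = 1.5)
hist(plogis(alpha_prior)) # spreads over (0, 1) without strongly favoring the extremes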
3.1 Implementation
dat$correct <- NA
for(i in 1:nrow(dat)){
dat$correct[i] <- rbinom(1,size=1, prob = plogis(1.5 - 0.5 * dat$X[i]))
}
# Define priors
priors <- c(prior(normal(0, 1.5), class = Intercept),
prior(normal(0, 0.5), class = b, coef=X))
# Fit the model (estimate parameters)
mfit <-
brm(formula = correct ~ 1+X,
data=dat,
family = bernoulli(link = logit),
prior = priors,
chains = 4,cores = 4,
iter = 2000,warmup = 1000)
save(mfit,file="FittedModels/[Link]")
summary(mfit)
## Family: bernoulli
## Links: mu = logit
## Formula: correct ~ 1 + X
## Data: dat (Number of observations: 100)
## Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
## total post-warmup draws = 4000
##
## Population-Level Effects:
## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
## Intercept 1.68 0.33 1.06 2.37 1.00 2961 2567
## X 0.08 0.36 -0.65 0.79 1.00 3254 2518
##
## Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
## and Tail_ESS are effective sample size measures, and Rhat is the potential
## scale reduction factor on split chains (at convergence, Rhat = 1).
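As with the lognormal model, these estimates are not on the outcome scale; they are log-odds. A minimal sketch back-transforming the posterior draws to predicted accuracies for the two background colors:
draws <- as_draws_df(mfit)
# Predicted accuracy for black (X = 0) and blue (X = 1) backgrounds
acc_black <- plogis(draws$b_Intercept)
acc_blue <- plogis(draws$b_Intercept + draws$b_X)
# Difference in accuracy on the probability scale
diff_acc <- acc_blue - acc_black
c(mean = mean(diff_acc), quantile(diff_acc, probs = c(.025, .975)))
Since the 95% credible interval for the slope crosses zero, this difference is also centered near zero; the data do not clearly support the hypothesized accuracy advantage for the black background.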