Bayesian Sample Size Justification

Aaron R. Caldwell

01 September, 2020

Introduction
In this study we plan on collecting data on 300 sport and exercise science research articles (100 from each of 3 journals). Based on the work of Büttner et al. (2020), we anticipate that at least 150 (50%) of the articles will include a hypothesis that was tested. Further, based on the work of Fanelli (2010), Scheel, Schijen, and Lakens (2020), and Büttner et al. (2020), we believe that the percentage of articles that find support for their hypothesis is greater than 80%. Given these existing data, we believe we have an informative prior on the underlying distribution of positive results, and have therefore opted for a Bayesian analysis of this primary endpoint. For this analysis, we will use the brms R package (Bürkner 2017).
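
As a setup sketch not present in the original document, the packages assumed by the examples below and by the appendix code can be loaded up front. Which package supplied the tidy() method for brmsfit objects is not stated in the document; broom.mixed is an assumption here.

# Packages assumed by the examples and the appendix code (a sketch, not from the original)
library(brms)        # Bayesian models via Stan (Bürkner 2017)
library(dplyr)       # %>%, mutate(), select(), filter()
library(purrr)       # map() in the simulation loop
library(tibble)      # tibble() in the simulation loop
library(tidyr)       # unnest() for the simulation results
library(broom.mixed) # tidy() for brmsfit objects (assumed source of tidy())
library(knitr)       # kable() tables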

Hypothesis
For this study, we hypothesize that the rate of positive results (i.e., studies that find at least partial support for their hypothesis) is greater than 80%. Therefore, the null hypothesis (H0) is that the proportion of positive results is less than or equal to .8, and our alternative is that it is greater than .8. No other effect is being estimated in this study; therefore, the intercept of the model is what will be tested.
H0 : Intercept ≤ .8
H1 : Intercept > .8

Prior Choice
The prior we selected for this analysis was informed by previous studies suggesting the true positive rate is approximately 85% (Fanelli 2010). However, we would like to avoid "spiking" the prior in favor of our hypothesis and therefore want a skeptical prior. In the work of Scheel, Schijen, and Lakens (2020) and Büttner et al. (2020), the estimated positive rates in original research investigations ranged from 82% to 92%, and some fields included in the survey by Fanelli (2010) observed rates as low as ~70%. Therefore, we selected a β(17, 3) prior, which is visualized below. This prior is centered around .85, but includes the possibility of higher (.9) and much lower (.7) proportions as compatible parameter estimates.
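
As a quick numerical check, not shown in the original document, the mean and central 95% interval implied by a β(17, 3) distribution can be computed directly in R:

# What the beta(17, 3) prior implies for the positive-result rate
17 / (17 + 3)                  # prior mean: 0.85
qbeta(c(.025, .975), 17, 3)    # central 95% prior interval: approximately 0.67 to 0.97

These values agree with the prior_b row of the posterior interval table reported later in this document.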

[Figure: density of the β(17, 3) prior over θ.]

Data Analysis Example

Below, we define this prior (prior_1), generate a simulated dataset (test_df), and then fit the model with the brm function (saved as m_test).

# Set prior: beta(17, 3) on the intercept, bounded to the probability scale
prior_1 = set_prior("beta(17, 3)", class = "b", lb = 0, ub = 1)

# Generate test data: one run of 150 articles with a true positive rate of 85%
test_df = data.frame(run = 1,
                     pos = rbinom(1, 150, .85),
                     N = 150) %>%
  mutate(rate = pos/N)

# Build model: binomial likelihood with an identity link, so the
# intercept is the proportion of positive results itself
m_test <- brm(
  pos | trials(N) ~ 0 + Intercept,
  family = binomial(link = "identity"),
  prior = prior_1,
  data = test_df,
  sample_prior = TRUE,  # keep prior draws for comparison with the posterior
  iter = 1e4,
  cores = 4,
  refresh = 0
)

We can then visualize the prior and posterior distributions from this model.

[Figure: overlaid prior and posterior densities for θ from the m_test model.]
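
The document does not show the plotting code; what follows is a minimal sketch of how the overlay could be produced from m_test. Because sample_prior = TRUE, the draws include a prior_b column alongside the posterior b_Intercept.

# Overlay the prior and posterior draws from the fitted model
library(ggplot2)

draws <- as.data.frame(m_test)   # includes b_Intercept (posterior) and prior_b (prior)

plot_df <- data.frame(
  theta = c(draws$b_Intercept, draws$prior_b),
  Type  = rep(c("Posterior", "Prior"), each = nrow(draws))
)

ggplot(plot_df, aes(x = theta, fill = Type)) +
  geom_density(alpha = 0.5) +
  labs(x = expression(theta), y = "Density")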

In addition, the hypothesis can be tested with the hypothesis function, and the posterior compatibility intervals (C.I.) can be extracted with the posterior_interval function.

# Test H1: Intercept > 0.8 against its complement
h_test <- hypothesis(m_test, "Intercept > 0.8")

knitr::kable(h_test$hypothesis, caption = "Hypothesis Test")

Table 1: Hypothesis Test

Hypothesis              Estimate   Est.Error  CI.Lower   CI.Upper   Evid.Ratio  Post.Prob  Star
(Intercept)-(0.8) > 0   0.0583896  0.0266125  0.0126561  0.0998934  48.62779    0.97985    *

test_pos = posterior_interval(m_test,
prob = .95)
knitr::kable(test_pos, caption = "95% Posterior C.I.")

Table 2: 95% Posterior C.I.

                   2.5%       97.5%
b_Intercept    0.8027355   0.9068057
prior_b        0.6707599   0.9668113
lp__          -5.3816301  -2.8567919

From this simulated scenario we find that, given the data, the hypothesis that the true positive-result rate is greater than 80% is 48.63 times more likely than the true value being less than 80%. However, this is only one simulated dataset; in order to estimate our "power", we will need to replicate this process over a thousand simulations.

Simulations
Now that we have established the process by which the data are analyzed, I will summarize the results of a simulation (1000 iterations) of the performance of this model. Please note that the code to reproduce these analyses can be found at the end of the document.
First, this analysis, under the previously stated assumptions, would yield a Bayes factor in favor of our hypothesis (BF > 3) 86.7% of the time. Below is a plot of the simulated Bayes factors (excluding Bayes factors > 100). As a note, in the final manuscript we also plan to report the posterior probabilities of our selected hypotheses.
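
A short sketch, assuming the hyp_df data frame assembled by the appendix code (one hypothesis() result row per simulation), of how the 86.7% figure and the plotted Bayes factors could be computed; the base hist() call stands in for whatever plotting code produced the original figure.

# Proportion of simulations with at least moderate evidence for H1 (BF > 3)
mean(hyp_df$Evid.Ratio > 3)

# Bayes factors retained for the histogram (excluding BF > 100)
bf_plot <- hyp_df$Evid.Ratio[hyp_df$Evid.Ratio <= 100]
hist(bf_plot, main = "Distribution of Bayes Factors in Simulation",
     xlab = "Bayes Factor")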

[Figure: "Distribution of Bayes Factors in Simulation", a histogram of simulated Bayes factors up to 100.]

Second, we have included a plot of the distribution of posterior credible intervals below. Approximately
41.6% of all CI lower bounds were greater than 80%.
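
A corresponding sketch, assuming the bin_est summary data frame from the appendix, whose tidy() output provides lower and upper columns for each simulation's 95% interval:

# Proportion of simulations whose 95% CI lower bound exceeds 0.8
mean(bin_est$lower > 0.8)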

[Figure: densities of the simulated 95% posterior intervals; panels show the point Estimate, the interval Width, the Lower Bound, and the Upper Bound.]

Conclusion
Overall, the data from this study will be adequate to test our hypothesis, since 86.7% of the simulations demonstrated at least some evidence for our hypothesis. Also, these simulations assumed that only 150 manuscripts would be analyzed and that the underlying positive-result rate is exactly 85%. In reality, we anticipate having more than 180 manuscripts (60% of the sample) with hypotheses to test, which will only increase the "power" of this Bayesian analysis.

Appendix: Code to Reproduce the Simulations

# Data generating function ---------------------------

gen_data = function(run, n, prop){
  # Simulate one study: n articles, each positive with probability prop
  df = data.frame(run = run,
                  pos = rbinom(1, n, prop),
                  N = n) %>%
    mutate(rate = pos/N)
  return(df)
}
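
# Illustrative call, not in the original: one simulated study of 150 articles
# with a true positive-result rate of 85%
# gen_data(run = 1, n = 150, prop = .85)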

# Build the initial model ---------------------------


initial_form = function(n = 150,
                        prop = .85,
                        sim_prior = set_prior("beta(17, 3)", class = "b", lb = 0, ub = 1)){
  # Fit the model once on an initial dataset; later runs update() this fit
  init_df = data.frame(run = 1,
                       pos = rbinom(1, n, prop),
                       N = n) %>%
    mutate(rate = pos/N)
  fit <- brm(
    pos | trials(N) ~ 0 + Intercept,
    family = binomial(link = "identity"),
    prior = sim_prior,
    data = init_df)
  return(fit)
}

# Set the parameters for the simulation ---------------------------


set.seed(08202020)
nsims = 1000
ci = .95
hyp_test = "Intercept > 0.8"
fit = initial_form(n = 150,
                   prop = .85,
                   sim_prior = set_prior("beta(17, 3)", class = "b", lb = 0, ub = 1))
# Empty data frame to collect the simulation runs
bin_sims = data.frame(run = NA,
                      d = NA,
                      fit = NA)
bin_sims = bin_sims[FALSE, ]

## Split simulations ---------------------------


# Must run in parts due to a C-level error (possibly memory issues)
for (i in 1:10) {
  # Unique run IDs for each batch of nsims/10 simulations
  bin_run = tibble(run = ((i - 1) * nsims/10 + 1):(i * nsims/10)) %>%
    mutate(d = map(run, gen_data, n = 150, prop = .85)) %>%
    mutate(fit = map(d, ~update(fit, newdata = .x, refresh = 0)))
  bin_sims = rbind(bin_sims, bin_run)
}

## Calculate estimates ---------------------------


bin_est = bin_sims %>%
  mutate(test = map(fit, tidy, prob = ci)) %>%
  unnest(test) %>%
  filter(term == "b_Intercept") %>%
  select(-d, -fit) %>%
  mutate(width = upper - lower)

## Calculate hypothesis tests ---------------------------


bin_hyp = bin_sims %>%
  mutate(hyp = map(fit, hypothesis, hyp_test)) %>%
  select(run, hyp)

# Empty data frame with the same columns as one hypothesis() result
hyp_df = data.frame(1, 2, 3, 4, 5, 6, 7, 8)
colnames(hyp_df) = colnames(bin_hyp$hyp[[1]]$hypothesis)
hyp_df = hyp_df[FALSE, ]

# Collect all hypothesis test results
for (i in 1:nrow(bin_hyp)){
  hyp_df = rbind(hyp_df, as.data.frame(bin_hyp$hyp[[i]]$hypothesis))
}

#save.image(file = "sin_v2.RData")

References
Bürkner, Paul-Christian. 2017. "brms: An R Package for Bayesian Multilevel Models Using Stan." Journal of Statistical Software 80 (1): 1–28. https://doi.org/10.18637/jss.v080.i01.

Büttner, Fionn, Elaine Toomey, Shane McClean, Mark Roe, and Eamonn Delahunt. 2020. "Are Questionable Research Practices Facilitating New Discoveries in Sport and Exercise Medicine? The Proportion of Supported Hypotheses Is Implausibly High." British Journal of Sports Medicine, July, bjsports–2019–101863. https://doi.org/10.1136/bjsports-2019-101863.

Fanelli, Daniele. 2010. "'Positive' Results Increase down the Hierarchy of the Sciences." Edited by Enrico Scalas. PLoS ONE 5 (4): e10068. https://doi.org/10.1371/journal.pone.0010068.

Scheel, Anne M., Mitchell Schijen, and Daniel Lakens. 2020. "An Excess of Positive Results: Comparing the Standard Psychology Literature with Registered Reports," February. https://doi.org/10.31234/osf.io/p6e9c.
