Power and Sample-Size Calculations For Poisson Regression

The document discusses methods for calculating power and sample size for Poisson regression, highlighting the limitations of standard software and the use of Normal approximation and simulation. It provides formulas for sample size calculations under various test scenarios, including non-equality, equivalence, and non-inferiority tests, while emphasizing the importance of identifying the response variable correctly. Additionally, it addresses the effects of covariates and the need for sensitivity analysis in determining sample size and power adjustments.

Uploaded by

Peter Lane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

79 views4 pages

Power and Sample-Size Calculations For Poisson Regression

Uploaded by

Peter Lane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Power and sample-size calculation for Poisson regression

6 February 2003, revised 7 Feb

Peter Lane, Research Statistics Unit

Background
Standard software such as nQuery and Pass do not provide for the calculation of
power or sample size from Poisson regression. There may be specialist packages,
but I am not aware of any. The GSK Technical Reference Document on sample
size also does not mention Poisson models. The two approaches I have used are to
use a Normal approximation, and to use simulation.

One reference is: David F Signorini (1991), Sample size for Poisson regression.
Biometrika, 78, 446-450. But this deals specifically with a quantitative
explanatory variable and is concerned with the distribution of values of that
variable.

Normal approximation
In a simple Poisson regression model, the response is assumed to have a Poisson
distribution with a separate mean for each treatment group. The model can be
extended to include other explanatory variables, including covariates and class
effects, in the same way as in a linear model except that it is usual to combine the
effects multiplicatively. This is done because it has been found in practice to
provide concise explanation of variation in data with Poisson distributions, and
because count data can often be modelled in terms of probabilities which
combine multiplicatively. It is effected in practice by using a logarithmic "link
function", with effects contributing to an additive "linear predictor" on this
transformed scale.

In the simple model with two treatment groups, we have

yij ~ P(μi), i=1,2, j=1… ni
The estimate of each mean μi is
mi = Σ(yij, j=1… ni)/ni
which has an approximate Normal distribution by the Central Limit Theorem
mi ~ N(μi, μi/ni)
so the difference between the two means is also approximately Normal
m2– m1 ~ N(μ2–μ1, (μ1+μ2)/n), if n1 = n2 = n
If there is overdispersion, the variance of this estimate is modified to
φ(μ1+μ2)/n
where is an estimate of overdispersion. I recommend the deviance estimate,
φD = (residual deviance) / (residual d.f.)
which corresponds to the DSCALE option of the MODEL statement in Proc
Mixed of SAS. The alternative is the Pearson estimate, corresponding to the
PSCALE option.

With a modest number of patients in each arm, say ni >10, the Normal
approximation will be good, as long as the means are not too close to 0, say μi >5.
(I will produce a table indicating how good this approximation is for a range of
combinations of n and μ.) The standard sample-size formula for a t-test with
Normal data can then be applied, giving the number required for each arm, with
equal allocation, as
n = (Z(1–β)+Z(1–α/2))2 * φ(μ1+μ2)/( μ2–μ1)2
where Z(p) is the Normal equivalent deviate for cumulative probability p, α and β
are the Type I and II Errors. The required values of μ1 and μ2 depend on the type
of test to be carried out.

Unequal allocation
If the allocation ratio n2/n1 is r, the formula becomes
n1 = (Z(1–β)+Z(1–α/2))2 * φ(μ1+μ2/r)/( μ2–μ1)2
So, for example, for r=2, so that there are more observations on the test drug than
for the reference, the total number required is
n1+n2 = (Z(1–β)+Z(1–α/2))2 * 3φ(μ1+μ2/2)/( μ1–μ2)2
which is about 13% higher (taking μ1=μ2) than for r=1. For r=3 it is about 33%
higher than r=1.

Use of the formula

It is important to identify the response variable carefully. The Poisson model is
appropriate for analysing actual counts, because the Poisson distribution is the
distribution of the counts resulting from observing a Poisson process over a fixed
period of time. If the counts are scaled, to give a rate for a different interval of
time than what was observed, then the distribution will no longer be Poisson. If
the counts are scaled to give a rate by dividing by ρ, the mean of the rate will be
μ/ρ but the variance will be μ/ρ2; so the distribution could be modelled as Poisson
with underdispersion (φ=1/ρ). But it is better to model the original counts
themselves.
In a standard two-sided test of non-equality, or difference, between two
treatments (conventionally referred to as a test of superiority), it is usual to
calculate sample size on the basis of a pre-assigned value for a difference between
the means, called the clinically relevant difference, denoted here by δ=μ2–μ1. For
Normal data, all that is actually required is this difference; but for Poisson data,
we need both means (giving the formula as above) or δ and one mean, say μ1 for a
reference treatment (placebo or standard active), for which the formula becomes
n = (Z(1–β)+Z(1–α/2))2 * φ(2μ1+δ)/δ2.

A test of superiority is just a one-sided test of difference. The formula is

n = (Z(1–β)+Z(1–α))2 * φ(2μ1+δ)/δ2,
but in practice, for example for drug submissions to regulators, the Type I Error is
required to be half that for the corresponding test of non-equality, so the same
sample size is needed.

In a test of equivalence between two treatments, it is usual to calculate sample

size based on equality of the means. This is done with an "intersection-union"
test, which is described as TOST or two one-sided tests, which requires the
specification of a tolerance, τ, representing the maximum acceptable value of the
difference between the means. The formula for sample size is similar to that for a
non-equality test, but the Type II Error is divided by 2 and the Type I Error
multiplied by 2:
n = (Z(1–β/2)+Z(1–α))2 * 2φμ1/τ 2.
(But see the modification below, in the section on sensitivity analysis).

The Type I Error for this combined test is actually the same as that for each one-
sided test, but in practice for drug submissions (as laid out in ICH E9) it is
required to be half of what would be used for two-sided tests, so the Z(1–α)
component of this formula will be the same as the corresponding component in a
test of non-equality.

In a test of non-inferiority, sample size is again usually calculated on the basis of

equal means for the two treatments. This test is just a one-sided version of the
equivalence test, and the formula becomes
n = (Z(1–β)+Z(1–α))2 * 2φμ1/τ 2.
If a difference δ is assumed between the treatments (still with δ=μ2–μ1, so that δ
is positive if the test mean is greater than the reference mean), the formula
becomes
n = (Z(1–β)+Z(1–α))2 * φ(2μ1+δ)/(τ+δ) 2.
Effect of covariates
If effects of covariates are to be included in a Poisson regression, they should be
combined multiplicatively. With a linear model, the power of the tests described
above would be decreased slightly by the inclusion of covariates, because there
are fewer d.f. for estimating the variance; but this effect is minimal when there are
many patients. With a Poisson model, there is actually no effect on the power,
though in practice the overdispersion parameter has to be estimated in the same
way as the variance in a Normal model. The means used in the calculations are
also not affected, as long as the usual assumption is made in advance, that the
covariates have zero effect. The fact that the model will actually be analysed with
a log link function does not affect the power calculation, because the same model
is being hypothesized if the means are expressed on the natural scale as when they
are expressed on a log scale.

Sensitivity analysis
As with any model, it is important to conduct a sensitivity analysis to establish
how the required sample size, or the power with a given sample size, changes as
the assumed values change. An extra consideration in this model is the
overdispersion factor. It should also be borne in mind that observed values of a
mean that are greater than the hypothesized clinically relevant difference will be
associated with a greater variance as well. In a non-equality, superiority or non-
inferiority test the extra variance will be more than compensated for by the extra
difference between the means. But, in an equivalence test, if the test mean proves
larger than the reference mean, the variance will also be larger than used in the
formula, so it would be prudent to adjust the formula to
n = (Z(1–β/2)+Z(1–α))2 * 2φ(μ1+τ)/τ 2.

Analysis
In the analysis, the log link has to be used, in order to include the effect of
covariates. With this parameterization, the tolerance level to use in a non-
inferiority test needs to be specified on the log scale as well. It is not practicable
to specify a value on the scale of the response variable itself. Instead, the
tolerance level should be specified as a percentage of the mean of the reference
treatment.

Nonparametric Statistics
No ratings yet
Nonparametric Statistics
32 pages
Sample Size Determination: Maj. Tun Tun Win
No ratings yet
Sample Size Determination: Maj. Tun Tun Win
38 pages
Sample Size and Power Calculations in Repeated Measurement Analysis (2001)
No ratings yet
Sample Size and Power Calculations in Repeated Measurement Analysis (2001)
4 pages
Importance of Sample Size in Research
No ratings yet
Importance of Sample Size in Research
98 pages
Sample Size
No ratings yet
Sample Size
6 pages
Ie352l1 Labmanual
No ratings yet
Ie352l1 Labmanual
90 pages
''Sample Size Calculation For Comparing Proportions'' in - Wiley Encyclopedia of Clinical Trials Réf 241
No ratings yet
''Sample Size Calculation For Comparing Proportions'' in - Wiley Encyclopedia of Clinical Trials Réf 241
11 pages
Comparisons of Superiority, Non-Inferiority, and Equivalence Trials With Sample Size Calculation
No ratings yet
Comparisons of Superiority, Non-Inferiority, and Equivalence Trials With Sample Size Calculation
4 pages
Determinacion Tamaños Muestra Exp Clinicos (Correlación)
No ratings yet
Determinacion Tamaños Muestra Exp Clinicos (Correlación)
7 pages
Lab 4 - Hypothesis Testing
No ratings yet
Lab 4 - Hypothesis Testing
5 pages
G Power Calculation
100% (1)
G Power Calculation
9 pages
Pi Is 2058534917300732
No ratings yet
Pi Is 2058534917300732
3 pages
Sample Size & Power Analysis Guide
No ratings yet
Sample Size & Power Analysis Guide
11 pages
Stat Module 2 Q3
No ratings yet
Stat Module 2 Q3
21 pages
Tests For Paired Sensitivities
No ratings yet
Tests For Paired Sensitivities
10 pages
Inferential Statistics For Print Part I
No ratings yet
Inferential Statistics For Print Part I
23 pages
Nays
No ratings yet
Nays
89 pages
B2 - 1 Sample Size
No ratings yet
B2 - 1 Sample Size
22 pages
Sample Size Determination in Statistics
No ratings yet
Sample Size Determination in Statistics
14 pages
Poisson Regression Analysis in Stata
No ratings yet
Poisson Regression Analysis in Stata
23 pages
Data Analysis of Research Methodology
No ratings yet
Data Analysis of Research Methodology
14 pages
Stats For Primary FRCA
No ratings yet
Stats For Primary FRCA
7 pages
Statistical Concepts in Probability Distributions
No ratings yet
Statistical Concepts in Probability Distributions
59 pages
Theory of Statistics Exam Papers
No ratings yet
Theory of Statistics Exam Papers
42 pages
Community Project: Checking Normality For Parametric Tests in SPSS
No ratings yet
Community Project: Checking Normality For Parametric Tests in SPSS
4 pages
Samenvatting Statistiek 10tm17
No ratings yet
Samenvatting Statistiek 10tm17
11 pages
Tests For One Poisson Mean
No ratings yet
Tests For One Poisson Mean
9 pages
Stat Lea Int Cal PDF
No ratings yet
Stat Lea Int Cal PDF
5 pages
Molecular Genetics Exam
No ratings yet
Molecular Genetics Exam
3 pages
Statistical Hypothesis Testing Exercises
No ratings yet
Statistical Hypothesis Testing Exercises
5 pages
Metlit 10-Besar Sampel - 20210920
No ratings yet
Metlit 10-Besar Sampel - 20210920
41 pages
Biostatistics L11+12 2021
No ratings yet
Biostatistics L11+12 2021
9 pages
Linear Regression Model - Applied - Part 3
No ratings yet
Linear Regression Model - Applied - Part 3
40 pages
Ho Sample Size
No ratings yet
Ho Sample Size
5 pages
Mohd B. Makmor Bakry, PH.D., R.PH
No ratings yet
Mohd B. Makmor Bakry, PH.D., R.PH
12 pages
How To Calculate Sample Size F
No ratings yet
How To Calculate Sample Size F
9 pages
Two-Sample T-Tests Assuming Equal Variance
No ratings yet
Two-Sample T-Tests Assuming Equal Variance
19 pages
Two-Sample T-Tests Assuming Equal Variance
No ratings yet
Two-Sample T-Tests Assuming Equal Variance
19 pages
Biostatistics Exam Questions & Answers
No ratings yet
Biostatistics Exam Questions & Answers
12 pages
Exercises 5 4
No ratings yet
Exercises 5 4
8 pages
Sample Size Calculations For Evaluating Mediation
No ratings yet
Sample Size Calculations For Evaluating Mediation
17 pages
Sample Size Calculation in Biostatistics
No ratings yet
Sample Size Calculation in Biostatistics
38 pages
CRJ 503 PARAMETRIC TESTS Differences
No ratings yet
CRJ 503 PARAMETRIC TESTS Differences
10 pages
English Papers
No ratings yet
English Papers
6 pages
Assignment 3 Quants-Fomulas New
No ratings yet
Assignment 3 Quants-Fomulas New
6 pages
Tests For The Ratio of Two Poisson Rates
No ratings yet
Tests For The Ratio of Two Poisson Rates
15 pages
Effect Size Calculator Guide in Excel
No ratings yet
Effect Size Calculator Guide in Excel
3 pages
11paired T
No ratings yet
11paired T
49 pages
9 Statistical Considerations 2001 Surgical Research
No ratings yet
9 Statistical Considerations 2001 Surgical Research
10 pages
wk4lectureESCI 24
No ratings yet
wk4lectureESCI 24
112 pages
Biostatistics for Researchers
No ratings yet
Biostatistics for Researchers
33 pages
Understanding Research Design and Statistics
No ratings yet
Understanding Research Design and Statistics
7 pages
Zimmerman 2012 A Note On Consistency of Non-Parametric Rank Tests and Related Rank Transformations
No ratings yet
Zimmerman 2012 A Note On Consistency of Non-Parametric Rank Tests and Related Rank Transformations
23 pages
Nonparametric Effect Size Estimators
No ratings yet
Nonparametric Effect Size Estimators
2 pages
Sample Size Calculation Guide
No ratings yet
Sample Size Calculation Guide
27 pages
Exam November Stats202 2004
No ratings yet
Exam November Stats202 2004
6 pages
Two-Sample T-Tests Using Effect Size
No ratings yet
Two-Sample T-Tests Using Effect Size
11 pages
Power and Sample Size Analysis Guide
No ratings yet
Power and Sample Size Analysis Guide
13 pages
Bivariate Distribution Theory Explained
No ratings yet
Bivariate Distribution Theory Explained
8 pages
Solved Probability Problems and Examples
0% (1)
Solved Probability Problems and Examples
3 pages
Probability - YiXiang PDF
No ratings yet
Probability - YiXiang PDF
15 pages
SRM University - AP, Andhra Pradesh: Neerukonda, Mangalagiri Mandal Guntur District, Mangalagiri, Andhra Pradesh 522240
No ratings yet
SRM University - AP, Andhra Pradesh: Neerukonda, Mangalagiri Mandal Guntur District, Mangalagiri, Andhra Pradesh 522240
5 pages
Probability & Statistics Exam 2025
No ratings yet
Probability & Statistics Exam 2025
2 pages
Variability in The Log Domain and Limitations To Its Approximation - CPT Pharmacom Syst Pharma - 2020 - Elassaiss Schaap
No ratings yet
Variability in The Log Domain and Limitations To Its Approximation - CPT Pharmacom Syst Pharma - 2020 - Elassaiss Schaap
13 pages
Maximum Likelihood Estimator Guide
No ratings yet
Maximum Likelihood Estimator Guide
14 pages
Evans Analytics2e PPT 05
No ratings yet
Evans Analytics2e PPT 05
65 pages
Probability Concepts and Calculations Guide
100% (1)
Probability Concepts and Calculations Guide
2 pages
Probability Concepts and Applications
No ratings yet
Probability Concepts and Applications
33 pages
A Review Constructing Priors That Penalizes The Complexity of Gaussian Random Fields - Fuglstad Et Al
No ratings yet
A Review Constructing Priors That Penalizes The Complexity of Gaussian Random Fields - Fuglstad Et Al
8 pages
EE3110 Jul 2024 Tutorial4
No ratings yet
EE3110 Jul 2024 Tutorial4
3 pages
Chapter 8 Part
No ratings yet
Chapter 8 Part
29 pages
TQ
No ratings yet
TQ
2 pages
HomeworkCh4 22
No ratings yet
HomeworkCh4 22
1 page
Stat
No ratings yet
Stat
2 pages
Mechanical Measurements
No ratings yet
Mechanical Measurements
22 pages
Massachusetts Institute of Technology: (Final Exam - Spring 2009)
No ratings yet
Massachusetts Institute of Technology: (Final Exam - Spring 2009)
14 pages
Embankment Prediction - 12 - Zheng
No ratings yet
Embankment Prediction - 12 - Zheng
13 pages
Probability and Statistics Exam Paper
No ratings yet
Probability and Statistics Exam Paper
8 pages
Cirran - 1995 - Valuing Asian and Portfolio Options by Conditioning On The Geometric Mean Price - 2
No ratings yet
Cirran - 1995 - Valuing Asian and Portfolio Options by Conditioning On The Geometric Mean Price - 2
8 pages
Tabela de Gestao de Banca
No ratings yet
Tabela de Gestao de Banca
59 pages
Statistics Assignment 2
No ratings yet
Statistics Assignment 2
21 pages
5.random Variable
No ratings yet
5.random Variable
28 pages
Standard Normal Curve Table
No ratings yet
Standard Normal Curve Table
1 page
Surveying II (AR SIR) 4th Sem
No ratings yet
Surveying II (AR SIR) 4th Sem
23 pages
Statistics Exercise Solution
100% (1)
Statistics Exercise Solution
19 pages
Introductory Statistics Note Sheet
No ratings yet
Introductory Statistics Note Sheet
2 pages
Inferential Statistics Lecture 4
No ratings yet
Inferential Statistics Lecture 4
4 pages
Maximum Likelihood Estimation Lecture
No ratings yet
Maximum Likelihood Estimation Lecture
22 pages