Set-up
Assume that we have p-values . Assume that they are computed from z-scores
(test statistics following normal distributions). Let
and let
. Without loss of generality, assume that each test statistic
has variance 1. With this, we can express the p-values as
where is the CDF function of the standard normal distribution.
We are interested in testing the global null hypothesis .
The Cauchy combination test
Assume that we have some random vector such that
for all
,
and
is independent of
. The test statistic for the Cauchy combination test, proposed by Liu & Xie 2020 (Reference 1), is
Under the global null, for each
, implying that
has the standard Cauchy distribution (see this previous post). If the p-values are independent, then Proposition 1 in this other previous post implies that
has the standard Cauchy distribution.
In turns out that even if the p-values are dependent, is still approximately Cauchy! This approximation is formalized in Theorem 1 of Reference 1:
Theorem. Suppose that for any
,
follows a bivariate normal distribution. Suppose also that
. Let
be a standard Cauchy random variable. Then for any fixed
and any correlation matrix
, we have
The theorem says that the test statistic has approximately a Cauchy tail even under dependency structures in
. Knowing the (approximate) distribution of
under the global null allows us to use it as a test statistic.
Other notes
- The “bivariate normal distribution” condition is a bit of a technical assumption, the authors claim it is a mild assumption.
- This result bears a lot of similarity with a result by Pillai & Meng 2016 (see this previous post for a description). Section 2.2 of Reference 1 discusses the similarities and the differences.
- Theorem 1 above holds for fixed
(number of p-values). Section 2.3 of Reference 1 has a high-dimensional asymptotic result where
with
.
- The Cauchy combination test is especially powerful when only a small number of
‘s are large, or equivalently when a smaller number of
‘s are very small. We can see this intuitively: small
‘s become very large
‘s, so the test statistic will be dominated by a few very large p-values. See Section 4.2 of Reference 1 for a power comparison study.
References:
- Liu, Y., and Xie, J. (2020). Cauchy Combination Test: A Powerful Test With Analytic p-Value Calculation Under Arbitrary Dependency Structures.