Many Variables
Many Variables
ASSESSING METHODS-
MANY VARIABLES
are less familiar in examples (3) to (5) than in examples (1) and diseases) for town B is expected to be 8' 0 less than in town A,
(2). Authors should communicate their findings by mentioning so that if town A's SMR is 100, town B has SMR-- 92, whereas
the methodology briefly but concentrate on the clinical inter- if town A has SMR of 150, town B's SMR will be 138. This
pretation and applicability of the results in language that is is what we mean by a constant proportional effect of water
familiar to readers. A good example is the description by hardness on SMR. A multiplicative (proportional) model for
Clayton et al3 of a predictive index for postoperative deep vein SMR is equivalent to an additive model for log SMR and so
thrombosis. The common thread in the five examples is the Pocock et al used the logarithm of SMR as the dependent
combining of several explanatory variables to describe or variable in a multiple regression model. The dependent and
predict an outcome. Keep that idea firmly in mind and the explanatory variables are different from those described in the
barrier of transforming the problem for analytic reasons multiple regression model above but transforming the outcome
becomes trivial. Also good reporting should emphasise the variable has induced the same structure, and the method of
clinical interpretation of results, and how they should be used- analysis is the same thereafter. Figure 1 shows actual SMR
to help decision making or for test reduction.6 It is important against predicted SMR, prediction being based on the set of
for doctors to recognise when methods, as described here, five explanatory variables in table I. The authors noted that some
might be applied in their specialty and to seek statistical towns, especially in Scotland, had a higher cardiovascular
collaboration on what are often exciting and challenging mortality than the model predicted, and this geographical
problems-statistically as well as clinically. clustering in mortality is being investigated. Departures from a
(1) Mean heart rate during operationto depends on the predicted model, as in this case, often suggest new lines of
surgeon's age and seniority, resting heart rate, the type and research. The performance of a regression model should always
length of the operation, scrub-up time, medication (such as
beta-blockade), and so on. The problem is to combine some or
all of these explanatory variables in a prediction equation for
mean heart rate during the operation to give a better under-
standing of their relative importance. The outcomes for
* Hamilton
individual surgeons will, of course, deviate randomly from the
Kilmarnock,
Rhondda.
Airdrie
.Motherwell
140 Ayr .Dewsbury *
predicted means, and this accounts for the error terms that Dumfries *Haiifax
,
bedevil statistics. Suppose that mean heart rates from 80 to 130- Invernessm-e.rmEn .
140 beats per minute have been observed. Possibly the residual CaerphillyO2 *** 2 *
random variation increases with the increase in predicted mean w 120-
Stafford * 2 2
heart rate, but in the first instance constant variance could be (' 110- * 2.
assumed. Common sense suggests that a reasonable form for the
prediction equation is a baseline mean heart rate Po, which is M 100 ~~~~0
**@-
L
*@@ S
0
*
010 030. *050 070 0.90 1 00 timolol or matched placebo.7 By chance-assuming no randomis-
0
0
Probability, p ation leak-the placebo group includes a higher proportion of
-J -1.0 patients aged 65 years or older, patients with arrhythmias in the
acute stage, patients with a clinical history of treated hypertension,
or patients who had taken diuretics. Of course, these variables
are interrelated, older patients being more likely to have
.
arrhythmias in the acute stage and to have been on diuretic
treatment before admission, but it will be important to check
-3.0 1: that a definite effect of treatment persists after allowing retro-
spectively for moderate imbalance between the groups as
randomised. Attention focuses on the time-specific risk of
dying or hazard, a convenient measure of the changing force
-4-0 0 of mortality in time. If we assume a proportional hazards model
-a reference point that is not always validl3 then we claim
that there is a basic form of hazard which is proportionately
-5.0 J Logit (0-5) =0; Logit (0 3)=- Logit (0.7) etc increased or decreased according to the set of explanatory
The log odds or logit transformation. variables which describes a particular patient.
The constant of proportionality is most often taken to be the
exponential of a linear combination of the explanatory variables-
log odds scale is one of several on which proportions can be exponentiation being to ensure that the final estimates are
analysed.) Clayton et al called the log odds scale the predictive non-negative. Whether the patient was randomised to timolol
index. The five out of 10 variables that they identified as having would be included as one of the explanatory variables and if the
the best predictive power gave the following index treatment effect were still significant after adjustment for risk
factors as described above, then the investigators would be
I -113 0 009x +0 22x2+0 085x3+0 043x, +2 19x5 reassured that the advantage was probably genuine and not an
artefact because of moderate imbalance between the randomised
where xl --euglobulin lysis time (minutes), x2 =concentration of groups.7 It is certainly not the case, as Mitchell suggests,14 that
fibrin-related antigen (mg/l), X3-- age (years), x4 = percentage an imbalance in risk factors, significant at the 1 00 level,
overweight for height, and x1=presence or absence of varicose necessarily translates into a survival difference, significant also
veins (scored as 1 or 0 respectively). at the 100 level. What the implications for survival are may be
High positive values of the predictive index are associated sorted out by an analysis of the type described above. Perhaps
with a high risk of developing postoperative deep vein thrombosis. Professor Mitchell's whimsical account was intended as a rebuke
In a prospective study of 62 new cases,81 using a cut-off point against the failure to report fully a sophisticated analysis ?
of -2 correctly identified nine out of 10 patients and incorrectly Regression models for survival can be adapted easily to give
identified seven out of 52 patients as being at risk of developing a prognostic classification of patients as in chronic lymphocytic
postoperative deep vein thrombosis (see figure 3). This dis- leukaemia'" or breast cancer.8
crimination was as good as that obtained on the original data. Fairly complex statistical methods have been discussed in
Validating prediction models on the original data, which they relation to the problems in this commentary. Complexity is not
were designed to reproduce, gives an overoptimistic view of their always necessary. Comparing statistical techniques in the
performance. Only by testing models on new data is their worth context of diagnosing hypothyroidism, Gardner and Barker'
firmly established. concluded that a simple method-counting the total number of
(4) Kidger et a112 described a scoring system for orofacial symptoms present-was as effective in determining a rule for
904 BRITISH MEDICAL JOURNAL VOLUME 283 3 OCTOBER 1981
(22) Calcium as a measure of water quality was not included in Analysing the completed four-way contingency table (table II)
the five-variable regression model for log SMR reported by answers questions such as: Is tumour size larger when there
Pocock et al.4 Does this mean that water calcium is irrelevant
as anexplanatory variable for cardiovascular mortality ? TABLE iI-Cross tabulation by clinicalfeatures
-no Other signs absent: Other signs present:
fixation/ulceration fixation/ulceration
fg.. variables are 0 AIAI..
-excluded superfluous, 44444
not irrelevant 1 2 3 1 2 3
Nodes 1
Size 1 249 160 4 7 12 1
2 206 531 9 12 103 10
3 74 242 13 16 131 50
COMMENT Nodes 2
Size 1 83 69 0
2 118 305 10
The results
referThralas of Pocock et al4
mrecomplicatdec metods imply that the variables
do notanthomsolnietal modifiedha 3 45 174 11
excluded from the prediction are useless indicators of cardio- Nodes 3
atreditivenA inde forescsfu
wichal patients.solbew asked tougv
vascular mortality. The implication is rather that, after taking Size 1
2
account of the information already in the prediction equation, 3
the remaining variables are redundant in the sense of not
Factor Coding
adding anything further to the explanation. A different regression Tumour size Size .2 cm 1
model-in which water calcium replaced total water hardness 3-4 cm 2
)5 cm 3
as a measure of water quality-could have been proposed Fixation/ulceration No fixation, no ulceration 1
Fixation, no ulceration 2
without serious loss. Regression models give an explanation Ulceration 3
not the explanation, because selection of predictor variables Homolateral axillary nodes Not affected 1
Mobile 2
requires judgment as well as technical skill. Matted 3
treatment
A in. pa4t.44Ho wu you
are other signs present ? Does increased tumour size mean more
disease in the homolateral axillary nodes ? Does the apparent
association of tumour size and fixation resolve if we take into
account the state of the axillary nodes ? Is there an important
second-order interaction of fixation/ulceration, nodal disease,
and presence/absence of other signs, for example ?
These questions are not answered satisfactorily by considering
only two factors at a time. A statistician would usually be
consulted about the analysis of multi-way contingency tables.
expect the statistician to allowfor this in a regression model that The usefulness of this approach is that it leads to general
takes account of several risk factors ? inferences about how a disease such as breast cancer presents.
I am grateful to Dr S J Pocock and colleagues for permission to
-by including an interaction term reproduce figure 1 and to Dr A J Crandon and others for figure 3
under question 21.
COMMENT
The effect of treatment A would be measured by two indicator References
variables, one pointing to male patients, the other to female 1 Gardner MJ, Barker DJP. Diagnosis of hypothyroidism: a comparison of
patients. If the corresponding estimated coefficients were statistical techniques. Br MedJ7 1975;ii:260-2.
significantly different the effect of treatment A probably differs 2Teasdale G, Parker L, Murray G, Knill-Jones R, Jennett B. Predicting
between the sexes. Interaction terms are not convincing unless the outcome of individual patients in the first week after severe head
injury. Acta Neurochir 1979;suppl 28:161-4.
there is a sensible interpretation of them, or prior justification Clayton JK, Anderson JA, McNicol GP. Preoperative prediction of
for their inclusion. Doctors should advise the statistician if they postoperative deep vein thrombosis. Br MedJ3 1976;ii:910-2.
have good reason to suspect that a treatment will be more 4 Pocock SJ, Shaper AG, Cook DG, et al., British Regional Heart Study:
effective in one group of patients than another. geographic variations in cardiovascular mortality, and the role of water
quality. Br Med J 1980;280:1243-9.
5 Bouckaert A. Computer diagnosis of goitres. III Optimal subsympto-
(24) Tumour size, fixation/ulceration, clinical disease of the matologies. 7 Chron Dis 1971 ;24:321-7.
6 Card WI, Emerson PA. Test reduction. I Introduction and review of
homolateral axillary nodes, and presence/absence of other signs published work. Br MedJ 1980;281:543-5.
(associated with poor prognosis) were recorded for 3695 of the 7The Norwegian Multicenter Study Group. Timolol-induced reduction in
BRITISH MEDICAL JOURNAL VOLUME 283 3 OCTOBER 1981 905
mortality and reinfarction in patients surviving acute myocardial Gore SM. Assessing methods-survival. Br Med3' 1981;283:840-3.
infarction. N Eog! 7 Aled 1981 ;304:801-7. ' Mitchell JRA. Timolol after myocardial infarction: an answer or a new
Mycrs MH, Axtell LM, Zelen M. The use of prognostic factors in set of questions ? Br Med )I 1981 ;282:1565-70.
predicting survival for breast cancer patients. 7 Chroni Dis 1966;19: Montserrat E, Rozman C. Subclassification of stage II chronic lymphocytic
923-33. leukaemia with prognostic and therapeutic implications. Lancet 1979;
9 Gore SMNI. Assessing mcthods-a feel for other things. Br Mled _7 1981; ii :854.
283:775-7. '6 Thomson HJ, Miller SS, Gore SM, Bayliss A. Consent for mastectomy.
Foster GE, Evans DF, Hardcastle JD. Heart-rates of surgeons during Br MedJ7 1980;281:1097-8.
operations and other clinical activities and thcir modification by
oxprenolol. Lancet 1978; i:1323-5.
Crandon AJ, Pecel KR, Anderson JA, Thompson V, McNicol GP. Sheila M Gore, MA, is a statistician in the MRC Biostatistics Unit,
Postoperative deep vein thrombosis: identifying high-risk patients. Medical Research Council Centre, Hills Road, Cambridge CB2 2QH.
Br MIcd]7 1980;281:343-4.
' Kidger T, Barnes TRE, Trauer T, Taylor PJ. Sub-syndromes of tardive
dyskinesia. Psychlol Med 1980 ;10 :513-20. No reprints zill be available from the auithor.