Fe01 Introduction
Fe01 Introduction
Epidemiology (FE01)
EPM101 Fundamentals of Epidemiology
Course: PG Diploma/ MSc Epidemiology
This document contains a copy of the study material located within the computer
assisted learning (CAL) session.
If you have any questions regarding this document or your course, please contact
DLsupport via DLsupport@[Link].
Important note: this document does not replace the CAL material found on your
module CDROM. When studying this session, please ensure you work through the
CDROM material first. This document can then be used for revision purposes to
refer back to specific sessions.
These study materials have been prepared by the London School of Hygiene & Tropical Medicine as part of
the PG Diploma/MSc Epidemiology distance learning course. This material is not licensed either for resale
or further copying.
Objectives
After completing this session, you should be able to:
explain the difference between descriptive and analytic studies, and between
observational and interventional epidemiology
express delight and enthusiasm for this study module and epidemiology in general,
and rush off to start the course immediately
Output: (these topics play in a ‘slideshow’ fashion at about 5 second intervals, with
respective headline graphics appearing on the right)
The focus of this study module is primarily the contribution of epidemiology to public
health medicine. However, epidemiology also plays a role in clinical medicine and in
pharmacological studies. These applications of epidemiology are not covered in detail
in EP101, although the concepts and methods we discuss apply equally to studies in
these fields.
By counting the number of health-related events which occur within a specified time
in different populations we can make estimates of the frequency of these events,
which we can compare in different populations.
For example, we can compare the infant mortality rate in different countries and
at different times.
In England and Wales the infant mortality rate was 6.6 per 1000 live births in 1992.
The corresponding figure for Brazil was 36 per 1000 live births.
In epidemiology, we define our population as the collection of units from which our
sample is drawn. This could be people, institutions or events.
For example, in Vienna, in the 1840s, the obstetrician Ignaz Semmelweis noticed
that many women were dying of puerperal fever.
He noticed that there were major differences in the maternal mortality rate from
puerperal fever between two clinics within one hospital. He then looked for
differences between the two clinics which could account for this.
The clinic with the high mortality rate was used to train medical students, and the
clinic with the lower mortality rate was used to train midwives.
Click the "show" button below to highlight the person, place and time in the
description above.
Interaction: button: Show (bold, bold italic, & bold underline styles indicate correct
words below)
"The prevalence of HIV infection among pregnant women of all ages in Kampala,
Uganda in 1995 was 20%."
Now identify the parts of this description of leukaemia in northern England which
specify person, place and time:
"The incidence of lymphoblastic leukaemia among children aged under 15 years
in Northumberland and Durham (two counties in northern England) between
1951 and 1960 was 3.1 per 100 000 person-years."
Exercise
Try again.
The ancient Greek physician Hippocrates (born around 460 BC) used the word
epidemeion to refer to diseases that visit the community, meaning they occurred
from time to time, in contrast to endemeion, diseases which resided within the
community. The expression "endemic disease" comes from this word.
The earliest use of the word "epidemiology" was probably in Spain: in 1598, a book
about plague called Epidemiología was already in its second edition.
The first documented appearance of the English form of the word was in 1850 when
the London Epidemiological Society was formed.
"Whoever wishes to pursue properly the science of medicine should proceed thus.
First he ought to consider what effect each season of the year can produce … He
must consider … how the natives are off for water, whether they use marshy, soft
waters, or such as are hard and come from rocky heights, or brackish and harsh. The
mode of life also of the inhabitants… whether they are heavy drinkers, taking lunch
and inactive, or athletic, industrious, eating much and drinking little."
Farr used his data to compare the mortality rates among different populations. For
example, he compared mortality rates among people of different occupations. The
graph below shows his figures comparing the mortality rates between clergymen and
publicans (men who ran public bars), subdivided by age-group.
It is also an early example of the use of routine data sources - data about health-
related events which are collected systematically. Routine data sources available in
many countries today include census and population registers, birth certificates,
death certificates, cancer registries, notifications of infectious diseases and registers
of congenital abnormalities.
Farr recognised that these data could be used to help understand diseases. He
wrote:
"Different classes of the population experience very different rates of mortality … the
principal causes of these differences, besides the sex, age and hereditary
organization, must be sought in three sources: ordinary occupations of life, supply of
warmth and of food, and differential exposure to poisonous effluvia and to
destructive agents" (Farr, 1885)
During the 19th century, cholera periodically swept across Europe. London was
rather different then - there was no electricity, there were cows and horses in the
streets, and standards of hygiene were poor.
In 1848-9 there was a severe cholera epidemic in London, with 15,000 recorded
deaths. There was considerable debate about the cause of cholera - at the time, the
existence of microbes was hotly contested.
Farr thought that cholera might be caused by breathing unclean air at lower
altitudes.
Toilets were widely introduced in London between 1830 and 1850: main sewers were
introduced in the 1840s. The sewers emptied into the River Thames.
During the 1848-9 cholera epidemic, mortality from cholera was particularly high in
the districts supplied by two particular water companies, the Southwark & Vauxhall
and the Lambeth companies, both of which took water direct from the River Thames
where it flowed through London.
Click below to view part of the original map of this area, including an area where
both companies supplied water. The green area was served by the Southwark and
Vauxhall company, the pink area by the Lambeth company, and the grey area in
between is where the two companies' pipes were intermingled.
There was no further cholera in London until 1853. During this period, the Lambeth
company moved its source so that it now drew water from the Thames upstream of
London (and thus uncontaminated by London sewage). The Southwark and Vauxhall
company continued to draw water from the Thames in London. This provided John
Snow with an ideal opportunity to test his hypothesis.
He then visited the homes of all recorded cholera deaths in these districts, to get
information about which company supplied water to the household. The table shows
his results for the first 334 deaths.
That’s right. Although there were more deaths in houses supplied by the
Southwark and Vauxhall company, this could just be because this company
supplied water to a greater number of people. In order to know whether
one company is more likely to transmit contaminated water, we need to
know, for each company, the number of deaths as a proportion of the
number of people supplied with water.
Snow noted the source of water in the houses of all those who died of cholera from
8th July to 26th August 1854, as shown in this table:
Ideally the denominator should be all people supplied by water from each company.
Different houses may contain different numbers of people. If, for example, the
Lambeth company supplied an area where there were consistently more people per
house than the area supplied by the Southwark and Vauxhall company, then we
might be misled by using the number of houses as a denominator.
However for this example we will assume there was no difference in the number of
people in each house in the areas supplied by the two water companies.
End interaction.
How much more dangerous was it to drink Southwark and Vauxhall water than
Lambeth water (to the nearest whole number)? Click below if you need a hint.
It was
Interaction: calculation:
Incorrect output:
Correct output:
There were 31.5 cholera deaths per 1000 houses supplied by Southwark and
Vauxhall, and 3.8 cholera deaths per 1000 houses supplied by Lambeth, so it was
31.5 / 3.8 = 8 times more dangerous to drink Southwark and Vauxhall water than
Lambeth water.
End interaction.
times more dangerous to drink Southwark and Vauxhall water than Lambeth
water.
Output:
The best way to estimate this is to calculate, for each company, the number of
cholera deaths per house supplied, and then compare the two figures. This could be
expressed as the number of times more deaths per household in the Southwark and
Vauxhall area than the Lambeth area.
3.7: Historical Evolution of Epidemiology
So John Snow started with descriptive epidemiology, obtaining information on the
numerator (the number of cholera deaths) and the denominator (the number of
people supplied with water) so that he could describe the number of cases of cholera
in different areas in relation to the size of the population at risk.
He then went on to compare the death rates from cholera in the two areas. He used
this information to calculate how much more risky Southwark and Vauxhall water
was compared to Lambeth water. In other words, he looked for an association
between water source and the risk of death from cholera.
• The exposure is the process by which an agent comes into contact with a person or
animal in such a way that the person or animal may develop the relevant outcome,
such as a disease
Take a moment to think of some exposures that may be relevant to whether or not
an individual develops lung cancer, then press the button to compare with our list.
Output:
• Smoking habits
• Exposure to asbestos
• Doctor's advice on smoking
• Cultural background
• Region of residence
• Socio-economic class
• Government legislation on tobacco advertising
• Price of cigarettes
- and these are just a few examples - you may have thought of others which are also
correct.
Example
In a study of the effect of cigarette smoking on lung cancer, cigarette smoking is the
exposure, or risk factor, that we are interested in, and the outcome is lung cancer.
No, in fact the exposure that we are interested in is the advice from the midwife. We
would then examine whether or not the mother breast-fed her baby. So here
breast-feeding is the outcome.
End interaction.
Output:
Yes, that’s right. In this example, the exposure is the midwife’s advice, and the
outcome is breast-feeding.
End interaction.
Output:
That’s right. Here breast-feeding is the exposure, and diarrhoeal disease is the
outcome.
End interaction.
This time we are interested in the effect that breast-feeding has on whether a child
gets diarrhoea. So here breast-feeding is the exposure and diarrhoeal disease is the
outcome.
End interaction.
growth
intelligence
multiple pregnancy
fertility
In John Snow's investigation of cholera in 1852 (see section 3, what was the
exposure of interest and what was the outcome?
Options:
Cholera deaths
Cases of cholera
Area of residence
Elevation
Water source
to
Exposure
Outcome
None of the above
Correct responses:
Unless we take this problem into account, we cannot be sure that an association
observed between an exposure and an outcome is genuine, and not the consequence
of a third factor.
We will spend several sessions in this study module (FE09 - FE12) discussing how
to identify and avoid these and other potential pitfalls in epidemiological studies.
For example, William Farr found an association between elevation above sea level
and the risk of death from cholera. This association was because it so happened that
people living in the lower areas near the river were more likely to be supplied with
contaminated water.
So the apparent association between elevation above sea level and cholera was in
fact caused by differences in water supply, which happened to be related to the
elevation above sea level.
So our finding of an association between drinking alcohol and lung cancer is due to
the cigarettes smoked by our subjects who drink alcohol, rather than due to the
alcohol itself.
Lind divided the patients into six groups of two. All received the same basic diet, in
addition to which he assigned different supplements to each group over a six day
period. Click on each of the supplements listed below to find out what he discovered.
The sailors that received the cider showed no improvement in their condition. Try the
other supplements.
"Elixir vitriol" means sulphuric acid mixed with spices, and was the standard
treatment for scurvy at the time. However, this group of sailors showed no change in
their condition.
However there was no change in the condition of the patients who took this
supplement.
Sea water was another standard treatment for scurvy in 1747, but Lind's group
showed no improvement as a result of taking it.
Lind found that the two sailors receiving the oranges and the lemon improved
rapidly, whereas there was no change in the condition of the others.
What was particularly important was that he had comparison or "control" groups of
patients who did not receive the intervention of interest, which meant that he could
compare the outcome in those who received the intervention to those who did not.
Does Lind's experiment prove that oranges and lemons cure scurvy? Write down
some reasons for your answer before you click on the button below.
Output:
Not quite. The allocation to different treatments was not random, so we cannot be
sure that all the patients were all equally unwell at the start of the experiment. Also
the numbers in each group were very small, and in these circumstances, pure
chance may play a major role. Lind was correct, but we would not design a clinical
trial in quite the same way today!
Interaction: button: No
Output:
That’s right. The allocation to different treatments was not random, so we cannot be
sure that all the patients were equally unwell at the start of the experiment. Also the
numbers in each group were very small, and in these circumstances, pure chance
may play a major role. Lind was correct, but we would not design a clinical trial in
quite the same way today!
Randomised controlled trials were first used in the 1950s – an early example was in
the investigation the effect of streptomycin in the treatment of tuberculosis. This
method has been increasingly used to evaluate new treatments and interventions
since then.
For example, a comparison of the incidence of measles before and after the
introduction of measles vaccine would be a quasi-experimental study.
We will discuss the role of chance in epidemiological studies in the statistics with
computing module, intervention studies in session FE17, and how to calculate
sample sizes in the practical epidemiology module.
For example, we might use descriptive epidemiology to examine how the birth-
weight of babies has changed in a particular country over the last 50 years, or to
describe differences in the prevalence of hypertension between men and women, or
to compare the incidence of tuberculosis in different countries.
For example, click below to compare maps of the global prevalence of hepatitis B
infection and of the incidence of primary liver cancer. Notice the similarity in the
distribution of the high risk areas, suggesting that the two diseases are associated.
Adapted from: World Health Organisation. World Health Report 1996. Geneva:
WHO; 1996. p. 61.
The rate of heart attack varies considerably by geographical region, even within a
single country. The rate of heart attack is higher in Scotland than in England. This
does not necessarily mean that living in Scotland causes heart attacks: it is more
likely that people living in Scotland are exposed to other factors which are more
direct causes of heart disease.
In some cases we can suggest appropriate public health action even if we do not
know the precise biological cause of a disease.
We can reduce the risk of lung cancer by not smoking cigarettes, even if we do not
know exactly what it is in cigarettes that causes cancer.
Tabs: Example 2
We can reduce the risk of getting diarrhoea from contaminated water by boiling it,
even if we do not know exactly which pathogen in the water causes the disease.
With observational epidemiology, we can assess the impact of health services on the
health of a community. Click below for an example.
We could look at the differences in maternal mortality rates in communities with and
without access to antenatal care.
For example, in a study in Tanzania to investigate the effect of improved services for
treatment of STIs on the incidence of HIV infection, communities rather than
individuals were randomised.
So in this case, a number of suitable villages were selected, and then each village
(with all its inhabitants) was randomised to receive either improved STD services or
the usual services.
Output:
Output:
Output:
Data concerning exposures and outcomes in which the unit of analysis is the
population rather than individuals.
Interactive: buttons: Individual-based Data
Output:
Data concerning exposures and outcomes derived from individuals within the
population.
Output:
A study in which individuals are randomly assigned to receive the exposure under
investigation or to be a control who does not receive the exposure. See FE17.
Output:
A study where the allocation of the intervention does not happen randomly
For example, a comparison of the incidence of measles before and after the
introduction of measles vaccine would be a quasi-experimental study.
Output:
Output:
A study in which data on average exposure and outcome for a population are used to
compare with similar data for other populations in order to look for associations
between the exposure and the outcome.
A study in which one or more groups of individuals are followed up over a period of
time to determine the frequency of a particular outcome in the group(s).
For example, in a cohort study, the effect of smoking on fatal diseases was studied
by determining the smoking habits of a large group of doctors, and then following
them up to determine the causes of death. The frequency of specific causes of death
(such as lung cancer and heart disease) could then be compared between smokers
and non-smokers.
A study in which one or more groups of individuals are followed up over a period of
time to determine the frequency of a particular outcome in the group(s).
Output:
A study in which individuals with and without the outcome of interest are identified.
Their status with respect to exposures of interest is then determined in order to look
for associations between these exposures and the outcome of interest.
For example: we could use a case-control study to determine risk factors for
diarrhoeal disease among children. We would identify children with and without
diarrhoea and obtain information concerning exposures of interest (breast-feeding,
water supply etc). We would then analyse the data to see if breast-feeding or water
supply was associated with diarrhoea.
Section 8: Using Epidemiology to Investigate a Clinical Problem
Sometimes the investigation of a clinical problem involves a sequence of
epidemiological studies of different types. We can illustrate this with the example of
the iodine deficiency disorders.
Iodine deficiency affects over half a billion people world-wide. It arises in areas
where people depend upon food supplies grown on iodine-deficient soil. Iodine is an
essential micronutrient required by the thyroid gland to make thyroid hormone,
which controls metabolic activity in adults, and is important in foetal growth and
development.
Iodine-deficient soils are most likely to occur at high altitude (e.g. the Andes in
South America and the highlands in Papua New Guinea) where the soil is leached by
water running off the mountain slopes, and in areas far from the sea (e.g. Western
China).
The diagram on the right shows the global distribution of iodine deficiency disorders
(IDDs). Shaded regions are areas where IDDs are prevalent.
Output:
Hetzel BS. Overview of the Prevention and Control of Iodine Deficiency Diseases. In:
Hetzel BS, Dunn JT, Stanbury JB, editors. The Prevention and Control of Iodine
Deficiency Diseases. Amsterdam: Elsevier; 1987. p. 16.
Descriptive Studies
One area that was severely affected by iodine-deficiency in the past was Switzerland.
At the end of the 18th century Napoleon ordered a survey of cretinism to be carried
out in part of Switzerland because many young men from the area were found to be
unfit for military service.
This survey revealed 4 000 people with cretinism in a population of 70 000. This
could be considered as an early descriptive study, quantifying the problem of
cretinism in a particular population.
Descriptive studies are usually the first step in the investigation of a clinical problem.
Another area with a high frequency of iodine-deficiency disorders is Papua New
Guinea (PNG). Surveys in the 1950s revealed a high frequency of goitre and
cretinism.
Descriptive Studies
Population-Based Analysis
The next step was to compare the frequency of goitre and cretinism in different PNG
communities, and to relate this to the iodine level in the food eaten in each
community.
Descriptive Studies
Population-Based Analysis
Individual-Based Analysis
(Case-Control Studies)
As we will see later in the course, ecological studies have drawbacks and are better
used to generate, rather than test, hypotheses. Further evidence is needed in the
form of individual-based analytic studies to confirm the suspected association.
Case-control studies were therefore carried out to establish the association between
cretinism in children and the iodine / thyroid status of their mothers.
Population-Based Analysis
Individual-Based Analysis
Intervention Studies
Once the association between iodine deficiency and cretinism was established, the
next step was to apply this knowledge, and introduce an intervention. Experimental
studies were carried out, in the form of randomised controlled trials, to investigate
the effect of iodine supplementation on the incidence of cretinism.
In the late 1960s, randomised controlled trials were carried out in western PNG to
investigate the effect of giving iodine injections to pregnant women on the incidence
of cretinism among their children. The intervention was highly successful.
Descriptive Studies
Population-Based Analysis
Individual-Based Analysis
Intervention Studies
Population-Based Analysis
Individual-Based Analysis
Intervention Studies
Once the benefits of supplementation have been established, the next stage is the
evaluation of iodine supplementation programmes.
This could be, for example, by monitoring the prevalence of goitre and cretinism in
areas where supplementation has been implemented.
Epidemiologists are sometimes criticised for having too narrow a focus, in that we
identify and measure risk factors for a particular outcome, but tend to ignore the
social context of those risk factors. Thus perhaps as well as "who, where and when?"
we should be asking "and in what social context?"
Epidemiological findings about risk factors such as smoking, high blood cholesterol or
unsafe sexual practices are not always translated into effective disease prevention.
This may be because we do not pay enough attention to the underlying social and
political processes that account for the distribution of these risk factors within a
population.
These social contexts may vary from one population to another, and this may need
to be taken into account in developing strategies to control disease.
For example, if we examine the effect of the hardness of the water supply on death
rates due to cardiovascular disease in Scotland, we will not detect any effect,
because the water is soft throughout Scotland.
The effect can only be detected if we study a larger region with a wider range of
exposure. Click below to see a graph of the correlation between water hardness and
mortality, from Rose's paper.
The graph shows a negative association between water hardness and cardiovascular
mortality - in other words, the harder the water, the lower the risk of death from
cardiovascular disease.
Output:
For example, we might be asked to develop a public health strategy to reduce heart
disease attributable to high cholesterol levels. There are two approaches that we
could use:
● we could decide to target the people with very high cholesterol levels, since they
are the ones who are most at risk (top diagram)
● we could decide to try to reduce the average cholesterol level throughout the
population (bottom diagram)
However this is not always true. For example, in the prevention of STD transmission
by promoting condom use, in some situations it may be more effective to target
high-risk groups than to aim the intervention at the whole population. Statistical
modelling may help to predict what strategy will be most effective.
John Snow's work investigating the Lambeth and the Southwark & Vauxhall water
companies was interrupted at the end of August 1854 by news of a severe outbreak
of cholera in Soho, nearer the centre of London. There were 616 fatal cases which
had their onset between 19 August and 30 September 1854.
John Snow made this map of the area, with a line showing where each person who
had had a fatal case of cholera had lived, and the position of the public water pumps.
Output:
Yes, Pump A on Broad Street is the one on which the fatal cases appear to be
centred.
Output:
From the map, it seems more likely that Pump A in Broad Street is the one on which
fatal cases are centred.
Output:
From the map, it seems more likely that Pump A in Broad Street is the one on which
fatal cases are centred.
10.2: Exercise
John Snow also observed that:
• none of the workers at the Broad Street
brewery had cholera: they were very
close to pump A, but drank beer rather
than water
• an elderly lady and her niece in West
Hampstead (an area some distance
away, which was free of cholera) liked
the taste of Broad Street water, so they
had a bottle brought to them every day
from the pump. They both died of cholera.
Snow became convinced that the Broad Street pump was the source of the outbreak,
and he persuaded the local authorities to remove the pump handle on the 8th
September 1854.
10.3: Exercise
The time distribution of the date of onset of the fatal cases of cholera is shown in the
graph opposite. John Snow had the pump handle removed on 8th September 1854.
Why do you think the epidemic stopped? Think about this for a moment, then press
the button to continue.
Interaction: button: ‘thought bubble’ button
Output:
The graph shows that the epidemic was nearly over by 8th September when John
Snow removed the pump handle. This was probably because the contaminating
sewage had been diluted by this stage. Alternatively, perhaps people had left the
area because there was so much cholera. John Snow's action was important, but
probably not the most important reason for the end of the outbreak!
10.4: Exercise
Did John Snow prove that contaminated drinking water causes cholera?
Output:
It is better to say that John Snow provided strong evidence in support of this idea:
epidemiologists cannot prove causation, rather we provide evidence for or against a
cause. Causality is a complex subject which we will return to later in this study
module.
Interaction: hotspot: No
Output:
That’s right: epidemiologists cannot prove causation, rather we provide evidence for
or against a cause. Causality is a complex subject which we will return to later in
this study module.
10.5: Exercise
Although Snow worked before the era of bacteriology, his observations and
deductions led him to clear descriptions and valid theories about the nature and the
mode of communication of cholera.
He applied his findings with a control intervention (removal of the pump handle), and
went on to make recommendations for prevention (personal hygiene, boiling of
soiled bedclothes of patients, isolation and quarantine, improved waste disposal,
drainage, provision of clean water etc.).
What is epidemiology?
11.1: Summary
The main points of this session will appear below as you click through the step card
opposite. Click on any of the list entries below to go back to that card.
The key elements are to describe the health-related event in terms of its distribution
in person, place and time.
An exposure is a risk factor for the outcome that we are investigating, which may or
may not be the cause of the outcome.
The outcome is the disease, or event, or health-related state, that we are interested
in.