Professional Documents
Culture Documents
The
Covid-19
pandemic
Statistics, statistical concepts and perspectives
W
ith the world in the grip of
a pandemic of coronavirus
disease (Covid-19),
newspapers, TV news
broadcasts, websites and social media are
flooded with numbers. There are daily reports
of cases, treatments and deaths – plus the
subsequent analyses – to digest, but much of
the reporting also discusses concepts such as
SIR models, case–fatality ratios, transmission How do
epidemiologists know how
rates and basic reproduction numbers – ideas
that readers, viewers, listeners might be
unfamiliar with.
It can all be, in a word, overwhelming.
As a publication committed to explaining many people will get Covid-19?
statistical ideas and concepts, we set out to
help readers understand what is going on, to Patrick Ball explains the SIR model
make sense of the data they are confronted
H
with, and to put important information into
the appropriate context. ow many people are infected now Why this is not just arithmetic
Contributors responded to our call “to with the novel coronavirus, SARS- In the United States (as of 9 April 2020), there
explain the statistics of Covid-19”, and what CoV-2? How many will be infected is a severe shortage of tests for SARS-CoV-2,
follows is a selection of the many articles tomorrow? These are hard questions the virus that causes people to become sick
that have been published on our website – because we cannot just see who is infected, with Covid-19. As a consequence, even people
significancemagazine.com – since early April. and we do not know how each infected in hospital with obvious and severe symptoms
As one past contributor remarked recently, person is interacting with others, either of Covid-19 are rarely tested for the virus.
“This may be the most statistically relevant infecting them or not. Here I will explain Furthermore, it is by now becoming clearer
global crisis ever”. Many statisticians are the basic framework used to estimate how that many, perhaps most, people infected with
doing important work to tackle the spread of many people are likely to become sick, and SARS-CoV-2 have mild or no symptoms. The
the disease; others are fighting the spread how many will recover or die. There is some combination of these factors means that only
of false information, which can be just as notation, but I will present each equation in a small fraction of SARS-CoV-2 cases are
hazardous to health. words. At this level, we are reasoning more ever confirmed by a positive test. In practice,
Our hope is that the articles that follow with logic than with mathematics. the confirmed case counts tell us more about
provide some clarity and insight at what is the availability and distribution of tests than
otherwise a confusing and concerning time. about the prevalence of infections.
Stay safe and keep well informed. n Patrick Ball is director of research at Consequently, the true number
Brian Tarran the Human Rights Data Analysis Group. of SARS-CoV-2-positive
I
and ultimately, reducing susceptibility through a
vaccine. The processes over time can be seen in n the United States (as of 9 April 2020), develop mild cases, show no symptoms, and
Figure 1 (page 13). President Donald Trump has said that carry the virus without knowing it because they
The top graph shows the number of deaths testing for novel coronavirus infection are asymptomatic. Thus, efforts to understand
each day, which is the only measure we can will be limited to people who believe they the virus’s penetration into the population must
really observe. The middle graph shows the may be infected. But if we only test people include observation of the asymptomatic.
number of new infections each day. Note that who believe they may be infected, we cannot The estimate of the proportion of the
it leads the deaths by a couple of weeks, and understand how deep the virus has reached population who are infected can be calculated as:
the length of this lead is another variable in the into the population. The only way this could number of symptomatic infections +
model which can only be observed through work is if those who believe they may be number of asymptomatic infections
limited clinical studies. The bottom graph shows infected are representative of the population p=
the SIR values: the susceptible population with respect to novel coronavirus infection. number of symptomatic infections +
number of asymptomatic infections +
(green line) starts with everyone and declines Does anyone believe this is so?
number not infected
over time. The infected group (orange line) rises The common characteristic of those who
for a while then slowly declines. The removed believe they may be infected is that they all So, we need data from a random sample of the
population (blue line) rises over time, eventually show some outward symptoms of infection entire population in order to gather data from
including everyone – the dead and the survivors. by the virus. In other words, people who are infected people who are showing symptoms,
Different modelling projects approach being tested for the novel coronavirus are infected people who are asymptomatic, and
this framework differently. Which parts disproportionately showing severe symptoms. people who are not infected. All have some
are assumed, measured, or modelled vary This would not be a problem if someone who probability of being included in a true random
among different studies. Some models let is infected by the novel coronavirus immediately sample of the population.
the interactive user guess different values of shows symptoms, but this is not the case. As of 23 April, leaders in Germany and
R0 (and thereby β) or p (and thereby γ), while We have strong evidence that some people New York State (see bit.ly/2Kp2iXd and
other models incorporate measures from dailym.ai/3bxZ5Au) had moved to implement
small clinical studies. Some models include James J. Cochran is associate dean for research, random testing to assess how widespread
professor of applied statistics, and the Rogers-Spivey
an intervening term, exposure, between the the virus is, but there has been resistance
faculty fellow at the Culverhouse College of Business,
susceptible population and infection (not University of Alabama. He is vice-chair of the Significance from leaders elsewhere. This could be due to
all susceptible people are exposed, and the Editorial Board. ignorance, disregard, or lack of appreciation of