Professional Documents
Culture Documents
Mohamed Douksieh
Supervisor: Gu Wentao
2020/2021
ton
I. Introduction
1. Research question
2. Objective
Reference
Mohamed Douksieh Thesis proposal 2020-2021 2 / 18
I. Introduction
Coronaviruses are a large family of viruses that can cause severe illness to the
human being. The first known severe epidemic is Severe Acute Respiratory
Syndrome (SARS) occurred in 2003, whereas the second outbreak of severe illness
began in 2012 in Saudi Arabia with the Middle East Respiratory Syndrome (MERS).
. On March 11, as the number of COVID-19 cases has increased thirteen times
apart from China with more than 118,000 cases in 114 countries and over 4,000
deaths, WHO declared this a pandemic.
As the outbreak of the COVID-19 has become a worldwide pandemic, the real-time
analyses of epidemiological data are needed to prepare the society with better
action plans against the disease.
ton
Arlette ANTONI
Mohamed (UBS Vannes)
Douksieh Thesis proposal 2020-20212019-2020
3 / 18 3 / 28
Introduction
As of April 1, 2020, based on the globally shared live data by the Johns Hopkins
dashboard, worldwide there are 932,605 confirmed cases, out of which 193,177
are recovered and 46,809 lost their lives.
In the last few months studies have been carried out to understand the spread of
the disease. For example, the Susceptible-Exposed-Infectious-Removed (SEIR)
model is used to model the outbreak in the city of Wuhan China .
ton
ton
• The aim of this study is to work on the covid19 pandemic data that we
are going through for the purpose of modeling in order to do prediction. It
might be interesting to compare the evolution of covid19 in several
countries and predict the end of the pandemic. So there is two specific
objective
ton
ton
1. Data source
We used the Covid 19 open data set provided by the University Johns Hopkins;
they made an exceptional dashboard using case data affected to date. Apart
from that, they also offer an opportunity to analysts and researchers by providing
the available data in sheet format Google.
As this is chronological data and the number of cases one given day is the
cumulative number. Data are available from January 22, 2020.
ton
This table allows you to follow almost real-time cases of Covid-19 confirmed,
cured and deceased worldwide.
ton
• For a better and clear understanding of the data, the first method will
be the comparative and exploratory analysis to determine the
tendency in different country.
• In this part we can see the evolution and the different number of
death, recovery and confirmed cases of Covid 19. We can also visualize
the recovery rate and mortality rate between countries.
ton
Susceptible (S): The individual hasn't contracted the disease, but she
can be infected due to transmisison from infected people
ton
The rate at which Infected individuals move into the Removed population
is I/r, where r is the recovery delay. The recovery delay represents the
length of time an individual remains infectious. The independent variable
of the model is the time t, and the rates of change of the compartments
are expressed as a set of differential equations:
ton
ton
The prophet model is able to capture daily, weekly and yearly seasonality
along with holiday effects, by
Implementing additive regression model.
• With, g(t) representing the trend. Prophet uses a piecewise linear model
for trend forecasting.
• s(t) represents periodic changes (weekly, monthly, yearly).
• h(t) represents the effects of holidays (recall: Holidays impact
businesses).
• e(t) is the error term.
ton
ton
Weber, A., Ianelli, F., & Goncalves, S. (2020). Trend analysis of the COVID-19
pandemic in China and the rest of the world. arXiv preprint arXiv:2003.09032.
ton