Professional Documents
Culture Documents
Zahid Butt@uwaterloo Ca
Zahid Butt@uwaterloo Ca
Shu-Feng Tsao, MSc1, Helen Chen, PhD1, Therese Tisseverasinghe, MLIS2, Yang Yang,
MSc1, Lianghua Li, BSc3, Zahid A. Butt, PhD1
1School of Public Health and Health Systems, University of Waterloo, Waterloo, Ontario,
Canada
2Seneca College, King City, Ontario, Canada
3Faculty of Science, University of Waterloo, Waterloo, Ontario, Canada
Correspondence to:
With the onset of COVID-19 pandemic, social media has rapidly become a crucial
communication tool for information generation, dissemination, and consumption. In this
scoping review, we selected and examined peer-reviewed empirical studies relating to
COVID-19 and social media during the first outbreak starting in November 2019 until May
2020. From an analysis of 81 studies, we identified five overarching public health themes
concerning the role of online social platforms and COVID-19. These themes focused on: (i)
surveying public attitudes, (ii) identifying infodemics, (iii) assessing mental health, (iv)
detecting or predicting COVID-19 cases, (v) analyzing government responses to the
pandemic, and (vi) evaluating quality of health information in prevention education videos.
Furthermore, our review highlights the paucity of studies on the application of machine
learning on social media data related to COVID-19 and a lack of studies documenting real-
time surveillance developed with social media data on COVID-19. For COVID-19, social
media can play a crucial role in disseminating health information as well as tackling
infodemics and misinformation.
Introduction
The Corona Virus Disease 2019 (COVID-19), or Severe Acute Respiratory Syndrome Novel
Coronavirus (SARS-COV-2), is a significant international public health issue. As of
November 14, 2020, an estimated 53.5 million people around the world have been infected
with the virus, with about 1.3 million deaths. 1 As a consequence of the pandemic, social
media is becoming the platform of choice for public opinions, perceptions, and attitudes
towards various events or public health policies regarding COVID-19. 2 Social media has
become an even more pivotal communication tool for governments, organizations, and
universities to disseminate crucial information to the public. Numerous studies have already
used social media data to help identify and detect infectious disease outbreaks and to
interpret public attitudes, behaviours, and perceptions.3-6 Social media, particularly Twitter,
may be used to explore multiple facets of public health research. A recent systematic review
identified six categories of Twitter use for health research, namely, content analysis,
surveillance, engagement, recruitment, interventions, and frequency analysis of Twitter
users.4 However, this review only included broader research terms such as health, medicine,
or disease using Twitter data and did not focus on specific disease topics such as COVID-
19. Another article analyzed tweets on COVID-19 and identified 12 topics categorized into
four main themes: the origin of COVID-19, the source of the coronavirus, impact of COVID-
19 on individuals, and countries, and methods of decreasing the spread of COVID-19. 7 In
this study data were not available on COVID-19 related tweets before February 2020 thereby
missing the initial part of the epidemic, and the data on tweets were limited to Twitter only
within a one-month period.
Social media can also be effectively used to communicate health information to the general
public during a pandemic. Emerging infectious diseases (EID), such as COVID-19, almost
always results in increased usage and consumption of media of all forms by the general
public for information.8 Therefore, social media plays a vital role in people’s perception of
disease exposure, resultant decision making and risk behaviours.9,10 As information on
social media is generated by users, such information can be subjective or inaccurate, and
frequently includes rumours, misinformation, and conspiracy theories. 11 Hence, it is
imperative that accurate and timely information is disseminated to the general public about
emerging threats such as the novel coronavirus. A systematic review explored the major
approaches used in published research on social media and EID.12 The review identified
three major approaches: (a) assessment of the public’s interest in and responses to EID, (b)
examination of organizations’ use of social media in communicating EID, and (c) evaluation
of the accuracy of EID related medical information on social media. However, this review
did not focus on studies that used social media data to track and predict EID outbreaks.
Analyzing and disseminating information from peer-reviewed published research can guide
policymakers and public health agencies to design interventions for accurate and timely
knowledge translation to the general public. Therefore, keeping in view the gaps mentioned
above, we conducted a scoping review with the aim of understanding what roles social media
have been playing since the beginning of the COVID-19 crisis. We wanted to know
specifically about public attitudes and perceptions towards COVID-19 on social media,
information about COVID-19 on social media, social media use for prediction and detection
of COVID-19, the impact of COVID-19 on mental health and government responses to
COVID-19 on social media. Our objective was to identify studies on social media related to
COVID-19 that focused on the following: (1) Infodemics (2) Public attitudes, (3) Mental
health, (4) Detection or prediction of COVID-19 cases, (5) Government responses to the
pandemic, and (6) Quality of health information in videos.
Methods
Overview
Studies exploring the use of social media on COVID-19 were reviewed using the scoping
review methodology of Arksey and O’Malley13 and Levac et al.14 The five-step scoping
review protocol was followed, and the Preferred Reporting Items for Systematic Reviews
and Meta-Analyses (PRISMA) extension for scoping reviews was employed in this study.
Data Sources
The search strategies utilized index terms, where applicable, and free-text terms to capture
the following two concepts: (a) social media, including both general terms and specific
platform names (e.g., Twitter, Facebook, Weibo, YouTube); (b) COVID-19. For each
database, both indexed terms (i.e., MeSH, Emtree) and natural language keywords were used
with Boolean operators (i.e., AND, OR, NOT) and truncations. Table 1 shows keywords,
indexed terms, and their variations used for the literature search. Since each database has
distinctive search functionality, individually tailored search statements were developed with
appropriate search filters for each database. Final search statements, along with a list of
search results, were downloaded from each database.
Table 1: Keywords, indexed terms, and their variations used in the literature search
Keywords Variations
betacoronavirus; severe acute respiratory syndrome; covid 2019; corona-19; n-cov; novel
COVID-19
coronavirus; sars-cov; wuhan 2019
twitter; tweet; retweet; facebook; weibo; sina; youtube; webcast; user comment; online
social media
post; online discussion; social network; social media; online community; mobile app
Indexed terms Variations
betacoronavirus; coronavirus infections; severe acute respiratory syndrome; coronavirus
COVID-19
disease 2019; covid-19
social networking; social media; mobile applications; blogging; social networking (online);
social media
online social network; webcast; mobile applications; mobile computing; social network
Articles were included if (a) they discussed the use of social media for COVID-19 research,
and (b) were original, empirical studies. Only peer-reviewed articles, including peer-reviewed
preprints, in English or Chinese, were included. A decision to include Chinese publications
was based on the fact that COVID-19 cases were first reported in Wuhan, China and many
initial and relevant studies were published in Chinese; therefore, we wanted to capture a
majority of studies that were on the use of social media for COVID-19 research. All articles
published between November 1, 2019, and November 4, 2020, were included. Publications
such as reviews, opinion pieces, pre-prints, books, book chapters and any empirical study that
were not peer-reviewed or written in languages other than English or Chinese were
automatically excluded. The final reference list was generated based on originality and
relevance to the broad scope of this review.
Screening Procedure
Primary reviewer (SFT) mainly carried out title and abstract screening for each article to
determine if an article met the inclusion criteria. If the criteria were confirmed, then it was
included; otherwise, it was excluded. Articles were broken into paragraphs to contain one code
in each section. Next, quotes were sorted under each code, applying Ose’s method. 15 Braun
and Clark’s thematic analysis method was employed and involved searching for the text that
matched the identified predictors (codes) in from the quantitative analysis and discovering
emergent codes that were relevant to either the study objective or identified in the relevant
literature review.16 Finally, we categorized the codes into main themes. These codes and
themes were compared and clarified to draw conclusions around the main themes by SFT, ZB,
and RY. SFT is fluent in both English and Chinese; the secondary reviewer (ZB) is fluent in
English, and the tertiary reviewer and domain expert (YY, HC) are both fluent in English and
Chinese. Any discrepancies among reviewers were discussed with the research team to reach
consensus.
Results
Figure 1 illustrates the PRISMA flow diagram of the literature search. With the application of
appropriate search filters, a total of 2405 articles were retrieved from the identified databases:
PubMed (n=1084), Scopus (n=1021) and PsycINFO (n=300). Among these, 670 duplicates
were excluded. Of the remaining 1735 articles, 1434 were deemed non-empirical, such as
comments, editorial essays, letters, opinions, and reviews. This left 301 articles for a full-text
review based on the screening results of titles and abstracts. After the full-text review, 81
articles were included in this scoping review.
Identification
81 articles selected
Figure 1: PRISMA flow chart of article extraction from the literature search
Table 21 summarizes the 81 selected articles on COVID-19 and social media. All articles were
written in English. Twitter (n=45) and Weibo (n=16) undoubtedly were the most studied social
media data. To categorize these chosen articles, we adopted a novel framework called “Social
media and Public Health Epidemic and REsponses (SPHERE) and developed a modified
version of SPHERE framework to organize the themes for our scoping review, as shown in
Figure 2.17 Themes were identified through reviewers’ consensus based on our modified
SPHERE framework. We identified six themes: (i) infodemics, (ii) public attitudes, (iii) mental
health, (iv) detecting or predicting COVID-19 cases, (v) government responses, and (vi) quality
of health information in videos.
1. Infodemics
According to the World Health Organization (WHO), the term “infodemic,” a combination of
“information” and “epidemic,” refers to a fast and widespread of both accurate and inaccurate
information about something, such as a disease like COVID-19.18 Twelve articles tackled
COVID-19 related infodemics circulating on social media platforms. Rovetta and
Bhagavathula analyzed over 2 million queries from Google Trends and Instagram between
1
Table 2 is displayed at the end of this paper
February 20 and May 6, 2020. Their findings revealed that as global interest of COVID-19
information increased, so did its infodemic.19 Gallotti et al analyzed over 100 million tweets
and found that even before the onset of the COVID-19 pandemic, entire Inlay’s expose
measurable waves of infodemic has threatened public health.20 Pulido et al sampled and
analyzed 942 tweets which revealed that while false information had higher number of tweets,
it also had less retweets and lower engagement than tweets comprising scientific evidence or
factual statements.21 Kouzy et al22 set out to investigate the extent to which misinformation or
unverifiable information about the COVID-19 pandemic was spread on Twitter by analyzing
673 English tweets. Their results showed that misinformation accounted for 24.8% of all
tweets. Healthcare or public health accounts had the lowest rate; yet still 12.3% of their tweets
included unverifiable information. Moscadelli et al collected and reviewed 2,102 news articles
circulated on the internet. Their analysis revealed that fake news accounted for 23.1% of
articles studied which were shared over 2 million times between December 31, 2019 and April
30, 2020.23 Similarly, another quantitative study by Galhardi et al comparing the proportion of
fake news shared on WhatsApp, Instagram, and Facebook in Brazil revealed that fake news
was predominately shared on WhatsApp.24 A UK study by Ahmed et al, analyzed 22,785 tweets
posted by 11,333 Twitter users with “#FilmYourHospital” hashtag to identify and evaluate the
source of conspiracy theory on Twitter. Their work uncovered that ordinary people were the
major driver behind the spread of conspiracy theories.25 Another research investigated the 5G
and COVID-19 conspiracy theory circulating on Twitter with a random sample of 233 tweets.
The content analysis showed that 34.8% tweets linked 5G and COVID-19 and 32.2%
condemned such theory 26. A similar research by Bruns et al investigated 89,664 distinct
Facebook posts in Australia related to such conspiracy from 1 January to 12 April 2020 using
time series and network analysis.27 The results showed that this conspiracy went viral after
March 19, 2020 with unusual coalition among various groups on Facebook. Islam et al
analyzed 2,311 COVID-19 related infodemic reports from December 31, 2019 to April 5, 2020
and demonstrated misinformation was mainly driven by rumors, stigma, and conspiracy
theories circulating on various social media and other online platforms.28 Associations between
infodemic and bot activities on social media is another important research direction. One study
analyzed 12 million tweets from the US and 15 million tweets from the Philippines from March
5 to 19, 2020 and both countries showed a positive relation between bot activities and rate of
hate speech in denser and more isolated communities.29 Brennen et al qualitatively analyzed
96 samples of visuals from January through March 2020 and categorized misinformation into
6 trends, and found, fortunately, there have been no involvement of AI “deepfakes” techniques
so far.30
Three themes emerged under this category: public attitudes, mental health, and detection or
prediction of COVID-19 cases. The former two themes are reflections regarding the public
perceptions and mental health impacts of the pandemic; the latter includes typical surveillance
studies aiming to propose ways for the detection or prediction of the COVID-19 cases.
1. Public attitudes
Forty-eight selected articles gauged the attitudes and emotions expressed by social media users
regarding the COVID-19 pandemic mainly by employing content and sentiment analysis.
Twitter and Weibo accounted for 34 and 8 articles, respectively. This theme can be further
divided into the following sub-themes: (1) Public sentiment toward the COVID-19 pandemic
and interventions, (2) stigma and racism, and (3) ageism.
Another study investigated how the Chinese government used the popular social media
platform, Sina Weibo, to promote citizen engagement during the COVID-19 crisis.61 Xuehua
Han et al.62 analyzed 1,413,297 Weibo posts and found that the public reacted sensitively to
the epidemic, especially in metro areas.62 Zhao et al studied 4,056 topics from the Sina
Microblog hot search list and found the public emotions shifted from negative to neural to
positive over time, and five major public concerns63. Jiawei Li et al.64 conducted an
observational infoveillance study with a linear regression model by analyzing 115,299 Weibo
posts. The results showed that the number of Weibo posts positively correlated with the number
of reported cases of coronavirus in Wuhan. In addition, the qualitative analysis classified the
topics into the following four overarching themes: (a) cause of the virus; (b) epidemiological
characteristics of the COVID-19; (c) public responses; and (d) others. 65 Qiang Chen et al.
examined relationships between citizen engagement through government social media
(CEGSM) and media richness, dialogic loop, content type, and emotion valence.61 GEGSM
refers to sum of shares, likes, and comments in this study, so the higher the sum, the greater
the GEGSM. Media richness quantifies how much a sender transfers information to a receiver
via a medium based on the media richness theory. Dialogic loop, or dialogic communication
theory, is defined as an approach that promotes a dialogue between a speaker and audience.
According to the American Psychological Association (APA), emotion valence refers to “the
value associated with a stimulus expressed on a continuum from pleasant to unpleasant or from
attractive to aversive”.65 For instance, happiness is typically considered pleasant valence. The
researchers analyzed 1,411 COVID-19 related posts from "Healthy China," an official account
of the National Health Commission of China (NHCC) on Weibo. Findings showed an inverse
association between media richness and CEGSM, indicating that posts with plain texts had
higher CEGSM than posts with pictures or videos. A positive association between dialogic
loop and CEGSM was found, as evidenced by 96% posts using hashtags and 25% employing
questions. CEGSM also had a positive relationship with emotion valence, as evidenced by
higher CEGSM with positive emotions shown in posts.61 The last chosen article proposed a
new multiple-information susceptible-discussing-immune (M-SDL) model to analyze the
public opinion propagation of the COVID-19 from Weibo posts collected from December 31,
2019, to February 27, 2020. The researchers reported that the reproduction rate of this proposed
M-SDL model reached 1.78 in the early stage of COVID-19 but decreased to and maintained
around 0.97. Such a result demonstrated that the information on COVID-19 would continue to
erupt slowly in the future until it stabilized without violent information outbreak.Wang et al
analyzed 999,978 randomly selected COVID-19 related Weibo posts through unsupervised
BERT model for sentiments and TF-IDF model for topic modeling. They identified four public
concerns: “the virus origin,” “symptom,” “production activity,” and “public health control” in
China.66 Xi et al examined 241 topics with their views and comments via thematic and temporal
analysis and found the “Contributing to the community” theme was the most dominant in the
first phase (January 20–February 20, 2020).67 The “Older patients in hospitals” theme was most
prevailing in the second (February 21–March 17, 2020) and third phase (March 18–April 28,
2020). Employing Wilcoxon tests, Su et al examined posts from 850 Weibo users and 14,269
Tweets from Italy.68 The findings showed Italians paid more attention to leisure whereas
Chinese paid more attention on group, religion, and emotions after lockdowns. Analyzing top
200 accounts from WeChat via regressions and content analysis, Ma et al found positive
impacts of nonmedical and medical reports on people’s behaviours. 69 Employing Kendall’s
Tau-B rank test, Xie et al investigated relations among Baidu Attention Index (BAI), daily
Google Trends, and numbers of COVID-19 cases and deaths. 70 Daily GT was correlated to 7
indicators, whereas daily BAI was only correlated to 3 indicators. 71 Eventually, Zhu et al
analyzed 1,858,288 Weibo posts and found topics changed over time, but political and
economic posts attracted greater attentions.71
1.3 Ageism
A study76 investigating Twitter content related to both COVID-19 and older adults analyzed a
random sample of 351 English tweets. It turned out that 21.1% of the tweets implied diminished
regard for older adults by downplaying or dismissing concerns over the high fatality of
COVID-19 on this population.76 Another similar research examined 188 tweets via thematic
analysis and revealed 90% against ageism, whereas 5% back up and 5% neutral.77
2. Mental Health
Two of the 81 reviewed studies, both based in China, focused on assessing social media users’
mental health.78,79 A cross-sectional study78 investigated the relationship between anxiety and
social media exposure (SME), theoretically defined as “the extent to which audience members
have encountered specific messages”.80 The researchers distributed an online survey based on
the Chinese version of WHO-Five Well-Being Index (WHO-5) for depression and the Chinese
version of generalized anxiety disorder scale (GAD-7) for anxiety. Respondents included 4,872
Chinese citizens aged 18 and over from 31 provinces and autonomous regions in China. After
controlling for all covariates through a multivariable logistic regression, the finding uncovered
that frequent SME increased the odds ratio of anxiety, demonstrating frequent SME potentially
contributing to the mental health problems during the COVID-19 outbreak.78 To explore how
people's mental health was influenced by the COVID-19, Li et al.798 analyzed posts from
17,865 active Weibo users to compare pre- and post-declaration of COVID-19 on 20 January,
2020. The researchers uncovered increased negative sentiments including anxiety, depression,
and indignation post-declaration, and decreased positive sentiments expressed in Oxford
happiness score. Additionally, cognitive indicators increased in social risk judgment but
decreased in life satisfaction post-declaration.78
Six of the 81 studies investigated the prediction of COVID-19 outbreaks using social media
data. Qin et al.79 attempted to predict the number of new suspected or confirmed COVID-19
cases by collecting social media search indexes (SMSI) for symptoms such as dry cough, fever,
chest distress, coronavirus, and pneumonia. The data were analyzed using subset selection,
forward selection, lasso regression, ridge regression, and elastic net. Results showed the
optimal model was constructed via the subset selection. The lagged SMSI was a predictor of
new suspected COVID-19 cases and could be detected 6-9 days before the new case
confirmation77. To evaluate the possibility of early prediction of the COVID-19 cases via
internet searches and social media data, Li et al.82 used keywords “coronavirus” and
“pneumonia” to retrieve corresponding trend data from Google Trends, Baidu Index, and Sina
Weibo Index. Using the lag correlation, the results showed that for new lab-confirmed cases,
the highest correlation was found 9-12 days earlier for the keyword "coronavirus" in the three
platforms. Similarly, for the new suspected cases, the highest correlation was found 6-8 days
earlier for “coronavirus.” The keyword “pneumonia” for new suspected cases had the highest
correlation eight days earlier across the three platforms. 82Peng et al studied 1,200 Weibo
records using spatiotemporal analysis, kernel density analysis, and ordinary least square
regression and found that “scattered infection”, “community spread”, and “full-scale outbreak”
were three phases of early COVID-19 transmission in Wuhan, China. 83 Elderly accounted for
over 50% of help seeking on Weibo, thus of high risk. To identify COVID-19 patients with
poor prognosis, Liu et al, analyzed Weibo messages from 599 patients along with telephone
follow-ups.84 The findings suggested risk factors involving older age, diffuse distribution, and
hypoxemia. A regression study analyzed GT searches, Wikipedia page views, and tweets and showed
that using current Wikipedia page views, one‐week‐ahead tweets, and two‐week ahead GT searches
were included to model the number of COVID-19 cases.- To model the number of deaths, all three
variables should be one-week further ahead85.
1. Government Responses
To make the public inoculated against misinformation, public health organizations should
create and spread accurate information on social media because social media have increasingly
played an important role in policy announcement and health education. Six of the 81 articles
were categorized as “government responses” because they examined how government
messages and health education material were generated and consumed on social media
platforms. Two studies analyzed data from Sina Weibo,86,88 and the other four studies analyzed
data from Twitter.88,89
Zhu et al.86 conducted a study to measure the attention of Chinese netizens—“citizen of the
net” according to Michael F. Hauben90—to COVID-19 by analyzing 1,101 Weibo posts. They
found that Chinese netizens paid little attention to the disease until the Chinese government
acknowledged and declared the COVID-19 outbreak on January 20, 2020. Since then, high
levels of social media traffic occurred when Wuhan began its quarantine (January 23-24, 2020),
during a Red Cross Society of China scandal (February 1, 2020), and following the death of
Dr. Li Wenliang (February 6-7, 2020).86 Lifang Li87 collected 36,746 Weibo posts to identify
and categorize the situational information using support vector machines (SVM), naive Bayes
(NB), and RF, as well as features in predicting the number of reports using linear regression.
Except for posts categorized as “counter-rumours,” they found the more word counts, the more
reposts for all the other types. Likewise, posts from unverified users had more reposts in all
types, excluding the counter-rumours. For the counter-rumours, reposts would increase if its
users have a greater number of followers and from urban areas.87 A qualitative content analysis
was conducted to investigate how G7 leaders used Twitter concerning the COVID-19
pandemic by collecting 203 tweets.88 The findings showed that 82.8% were “informative,”
28.6% of them linked to official government resources; 9.4% were "morale-boosting;” and
6.9% were “political”.88 To assess the political partisan polarization in Canada regarding
COVID-19, Merkley et al.89 random sampled 1,260 tweets from 292 federal members of
Parliament’s (MP) social media and collected 87 Google search trends for the search term
"coronavirus". A survey was also conducted with 2,499 Canadian respondents aged 18 years
and above. The results demonstrated that regardless of party affiliation, MP members
emphasized the importance of social distancing measures and proper hand hygiene practices to
cope with the COVID-19 pandemic without tweets exaggerating concerns or misinformation
about COVID-19. Search interest in coronavirus among municipalities was strongly
determined by socioeconomic and urban factors rather than Conservative party vote share.89
Sutton et al studied 149,335 tweets from public health, emergency management and elected
officials and found that messages changed positively and negatively over time.91 Besides,
Wang et al investigated 13,598 COVID-19 related tweets via temporal and network analyses. 92
They categorized 16 types of messages and identified inconsistent and incongruent messages
expressed in 4 critical prevention topics: mask wearing, risk assessments, stay at home order,
and disinfectants/sanitizers.
Eight chosen studies investigated the quality of Youtube videos with COVID-19 prevention
information. Basch et al.93 conducted a cross-sectional study and retrieved the top 100 YouTube
videos with most views uploaded in January 2020 with keywords "Coronavirus" in English,
with English subtitles, or in Spanish. These 100 videos together generated over 125 million
views. However, less than 33 videos included any of seven key prevention behaviours
recommended by the US CDC.93 The same group of researchers conducted a follow-up study
with the same criteria and a successive sampling design to gather top 100 mostly viewed
YouTube videos in January and in March 2020, respectively. Findings showed that the January
sample generated over 125 million views, and the March sample had over 355 million views.
Yet less than 50- videos in either sample contained any of the prevention behaviours
recommended by the US CDC.94 Additionally, a research investigated the first 100 “DIY hand
sanitizer” videos on YouTube with the most views and showed the average number of daily
pediatric poisoning calls increased significantly in March 2020 compared with the previous 2
years.95
To analyze the information quality on YouTube about the COVID-19 pandemic and to compare
the contents in English and Chinese Mandarin videos, Priyanka Khatri et al.39 collected a total
of 150 videos with the keywords “2019 novel coronavirus,” “Wuhan virus” in English and in
Mandarin. The DISCERN score and the medical information and content index (MICI) were
calculated as a reliable way to measure the quality of health information. It turned out the
average DISCERN score for reliability was low: 3.12 out of 5 for English videos and 3.25 for
Mandarin videos. The average cumulative MICI score of useful videos was also undesirable:
6.71 out of 25 for English videos and 6.28 for Mandarin videos. 96 In Spain, a similar research
studied 129 videos in Spanish and found such videos were usually incomplete and differed
according to different uploaders. Likewise, one study in South Korea found misleading videos
accounted for 37% of all collected 105 most viewed, and such videos had more likes, fewer
comments, and longer viewing times than useful videos. Two studies in Turkey investigated
quality of YouTube videos regarding COVID-19 information in dentistry. 99,99 One analyzed
top 116 English videos with at least 300 views and showed moderate quality and useful
information from these videos.99 The other one, however, uncovered poor quality for 24 out of
55 (43.6%) English videos, whereas good quality accounted for only 2 (3.6%) videos. 100
Discussion
Studies on social media data revealed our attitudes and mental state to a certain extent during
the COVID-19 crisis. These studies also showed how we generated, consumed and propagated
information on social media platforms when facing the rapid spread of the novel coronavirus
and extraordinary measures for the containment. In our review, public attitudes accounted for
nearly 60% of the reviewed articles. In terms of social media platforms, 46% of the chosen
articles used data from Twitter, followed by Weibo (18%). Machine learning analyses, such as
LDA and RF, were applied in research that studied public attitudes.
We identified six themes based on our modified SPHERE framework, including (i) infodemics,
(ii) public attitudes (iii) mental health, (iv) detection or prediction of COVID-19 cases, (v)
government responses to the pandemic, and (vi) quality of prevention education videos.
However, a common limitation in all chosen studies using social media data is the “data
quality” challenge, such as various formats, metrics, or even the definition of a variable. For
instance, the definition of a “view” on one social media platform is likely different from
another. Besides, not every social media offers accessible data like Twitter and Weibo. To
address these challenges, the selected studies have controlled for many factors, including social
media platforms, languages, locations, time, misspellings, keywords, or hashtags. However,
such search strategies resulted in many study limitations, like non-representative sample sizes
or selection bias, cross-sectional or retrospective study design. We also observed that, given
the large amount of available data, most studies across all domains still sampled smaller data
size for analyses, except four studies under the “public attitudes” theme analyzing over one
million posts via machine learning methods. Additionally, data from Twitter and Weibo
accounted for over 60% of our selected studies. Research examining other social media
platforms, including Facebook, Instagram, TikTok, Snapchat, and WhatsApp, are limited due
to barriers of data availability and accessibility. We also identified future research topics
needed for each category as we are currently in the midst of a second wave of the COVID-19
pandemic . From “infodemics” perspective, additional research is needed to investigate how
effective misinformation/rumours/fake news, such as anti-mask wearing, undermine
preventions to compromise public health, although social media companies like Twitter and
Facebook recently started to remove misinformation-based accounts or label unaccountable
posts. Robot (or bot) posts is another focus to be addressed. Studies evaluating effective
counter-infodemic interventions are also needed.
Articles regarding “public attitudes” toward the COVID-19 pandemic have revealed sentiments
that shifted over time. Yet, this theme can be a useful indicator when evaluating interventions,
social distancing and wearing masks for instance, that aim to reduce the COVID-19 infection
risks. However, public sentiments have not been incorporated into many intervention studies
by the time we conducted this review. When a disease like COVID-19 starts spreading and
causing negative sentiments, timely, proper, and effective risk communication is needed to help
ease peoples’ anxiety or negative attitudes regarding the COVID-19 pandemic especially
through social media.
Mental health is another issue that requires further investigations. Our chosen studies did not
address mental health issues based on age, as youth and adults could struggle with different
problems and varied interventions. Public health measures such as social distancing
implemented in the COVID-19 pandemic exacerbated risk factors and adverse health
behaviours at the individual and population levels. Studies have shown that social media data
were useful to detect mental health issues at the population level. Due to the early outbreak of
COVID-19 and the prevalence of social media (i.e. Weibo, WeChat) in China, two studies that
shed light on issues of mental health were both among the Chinese population. 78-79 Similar
deteriorating trend of mental health could happen in other regions. At the time of this writing,
British Columbia, Canada recorded the highest number of overdose deaths in May 2020.101
In terms of the surveillance of the COVID-19 pandemic, six chosen studies demonstrated
methods to detect or predict the number of COVID-19 cases using social media data. Unlike
other infectious diseases such as influenza and malaria, COVID-19 has not had real-time
monitoring surveillance developed with social media data according to our review. It is
possible that the pandemic has evolved so rapidly that finding COVID-19 vaccinations or
therapies has been prioritized over others. Besides, lacking accurate and reliable data sources
may discourage the development of the COVID-19 real-time surveillance. Moreover, it
remains unknown whether COVID-19 is just a one-time black swan event or would become
something like the seasonal flu. If the latter, then it may be meaningful and useful to establish
a real-time model to monitor the disease using social media data.
Government responses distributed via social media have increasingly played critical in
combating infodemics and promoting accurate and reliable information for the public.
However, little has been studied about how efficient and effective these official reactions lead
to belief or behavioural changes in real life. It also remained unknown that, compared with
infodemics, government posts would reach greater numbers of social media users or have
greater impacts on them.
YouTube has served as one of major platforms to spread information concerning the disease
control of COVID-19. Nonetheless, our chosen studies revealed undesirable quality for these
videos because they contained few recommended preventions from the governments or public
health organizations. It is a worrisome observation if accurate and reliable videos and other
types of information are not created and disseminated in a timely manner. Therefore, videos,
especially from public health authorities, should include accurate and reliable medical and
scientific information and use relevant hashtags to reach more audience, generate more views
and increase responses. Moreover, our selected studies were limited to YouTube videos only.
Additionally, a significant proportion of the studies were done using Sina Weibo, which,
although used by a large number of people, is exclusive of China, which may lead to an over-
representation of a single country in the scoping review.
In summary, although our scoping review has limitations embedded from the chosen studies,
it recognized six themes that have been studied so far, and future research directions are also
identified. Our adopted framework can serve as a fundamental and flexible guideline when
conducting studies related to social media and epidemiology.
Conclusions
Our review identified various topics, themes and methodological approaches in studies on
social media and COVID-19. Among the six identified themes, public attitudes comprised the
majority of the articles. Among the selected studies, Twitter was the leading social media
platform followed by Weibo. A minority of the studies included machine learning methods,
whereas a vast majority employed traditional statistical methods. Unlike influenza, we were
not able to find studies documenting real-time surveillance developed with social media data
on COVID-19. Our review also identified studies on COVID-19 related infodemics, mental
health and prediction. For COVID-19, accurate and reliable information through social media
platforms can play a crucial role in tackling infodemics, misinformation, and rumours.
Additionally, real time surveillance from social media about COVID-19 can be an important
tool in the armamentarium of interventions by public health agencies and organizations.
Contributors
HC and ZB conceived of the review. SFT conducted the screening, searched for articles,
performed data analysis, and wrote the first draft of the manuscript. TT refined the search
strategy and searched for articles. YY refined the search strategy and created the tables. LL
assisted in screening. ZB and HC supervised the review process and prepared the final draft for
submission. All authors contributed to the interpretation of results, manuscript preparation, and
revisions. All authors read and approved the final manuscript.
Declaration of interests
Acknowledgements
References
1) Medicine JHUa. Coronavirus Resource Center: World Map. © 2020 Johns Hopkins
University. 2020; Available at: https://coronavirus.jhu.edu/map.html.
2) Pérez-Escoda A, Jiménez-Narros C, Perlado-Lamo-de-Espinosa M, Pedrero-Esteban
LM. Social Networks' Engagement During the COVID-19 Pandemic in Spain: Health
Media vs. Healthcare Professionals. Int J Environ Res Public Health. 2020 Jul
21;17(14):5261.
3) Jordan SE, Hovet SE, Fung IC, Liang H, Fu K, Tse ZTH. Using Twitter for public
health surveillance from monitoring and prediction to public response. Data
2019;4(1):6.
4) Shah Z, Surian D, Dyda A, Coiera E, Mandl KD, Dunn AG. Automatically appraising
the credibility of vaccine-related web pages shared on social media: a Twitter
surveillance study. Journal of medical Internet research 2019;21(11):e14007.
5) Sinnenberg L, Buttenheim AM, Padrez K, Mancheno C, Ungar L, Merchant RM.
Twitter as a tool for health research: a systematic review. Am J Public Health
2017;107(1):e1-e8.
6) Steffens MS, Dunn AG, Wiley KE, Leask J. How organisations promoting
vaccination respond to misinformation on social media: a qualitative investigation.
BMC Public Health 2019;19(1):1-12.
7) Abd-Alrazaq A, Alhuwail D, Househ M, Hamdi M, Shah Z. Top concerns of tweeters
during the COVID-19 pandemic: infoveillance study. Journal of medical Internet
research 2020;22(4):e19016.
8) Freberg K, Palenchar MJ, Veil SR. Managing and sharing H1N1 crisis information
using social media bookmarking services. Public relations review 2013;39(3):178-
184.
9) Giustini D, Ali SM, Fraser M, Kamel Boulos MN. Effective uses of social media in
public health and medicine: a systematic review of systematic reviews. Online J
Public Health Inform. 2018 Sep 21;10(2):e215.
10) Al-Dmour H, Masa'deh R, Salman A, Abuhashesh M, Al-Dmour R. Influence of
Social Media Platforms on Public Health Protection Against the COVID-19 Pandemic
via the Mediating Effects of Public Health Awareness and Behavioral Changes:
Integrated Model. J Med Internet Res. 2020 Aug 19;22(8):e19996.
11) Bridgman, A.; Merkley, E.; Loewen, PJ; Owen, T; Ruths, D.; Teichmann, L.; Zhilin,
O. (2020). The causes and consequences of COVID-19 misperceptions: understanding
the role of news and social media. The Harvard Kennedy School (HKS)
Misinformation Review
12) Tang L, Bie B, Park S, Zhi D. Social media and outbreaks of emerging infectious
diseases: A systematic review of literature. Am J Infect Control 2018;46(9):962-972.
13) Arksey H, O'Malley L. Scoping studies: towards a methodological framework.
International journal of social research methodology 2005;8(1):19-32.
14) Levac D, Colquhoun H, O'Brien KK. Scoping studies: advancing the methodology.
Implementation science 2010;5(1):69
15) Ose, S. O. (2016). Using Excel and Word to structure qualitative data. Journal of
Applied Social Science, 10(2), 147-162.
16) Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative
research in psychology, 3(2), 77-101.
17) Schillinger D, Chittamuru D, Ramírez AS. From “infodemics” to health promotion: A
novel framework for the role of social media in public health. American journal of
public health. 2020 Sep;110(9):1393-6.
18) World Health Organization. Novel Coronavirus(2019-nCoV) Situation Report - 13.
2020. Available at https://www.who.int/docs/default-source/coronaviruse/situation-
reports/20200202-sitrep-13-ncov-v3.pdf
19) Rovetta A, Bhagavathula AS. Global Infodemiology of COVID-19: Analysis of
Google Web Searches and Instagram Hashtags. J Med Internet Res
2020;22(8):e20673. doi:10.2196/20673.
20) Gallotti R, Valle F, Castaldo N, Sacco P, De Domenico M. Assessing the risks of
'infodemics' in response to COVID-19 epidemics. Nat Hum Behav (2020).
https://doi.org/10.1038/s41562-020-00994-6.
21) Pulido CM, Villarejo-Carballido B, Redondo-Sama G, Gómez A. COVID-19
infodemic: More retweets for science-based information on coronavirus than for false
information. International Sociology 2020:0268580920914755.
22) Kouzy R, Abi Jaoude J, Kraitem A, El Alam MB, Karam B, Adib E, et al.
Coronavirus Goes Viral: Quantifying the COVID-19 Misinformation Epidemic on
Twitter. Cureus 2020 March 13;12(3):e7255.
23) Moscadelli A, Albora G, Biamonte MA, Giorgetti D, Innocenzio M, Paoli S, Lorini C,
Bonanni P, Bonaccorsi B. Fake News and Covid-19 in Italy: Results of a Quantitative
Observational Study. Int. J. Environ. Res. Public Health 2020, 17(16), 5850;
https://doi.org/10.3390/ijerph17165850
24) Galhardi CP, Freire NP, Maria Minayo CS, Fagundes, MCM. Fact or Fake? An
analysis of disinformation regarding the Covid-19 pandemic in Brazil. Ciênc. saúde
coletiva [Internet]. 2020 Oct [cited 2020 Nov 18] ; 25( Suppl 2 ): 4201-4210.
Available from: http://www.scielo.br/scielo.php?script=sci_arttext&pid=S1413-
81232020006804201&lng=en. Epub Sep 30, 2020. https://doi.org/10.1590/1413-
812320202510.2.28922020.
25) Ahmed W, López Seguí F, Vidal-Alaball J, Katz MS. COVID-19 and the "Film Your
Hospital" Conspiracy Theory: Social Network Analysis of Twitter Data. J Med
Internet Res 2020;22(10):e22374. doi: 10.2196/22374.
26) Ahmed W, Vidal-Alaball J, Downing J, Seguí FL. COVID-19 and the 5G conspiracy
theory: social network analysis of Twitter data. Journal of Medical Internet Research.
2020;22(5):e19458.
27) Bruns, A, Harrington, S, Hurcombe, E. ‘Corona? 5G? or both?’: the dynamics of
COVID-19/5G conspiracy theories on Facebook. Media International Australia,
177(1), pp. 12–29. doi: 10.1177/1329878X20946113.
28) Islam MS, Sarkar T, Khan SH, Mostofa Kamal AH, Hasan SMM, Kabir A, Yeasmin
D, Islam MA, Amin Chowdhury KI, Anwar KS, Chughtai AA, Seale H. COVID-19-
Related Infodemic and Its Impact on Public Health: A Global Social Media Analysis.
The American Society of Tropical Medicine and Hygiene, 2020, 103(4): 1621 – 1629.
doi: https://doi.org/10.4269/ajtmh.20-0812.
29) Uyheng J, Carley KM. Bots and online hate during the COVID-19 pandemic: case
studies in the United States and the Philippines. J Comput Soc Sc (2020). doi:
https://doi.org/10.1007/s42001-020-00087-4
30) Brennen, J.S., Simon, F.M., Nielsen, R.K. Beyond (Mis)Representation: Visuals in
COVID-19 Misinformation. The International Journal of Press/politics. 2020 Oct.
31) Kwon, J, Grady, C, Feliciano, JT, Fodeh, SJ. Defining facets of social distancing
during the COVID-19 pandemic: Twitter analysis. Journal of Biomedical Informatics,
2020, 111. doi: https://doi.org/10.1016/j.jbi.2020.103601
32) Lai D, Wang D, Calvano J, Raja AS, He S. Addressing immediate public coronavirus
(COVID-19) concerns through social media: Utilizing Reddit's AMA as a framework
for Public Engagement with Science. PLOS ONE 15(10): e0240326.
https://doi.org/10.1371/journal.pone.0240326
33) Li Y, Twersky S, Ignace K, Zhao M, Purandare R, Bennett-Jones B, Weaver SR.
Constructing and Communicating COVID-19 Stigma on Twitter: A Content Analysis
of Tweets during the Early Stage of the COVID-19 Outbreak. Int. J. Environ. Res.
Public Health 2020, 17, 6847.
34) Medford RJ, Saleh SN, Sumarsono A, Perl TM, Lehmann CU. An "Infodemic":
Leveraging High-Volume Twitter Data to Understand Early Public Sentiment for the
Coronavirus Disease 2019 Outbreak. Open Forum Infectious Diseases, Volume 7,
Issue 7, July 2020, ofaa258, https://doi.org/10.1093/ofid/ofaa258
35) Al-Rawi A, Siddiqi M, Morgan R, Vandan N, Smith J, Wenham C. COVID-19 and
the gendered use of emojis on Twitter. J Med Internet Res 2020;22(11):e21646. doi:
10.2196/21646
36) Samuel, J, Mokhlesur Rahman GG, Ali N, Smaual Y, Pelaez A, Chong PHJ, Yakubov
M. Feeling Positive about Reopening? New Normal Scenarios from COVID-19 US
Reopen Sentiment Analytics. IEEE Access, vol. 8, pp. 142173-142190, 2020, doi:
10.1109/ACCESS.2020.3013933.
37) Odlum M, Cho H, Broadwell P, Davis N, Patrao M, Schauer D, Bales ME, Alcantara
C, Yoon S. Application of Topic Modeling to Tweets as the Foundation for Health
Disparity Research for COVID-19. Studies in Health Technology and Informatics,
2020;272: 24 – 27. doi: 10.3233/SHTI200484
38) Thelwall, M., Thelwall, S. COVID-19 tweeting in English: Gender differences.
Relaciones Públicas, 2020, 29(3). doi: https://doi.org/10.3145/epi.2020.may.01.
39) Xue J, Chen J, Chen C, Hu R, Zhu T. The Hidden Pandemic of Family Violence
During COVID-19: Unsupervised Learning of Tweets. J Med Internet Res
2020;22(11):e24361. doi: 10.2196/24361
40) Yigitcanlar T, Kankanamge N, Preston A, Gill PS, Rezayee M, Ostadnia M, Xia B,
Ioppolo G. How can social media analytics assist authorities in pandemic-related
policy decisions? Insights from Australian states and territories. Health Inf Sci Syst 8,
37 (2020). https://doi.org/10.1007/s13755-020-00121-9
41) Mohamad SM. Creative Production of 'COVID-19 Social Distancing' Narratives on
Social Media. Tijds. voor econ. en Soc. Geog., 111: 347-359. doi:
https://doi.org/10.1111/tesg.12430
42) Arpaci, I, Alshehabi, S, Al-Emran, M, Khasawneh, M, Mahariq, I, Abdeljawad, T,
Hassanien, AE. Analysis of twitter data using evolutionary clustering during the
COVID-19 pandemic. Computers, Materials & Continua, vol. 65, no.1, pp. 193–204,
2020.
43) Pastor CK. Sentiment Analysis of Filipinos and Effects of Extreme Community
Quarantine Due to Coronavirus (Covid-19) Pandemic. Available at SSRN 3574385
2020.
44) Boon-Itt S, Skunkan. Y. A text-mining analysis of public perceptions and topic
modeling during the COVID-19 pandemic using Twitter data.
JMIR Public Health Surveill 2020;6(4):e21978. doi: 10.2196/21978
45) Kurten S, Beullens K. #Coronavirus: Monitoring the Belgian Twitter Discourse on the
Severe Acute Respiratory Syndrome Coronavirus 2 Pandemic. Cyberpsychology,
Behavior, and Social Networking.ahead of print. doi;
http://doi.org/10.1089/cyber.2020.0341
46) De Santis, E, Martino, A, Rizzi, A. An Infoveillance System for Detecting and
Tracking Relevant Topics from Italian Tweets during the COVID-19 Event. IEEE
Access, vol. 8, pp. 132527-132538, 2020, doi: 10.1109/ACCESS.2020.3010033.
47) Essam BA, Abdo MS. How Do Arab Tweeters Perceive the COVID-19 Pandemic? J
Psycholinguist Res (2020). https://doi.org/10.1007/s10936-020-09715-6.
48) Lwin MO, Lu J, Sheldenkar A, Schulz PJ, Shin W, Gupta R, Yang Y. Global
Sentiments Surrounding the COVID-19 Pandemic on Twitter: Analysis of Twitter
Trends. JMIR Public Health Surveill 2020;6(2):e19447. doi: 10.2196/19447.
49) Chang A, Schulz PJ, Tu S, Liu MT. Blaming Devices in Online Communication of
the COVID-19 pandemic: Stigmatizing cues and negative sentiment gauged with
automated analytic techniques. Journal of Medical Internet Research.
02/10/2020:21504 (forthcoming/in press). doi: 10.2196/21504.
50) Das, S., Dutta, A. Characterizing public emotions and sentiments in COVID-19
environment: A case study of India. Journal of Human Behavior in the Social
Environment. doi: 10.1080/10911359.2020.1781015.
51) Chehal D, Gupta P, Gulati P. COVID-19 pandemic lockdown: An emotional health
perspective of Indians on Twitter. International Journal of Social Psychiatry. doi:
10.1177/0020764020940741.
52) Dheeraj, K. Analysing COVID-19 news impact on social media aggregation.
International Journal of Advanced Trends in Computer Science and Engineering.
2020, 9(3). doi: 10.30534/ijatcse/2020/56932020.
53) Park, HW, Park, S, Chong, M. Conversations and medical news frames on twitter:
Infodemiological study on COVID-19 in South Korea. J Med Internet Res
2020;22(5):e18897. doi: 10.2196/18897
54) Wicke P, Bolognesi MM. Framing COVID-19: How we conceptualize and discuss the
pandemic on Twitter. PLOS ONE 15(9): e0240010. doi:
https://doi.org/10.1371/journal.pone.0240010
55) Damiano, Amanda D, Catellier A, Jennifer R. A Content Analysis of Coronavirus
Tweets in the United States Just Prior to the Pandemic Declaration. Cyberpsychology,
Behavior, and Social Networking.ahead of print. doi:
http://doi.org/10.1089/cyber.2020.0425
56) Green, J, Edgerton. J, Naftel, D, Shoub, K, Cranmer, SJ. Elusive consensus:
Polarization in elite communication on the COVID-19 pandemic. Science Advances.
2020;6(28): eabc2717. doi: 10.1126/sciadv.abc2717.
57) Gozzi N, Tizzani M, Starnini M, Ciulla F, Paolotti D, Panisson A, Perra N. Collective
Response to Media Coverage of the COVID-19 Pandemic on Reddit and Wikipedia:
Mixed-Methods Analysis. J Med Internet Res 2020;22(10):e21597. doi:
10.2196/21597
58) Yu J, Lu Y, Muñoz-Justicia J. Analyzing Spanish News Frames on Twitter during
COVID-19-A Network Study of El País and El Mundo. Int. J. Environ. Res. Public
Health 2020, 17, 5414. doi: 10.3390/ijerph17155414.
59) Jelodar, H, Wang, Y, Orji, R, Huang, S. Deep Sentiment Classification and Topic
Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using
LSTM Recurrent Neural Network Approach. arXiv:2004.11695 [cs.IR]
60) Samuel J, Ali N, Mokhlesur Rahman GG, Esawi, E, Samuel, Y. COVID-19 public
sentiment insights and machine learning for tweets classification. Information 2020,
11, 314. doi: https://doi.org/10.3390/info11060314.
61) Chen Q, Min C, Zhang W, Wang G, Ma X, Evans R. Unpacking the black box: How
to promote citizen engagement through government social media during the COVID-
19 crisis. Comput Hum Behav 2020:106380.
62) Han X, Wang J, Zhang M, Wang X. Using Social Media to Mine and Analyze Public
Opinion Related to COVID-19 in China. International Journal of Environmental
Research and Public Health 2020;17(8):2788.
63) Zhao, Y, Cheng, S, Yu, X, Xu, H. Chinese public's attention to the COVID-19 epidemic
on social media: Observational descriptive study. J Med Internet Res
2020;22(5):e18825. doi: 10.2196/18825
64) Li J, Xu Q, Cuomo R, Purushothaman V, Mackey T. Data mining and content analysis
of the Chinese social media platform Weibo during the early COVID-19 outbreak:
retrospective observational infoveillance study. JMIR Public Health and Surveillance
2020;6(2):e18700.
65) Your Emotional Brain on Resentment, Part 2. Psych Central Professional. © 2020;
Available at: https://pro.psychcentral.com/your-emotional-brain-on-resentment-part-
2/.
66) Wang, T, Lu, K, Chow, KP, Zhu, Q. COVID-19 Sensing: Negative Sentiment Analysis
on Social Media in China via BERT Model. IEEE Access, vol. 8, pp. 138162-138169,
2020, doi: 10.1109/ACCESS.2020.3012595.
67) Xi, W., Xu, W., Zhang, X., & Ayalon, L. (2020). A thematic analysis of weibo topics
(Chinese twitter hashtags) regarding older adults during the COVID-19 outbreak. The
Journals of Gerontology: Series B.
68) Su, Y, Xue, J, Liu, X, Wu, P, Chen, J, Chen, C, Liu, T, Gong, W, Zhu, T. Examining
the Impact of COVID-19 Lockdown in Wuhan and Lombardy: A Psycholinguistic
Analysis on Weibo and Twitter. Int. J. Environ. Res. Public Health 2020, 17, 4552. doi:
https://doi.org/10.3390/ijerph17124552
69) Ma, R, Deng, Z, Wu, M. Effects of Health Information Dissemination on User Follows
and Likes during COVID-19 Outbreak in China: Data and Content Analysis. Int. J.
Environ. Res. Public Health 2020, 17, 5081. doi:
https://doi.org/10.3390/ijerph17145081
70) Xie T, Tan T, Li J. An Extensive Search Trends-Based Analysis of Public Attention on
Social Media in the Early Outbreak of COVID-19 in China. Risk Manag Healthc
Policy. 2020;13:1353-1364. doi: https://doi.org/10.2147/RMHP.S257473
71) Zhu B, Zheng X, Liu H, Li J, Wang P. Analysis of spatiotemporal characteristics of big
data on social media sentiment with COVID-19 epidemic topics. Chaos, Solitons &
Fractals. 2020;140. doi: https://doi.org/10.1016/j.chaos.2020.110123.
72) Kim B. Effects of social grooming on incivility in COVID-19. Cyberpsychology,
Behavior, and Social Networking 2020.
73) Budhwani, H, Sun, R. Exploring U.S. Shifts in Anti-Asian Sentiment with the
Emergence of COVID-19. Int. J. Environ. Res. Public Health 2020, 17, 7032.
74) Darling-Hammond S, Michaels EK, Allen AM, Chae DH, Thomas MD, Nguyen TT,
Mujahid MM, Johnson RC. After "The China Virus" Went Viral: Racially Charged
Coronavirus Coverage and Trends in Bias Against Asian Americans. Health Education
& Behavior. 2020;47(6):870-879. doi:10.1177/1090198120957949.
75) Nguyen TT, Criss S, Dwivedi P, Huang D, Keralis J, Hsu E, Phan L, Nguyen LH, Yardi
I, Glymour MM, Allen AM, Chae DH, Gee GC, Nguyen, QC. Exploring U.S. Shifts
in Anti-Asian Sentiment with the Emergence of COVID-19. Int. J. Environ. Res. Public
Health 2020, 17(19), 7032. doi: https://doi.org/10.3390/ijerph17197032
76) Jimenez‐Sotomayor MR, Gomez‐Moreno C, Soto‐Perez‐de‐Celis E. Coronavirus,
Ageism, and Twitter: An Evaluation of Tweets about Older Adults and COVID‐19.
Journal of the American Geriatrics Society. 2020 Apr 27.
77) Barrett AE, Michael C, Padavic I. Calculated Ageism: Generational Sacrifice as a
Response to the COVID-19 Pandemic. The Journals of Gerontology: Series B,
gbaa132, doi: https://doi.org/10.1093/geronb/gbaa132.
78) Gao J, Zheng P, Jia Y, Chen H, Mao Y, Chen S, et al. Mental health problems and
social media exposure during COVID-19 outbreak. Plos one 2020;15(4):e0231924.
79) Li S, Wang Y, Xue J, Zhao N, Zhu T. The impact of COVID-19 epidemic declaration
on psychological consequences: a study on active Weibo users. International journal of
environmental research and public health 2020;17(6):2032.
80) de Vreese, CH, Neijens P. Measuring media exposure in a changing communications
environment. Journal of communication methods and measures. 2016;10(2-3):69-80.
81) Qin L, Sun Q, Wang Y, Wu K, Chen M, Shia B, et al. Prediction of number of cases of
2019 novel coronavirus (COVID-19) using social media search index. International
journal of environmental research and public health 2020;17(7):2365.
82) Li C, Chen LJ, Chen X, Zhang M, Pang CP, Chen H. Retrospective analysis of the
possibility of predicting the COVID-19 outbreak from Internet searches and social
media data, China, 2020. Eurosurveillance 2020;25(10):2000199.
83) Peng Z, Wang R, Liu L, Wu H. Exploring urban spatial features of COVID-19
transmission in Wuhan based on social media data. ISPRS Int. J. Geo-Inf. 2020, 9, 402.
doi: https://doi.org/10.3390/ijgi9060402
84) Liu D, Wang Y, Wang J, Liu J, Yue Y, Liu W, Zhang F, Wang Z. Characteristics and
Outcomes of a Sample of Patients With COVID-19 Identified Through Social Media
in Wuhan, China: Observational Study. J Med Internet Res 2020;22(8):e20108. doi:
10.2196/20108.
85) O'Leary, DE, Storey, VC. A Google–Wikipedia–Twitter Model as a Leading Indicator
of the Numbers of Coronavirus Deaths. Intell Sys Acc Fin Mgmt. 2020; 27: 151– 158.
doi: https://doi.org/10.1002/isaf.1482
86) Zhu Y, Fu KW, Grépin KA, Liang H, Fung IC. Limited Early Warnings and Public
Attention to Coronavirus Disease 2019 in China, January–February, 2020: A
Longitudinal Cohort of Randomly Sampled Weibo Users. Disaster Medicine and Public
Health Preparedness. 2020 Apr 3:1-4.
87) Li L, Zhang Q, Wang X, Zhang J, Wang T, Gao T, et al. Characterizing the propagation
of situational information in social media during covid-19 epidemic: A case study on
weibo. IEEE Transactions on Computational Social Systems 2020;7(2):556-562.
88) Rufai SR, Bunce C. World leaders’ usage of Twitter in response to the COVID-19
pandemic: a content analysis. Journal of Public Health 2020.
89) Merkley E, Bridgman A, Loewen PJ, Owen T, Ruths D, Zhilin O. A Rare Moment of
Cross-Partisan Consensus: Elite and Public Response to the COVID-19 Pandemic in
Canada. Canadian Journal of Political Science/Revue canadienne de science politique
2020:1-8.
90) Netizen. Wikipedia. © 2020; Available at: https://en.wikipedia.org/wiki/Netizen.
91) Sutton J, Renshaw SL, Butts CT. COVID-19: Retransmission of official
communications in an emerging pandemic. PLOS ONE 15(9): e0238491. doi:
https://doi.org/10.1371/journal.pone.0238491.
92) Wang, Y, Hao, H, Platt, LS. Examining risk and crisis communications of government
agencies and stakeholders during early-stages of COVID-19 on Twitter. Computers in
Human Behavior. 114. doi: https://doi.org/10.1016/j.chb.2020.106568
93) Basch CH, Hillyer GC, Meleo-Erwin ZC, Jaime C, Mohlman J, Basch CE. Preventive
behaviors conveyed on YouTube to mitigate transmission of COVID-19: cross-
sectional study. JMIR public health and surveillance 2020;6(2):e18807.
94) Basch CE, Basch CH, Hillyer GC, Jaime C. The role of YouTube and the
entertainment industry in saving lives by educating and mobilizing the public to adopt
behaviors for community mitigation of COVID-19: successive sampling design study.
JMIR Public Health and Surveillance 2020;6(2):e19145.
95) Hakimi AA, Armstrong WB. Hand Sanitizer in a Pandemic: Wrong Formulations in
the Wrong Hands. The Journal of Emergency Medicine. 2020. Doi:
https://doi.org/10.1016/j.jemermed.2020.07.018.
96) Khatri P, Singh S, Belani NK, Leng YY, Lohan R, Wei LY, et al. YouTube as source
of information on COVID-19 outbreak: a cross sectional study of English and Mandarin
content. Travel medicine and infectious disease 2020:101636.
97) Hernández-García I, Giménez-Júlvez T. Characteristics of YouTube Videos in
Spanish on How to Prevent COVID-19. Int. J. Environ. Res. Public Health 2020,
17(13), 4671. doi: https://doi.org/10.3390/ijerph17134671.
98) Moon, H, Lee, GH. Evaluation of Korean-language COVID-19-related medical
information on YouTube: Cross-sectional infodemiology study. J Med Internet Res
2020;22(8):e20775. doi: 10.2196/20775
99) Ozdede M, Peker I. Analysis of Dentistry YouTube Videos Related To COVID-19.
Braz. Dent. J. [Internet]. 2020 Aug [cited 2020 Nov 18] ; 31( 4 ): 392-398.
Available from: http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0103-
64402020000400392&lng=en. Epub Sep 04, 2020. http://dx.doi.org/10.1590/0103-
6440202003767.
100) Yüce MÖ, Adalı E, Kanmaz B. An analysis of YouTube videos as educational
resources for dental practitioners to prevent the spread of COVID-19. Ir J Med Sci
(2020). doi: https://doi.org/10.1007/s11845-020-02312-5
101) Rhianna Schmunk. B.C. records highest number of fatal overdoses in a single
month, with 170 deaths. 2020; Available at: https://www.cbc.ca/news/canada/british-
columbia/overdose-deaths-bc-1.5607792.
Author(s) Yea Origin Aim/Them Social Study Population / Methods Key Findings
r e Media Sample Size
Liu D, Wang 2020 China Detection Weibo Weibo messages • Gathered data via • Older age, diffuse distribution, and
Y, Wang J, or between Jan 20 – Feb 15, Weibo, then hypoxemia are factors that can
Liu J, Yue Y, prediction 599 patients participated followed up with help clinicians identify patients
Liu W, Zhang of COVID- telephone with COVID-19 who have poor
F, Wang Z. 19 cases • Statistical analysis prognosis
taken with Fisher • Aggregated data from social media
exact test can also be comprehensive,
• Rates of death by immediate, and informative in
Kaplan-Meier disease prognosis.
method
• Used multivariate
Cox regression to
determine risk
factor for mortality
O'Leary, DE, 2020 USA Detection Google, Google Trend searches • Regression analysis • To model of the number of cases,
Storey, VC. or Wikipedia, for “coronavirus” and the current Wikipedia page views,
prediction Twitter “COVID‐19” started on 1‐week‐ahead tweets, and 2‐week
of COVID- 21 January 2020 to 5 ahead Google searches were
19 cases April 2020. Wikipedia employed.
page views for • To model of the number of deaths,
“coronavirus” and each variable was placed 1 week
“COVID‐19” started on further ahead.
12 January 2020 to 5
April 2020. The number
of Twitter original
tweets, starting on 27
January 2020 to 5 April
2020. The numbers of
COVID-19 cases and
deaths in the USA from
www.worldometers.info
Peng Z, Wang 2020 China Detection Weibo 1,200 records • Spatiotemporal • Elderlies are the group of high-risk
R, Liu L, Wu or distribution of and high-prevalence in the COVID-
H. prediction COVID-19 cases in 19 outbreak, account for > 50% of
of COVID- the main urban area the total number of Weibo help
19 cases of Wuhan. seekers.
• Kernel Density • Early transmission of COVID-19 in
Analysis. Wuhan could be divide into three
• Ordinary Least phrases: Scattered infection,
Square Regression community spread, and full-scale
outbreak.
Cuilian Li, Li 2020 China Disease Google Keywords "coronavirus" • Lag correlation • Lag correlations showed a max
Jia Chen, prediction Trends, and "pneumonia" correlation at 8-12 days for lab-
Xueyu Chen, Baidu searched, and trend data confirmed cases & 6-8 days for
Mingzhi Index, Sina collected from Google suspected cases.
Zhang, Chi Weibo Trends, Baidu Index, &
Pui Pang, Index Sina Weibo Index from
Haoyu Chen Jan 2 to Feb 20, 2020
Lei Qin, 2020 China Disease Baidu SMSI for dry cough, • Subset selection • New suspected COVID-19 case
Qiang Sun, prediction search fever, chest distress, • Forward selection. numbers correlated significantly
Yidan Wang, index (BSI) coronavirus, and • Lasso regression. with the lagged series of SMSI.
Ke Fei Wu, pneumonia from Dec 31, • Ridge regression. • SMSI could detected 6 - 9 days
Mingchih 2019 to Feb 9, 2020. • Elastic net earlier than new suspected cases.
Chen, Ben New suspected cases of
Chang Shia, COVID-19 data from Jan
Szu Yuan Wu 20 to Feb 9, 2020.
Yuner Zhu, 2020 China Disease Sina Weibo 1,101 COVID-19 related • Descriptive • Attention to COVID-19 was
King-Wa Fu, prediction Weibo posts from Dec statistics: numbers, limited until China openly admitted
Karen A 31, 2019 to Feb 12, 2020 percentage human-to-human transmission on
Grépin, Hai • Time series Jan 20, 2020.
Liang, Isaac analysis • Attention quickly increased and
Chun-Hai remained high over time.
Fung
Charles E 2020 United Governme YouTube 100 most widely viewed • Descriptive • < 50% the videos in either sample
Basch, Corey States nt YouTube videos as of analysis: frequency, covered any of the prevention
H Basch, responses Jan 31 & Mar 20, 2020 percent, mean, behaviours recommended by the
Grace C with keywords standard deviation US CDC.
Hillyer, "Coronavirus" in
Christie Jaime English, with English
subtitles, or in Spanish
Eric Merkley, 2020 Canada Governme Twitter, 33,142 tweets from 292 • Linear regression • No MPs from any party
Aengus nt Google social media accounts of downplaying the pandemic.
Bridgman, responses Trends federal MPs from Jan 1 • No association between
Peter John to Mar 28, 2020. 87 Conservative Party vote share &
Loewen, Google search trends for Google search interest in the
Taylor Owen, the search term coronavirus.
Derek Ruths, "coronavirus" in the 1st
Oleg Zhilin half (1-14) and 2nd half
(15-31) of Mar,
2020. A survey of 2,499
Canadian citizens ≥ 18 yr
from Apr 2 to 6, 2020
Lifang Li, 2020 China Governme Sina Weibo 36,746 Weibo data from • Linear regression. • Classify theCOVID-19-related
Qingpeng nt Dec 30, 2019 to Feb 01, SVM. NB. NLP information into 7 types of
Zhang, Xiao responses 2020; a random sample situational information & their
Wang, Jun of 3,000 Weibo as predictors.
Zhang, Tao training dataset
Wang, Tian-
Lu Gao, Wei
Duan, Kelvin
Kam-fai Tsoi,
Fei-Yue Wang
Priyanka 2020 Governme Youtube 150 videos on Feb 1, 2 • Descriptive • Mean DISCERN score for
Khatri, nt for “2019 novel analysis: percent, reliability was 3.12/5 for English
Shweta R responses coronavirus” (50), mean and 3.25/5 for Mandarin videos.
Singh, Neeta “Wuhan virus” in • DISCERN score • Mean cumulative MICI score of
Kesu Belani, English (50), and • MICI score useful videos was 6.71/25 for
Yin Leng Mandaring (50) English & 6.28/25 for Mandarin.
Yeong, Rahul
Lohan, Yee
Wei Lim,
Winnie ZY
Teo
Sohaib R 2020 United Governme Twitter 203 viral tweets from G7 • Qualitative design - • 82.8% were “informative.”
Rufai, Catey States nt world leaders from Nov content analysis • 9.4% were “morale-boosting.”
Bunce responses 17, 2019 to Mar 17, 2020 • 6.9% were “political.”
with keywords "COVID-
19" or "coronavirus" and
a minimum of 500
"likes."
Sutton J, 2020 USA Governme Twitter 690 accounts • Chi-square • Systematic changes in message
Renshaw SL, nt representing public analyses strategies over time and identify
Butts CT. responses health, emergency • Negative binomial key features that affect message
management and elected regression passing, both positively and
officials and 149,335 modeling negatively.
tweets • The results have the potential to aid
in message design strategies as the
pandemic continues, or in similar
future events.
Wang, Y, 2020 USA Governme Twitter 13,598 COVID-19 • Temporal analysis • 16 categories of message types
Hao, H, Platt, nt related tweets from • Networking were manually annotated.
LS. responses January 1, 2020 to April analysis • Inconsistencies and incongruencies
27, 2020 were identified in four critical
topics—wearing masks,
assessment of risks, stay at home
order, and disinfectant and
sanitizer.
• Network analysis revealed
increased communication
coordination over time.
Ahmed W, 2020 UK Infodemics Twitter 22,785 tweets and 11,333 • Social network • The most important drivers of the
López Seguí Twitter users with analysis conspiracy theory are ordinary
F, Vidal- #FilmYourHospital • User analysis citizens.
Alaball J, hashtag April 13-20, • YouTube was the information
Katz MS. 2020 source most linked to by users.
The most retweeted post belonged
to a verified Twitter user.
Brennen, J.S., 2020 UK Infodemics Digital 96 samples of visuals • Qualitative coding • Organized all findings into six
Simon, F.M., visual from January 2020 trends – authoritative agency,
Nielsen, R.K. media through March 2020 virulence, medical efficacy,
intolerance, prophecy, satire
• A small number of manipulated
visuals, all were produced using
simple tools; there were no
examples of “deepfakes” or other
artificial intelligence-based
techniques.
Bruns, A, 2020 Australia Infodemics Facebook 89,664 distinct Facebook • Time series. • Significantly greater volume of 5G
Harrington, S, posts from 1 January to Network analysis rumour posts on Facebook after
Hurcombe, E. 12 April 2020 March 19, 2020.
• Network analysis revealed that
coalition of various groups were
brought together by conspiracy
theories about COVID-19 and 5G
technology.
Cristina M 2020 Spain Infodemics Twitter 942 valid tweets between • Communicative • Misinformation tweeted more but
Pulido, Feb 6 & 7, 2020 content analysis retweeted less than science-based
Beatriz evidence tweets.
Villarejo- • Science-based evidence tweets had
Carballido, more engagement than
Gisela misinformation.
Redondo-
Sama, Aitor
Gómez
Galhardi CP, 2020 Brazil Infodemics WhatsApp, Fake news collected • Quantitative • WhatsApp is the primary channel
Freire NP, Instagram, from March 17 to April content analysis for sharing fake news, followed by
Maria Minayo Facebook 10, 2020 based on the Eu Instagram and Facebook.
CS, Fagundes, Fiscalizo app’s data
MCM.
Gallotti R, 2020 Italy Infodemics Twitter > 100 million Tweets • Developed an • Before the rise of COVID-19
Valle F, Infodemic Risk cases, entire countries expose
Castaldo N, Index measurable waves of potentially
Sacco P, De unreliable information, posing a
Domenico M. serious threat to public health.
Islam MS, 2020 Bangladesh Infodemics Online 2,311 reports related to • Descriptive • Misinformation fueled by rumors,
Sarkar T, platforms, COVID-19 related analysis stigma, and conspiracy theories
Khan SH, websites, infodemic reports • Spatial distribution can have potentially severe
Mostofa Facebook, between Dec 31, 2019 – analysis implications on public health if
Kamal AH, Twitter, April 5, 2020 prioritized over scientific
Hasan SMM, online guidelines.
Kabir A, papers
Yeasmin D, • Governments and other agencies
Islam MA, must understand the patterns of
Amin COVID-19–related rumors,
Chowdhury stigma, and conspiracy theories
KI, Anwar circulating the globe so that they
KS, Chughtai can develop appropriate risk
AA, Seale H. communication messages.
Moscadelli A, 2020 Italy Infodemics “fake 2102 articles between • Social media trend • Links containing “fake news” were
Albora G, news” and December 31, 2019 and analysis using the shared 2,352,585 times, accounting
Biamonte correspondi April 30, 2020 BuzzSumo for 23.1% of all reviewed articles.
MA, Giorgetti ng verified Application
D, Innocenzio news
M, Paoli S, circulated
Lorini C, in Italy
Bonanni P,
Bonaccorsi B.
Ramez 2020 Lebanon Infodemics Twitter 673 English tweets • Descriptive • 153 tweets (24.8%) had
Kouzy, Joseph collected on Feb 27, statistics misinformation.
Abi Jaoude, 2020 • Bar chart • 107 (17.4%) had unverifiable
Afif Kraitem, • Chi-square statistic information.
Molly B El to calculate p-value • Misinformation rate higher in
Alam, Basil (2-sided; 0.05) for informal individual/group accounts
Karam, Elio the association (33.8%)
Adib, Jabra between
Zarka, Cindy account/tweet
Traboulsi, characteristics and
Elie W Akl, the presence of
Khalil misinformation or
Baddour unverifiable
information about
COVID-19
Rovetta A, 2020 Italy Infodemics Google, 2 million Google Trends • Classification of • Globally, there is a growing
Bhagavathula Instagram queries and Instagram infodemic interest in COVID-19, and
AS. hashtags February 20, monikers numerous infodemic monikers
2020, to May 6, 2020 • Computed the continue to circulate on the
average peak internet.
volume with a 95%
CI
Wasim 2020 United Infodemics Twitter 10,140 tweets collected • Descriptive • 34.8% linked 5G & COVID-19.
Ahmed, Josep Kingdom from 19:44 UTC on statistics: numbers, • 32.2% denounced the conspiracy
Vidal-Alaball, Friday, Mar 27, 2020 to percent theory.
Joseph 10:38 UTC on Saturday, • Social network
Downing, Apr 04, 2020 analysis
Francesc • Content analysis
López Seguí
Uyheng J, 2020 USA, Infodemics Twitter 12 million tweets from • Hate speech scored • Analysis reveals idiosyncratic
Carley KM. Philippines 1.6 million users from assigned to each relationships between bots and
the US and 15million tweet using ML hate speech across datasets,
tweets from 1 million algorithm highlighting different network
users from the • Bot scores were dynamics of racially charged
Philippines between Mar assigned to each toxicity in the US and political
5 and Mar 19, 2020 user via BotHunter conflicts in the Philippines.
algorithm • Bot activity is linked to higher hate
• Social media in both countries, especially in
analysis via ORA communities which are denser and
software more isolated from others.
• Network analysis
via centrality
analysis
• Cluster analysis via
Leiden algorithm
Junling Gao, 2020 China Mental Sina Weibo Online survey on • Multivariable • Frequently SME positively
Pinpin Zheng, Health Wenjuanxing platform logistic regression associated with high odds of
Yingnan Jia, from Jan 31 to Feb 2, anxiety (OR = 1.72, 95% CI: 1.31 -
Hao Chen, 2020; with 4,872 2.26) & CDA (OR = 1.91, 95%
Yimeng Mao, Chinese citizens ≥ 18 yr CI: 1.52 - 2.41)
Suhong Chen, from 31 provinces and
Yi Wang, Hua autonomous regions in
Fu, Junming China
Dai
Sijia Li, Yilin 2020 China Mental Sina Weibo Weibo posts from 17,865 • Sentiment analysis • Negative emotions & sensitivity to
Wang, Jia Health active Weibo users • Paired sample t-test social risks increased.
Xue, Nan
Zhao, between 13 Jan to 26 • Scores of positive emotions & life
Tingshao Zhu Jan, 2020 satisfactions decreased.
Hakimi AA, 2020 USA Prevention YouTube The first 100 videos on • Codified video • Most of the videos failed to
Armstrong Education YouTube with the most content describe labeling storage
WB. in Videos views using the search • Assessed using containers, 69% of videos
term “DIY hand Cohen kapa encouraged the use of oils or
sanitizer.” • Descriptive perfumes to enhance hand sanitizer
statistics calculated scent, and 2% of videos promoted
• Assessed by Chi- the use of coloring agents to be
square test with 2- more attractive for use among
sided p-value <0.5 children specifically.
• Significantly increased average
number of daily calls to poison
control centers regarding unsafe
pediatric hand sanitizer exposure
since the first confirmed COVID-
19 patient in the US.
Abbreviations:
* BERT = Bidirectional Encoder Representations from Transformers
* CDA: combination of depression and anxiety
* CDC: Centres for Disease Control and Prevention
* GQS = Global Quality Score
* JAMAS = Journal of the American Medical Association Score
* LDA: latent Dirichlet allocation
* MICI = Medical Information and Content Index
* MP: members of Parliament
* M-SDL: multiple-information susceptible-discussing-immune
* NB: naïve Bayes
* NLP: natural language processing
* RF: random forecast
* SME: social media exposure
* SMSI: social media search index
* SVM: support vector machine
* TCCI = Title–Content Consistency Index
* TF-IDF = Term Frequency-Inverse Document Frequency