The public information needs of COVID-19 vaccine: A study based on online Q&A communities and portals in China

Lin Wang1, Zuquan Xian2* and Tianyu Du2
1 Chinese Academy of Science and Education Evaluation, Hangzhou Dianzi University, Hangzhou, China, 2 School of Management, Tianjin Normal University, Tianjin, China

REVIEWED BY
Gill Ten Hoor, Maastricht University, Netherlands
Yihan Lu, Fudan University, China

*CORRESPONDENCE
Zuquan Xian
xianzuquan@163.com

SPECIALTY SECTION
This article was submitted to Health Psychology, a section of the journal Frontiers in Psychology

RECEIVED 04 June 2022
ACCEPTED 16 September 2022
PUBLISHED 10 October 2022

CITATION
Wang L, Xian Z and Du T (2022) The public information needs of COVID-19 vaccine: A study based on online Q&A communities and portals in China. Front. Psychol. 13:961181. doi: 10.3389/fpsyg.2022.961181

COPYRIGHT
© 2022 Wang, Xian and Du. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

Purpose: This study analyzes the topic and distribution features of public information needs for the COVID-19 vaccine from Chinese online Q&A communities and portals. It aims to identify the features and differences in public COVID-19 vaccine information needs at different periods.

Design/Methodology: A total of 14,296 questions about the COVID-19 vaccine from four Chinese mainstream online communities and portals were studied following five procedures: data collection, data processing, K-means clustering, LDA topic model analysis, and needs identification.

Findings: The study identified the topical features of public information needs for the COVID-19 vaccine during the first pandemic outbreak, pre-listing period, and post-listing period. It constructed a framework of public vaccine information needs. The information needs can be classified into 8 main categories and 16 subcategories. The eight main categories are vaccination (53.72%), evaluation and impact of other social events (17.90%), vaccine R&D and listing (9.49%), vaccine side effects and countermeasures (5.63%), vaccination necessity (4.98%), vaccine patent exemption (3.26%), vaccination effectiveness (2.94%), and essential knowledge of vaccine (2.08%), where each percentage is the share of the 14,296 questions that falls under that category.

Implications: Online communities and portals should provide dynamic and tailored information services according to changing public vaccine information needs. The public information needs regarding vaccination are prominent and should be addressed first. In the follow-up booster vaccination efforts, government health departments should prioritize susceptible groups, such as overseas students, airport workers, and healthcare workers.

Originality/Value: We built a conceptual framework using data mining techniques and analyzed the COVID-19 vaccine information needs distribution at different time points and among different social groups, focusing on the theme of public information needs for the COVID-19 vaccine. The study makes recommendations for government health departments and online platforms to improve the quality of COVID-19 vaccine information services for the public and provides a reference for the vaccination of COVID-19 booster shots.
KEYWORDS
online Q&A communities, COVID-19 vaccine, vaccine information needs, data mining,
topic model
During the COVID-19 pandemic, the public’s need for COVID-19 vaccine information was high, but no research on the topic has been found. In addition, studies on health information needs mainly employ traditional research methods, such as questionnaire surveys, interviews, and content analysis, which have several drawbacks. For example, the number of research subjects is limited, and the research results may be affected by unrepresentative samples. In this study, we applied the web crawler as the data collection method to ensure enough sample data for extensive data analysis. It has proved an effective way of obtaining more generalizable research results in social sciences (Zhao and Li, 2022). The findings of this study can illustrate the characteristics of the COVID-19 vaccine information needs of the general public in China and help the government and stakeholders improve information service quality and health policies.

Research on influencing factors of vaccination intention

Factors influencing vaccination intention are a common topic in vaccine information needs. In the field of the HPV vaccine, Galvin et al. (2021) evaluated the correlation between HPV vaccine information quality and users’ vaccination by testing the exposure of users and their children to HPV vaccine information on social media. They found that increasing the number of positive messages and information credibility promotes users’ willingness to receive the vaccine. Zhou (2020) set up two scenario experiments and administered questionnaires to 200 randomly selected citizens. The results demonstrated that the presentation of negative information about vaccines had no significant correlation with the willingness to vaccinate against HPV. However, the psychological risk of infection and disease severity were significantly related to vaccination intention. Kwon et al. (2010) found that the influencing factors of public intention to vaccinate against influenza A (H1N1) mainly included the fear of influenza A (H1N1), the likelihood of contracting the virus, prioritization in the production of novel influenza vaccines, and the effectiveness of the vaccine. In addition, some scholars have studied the influencing factors of vaccination intention for different public groups. For example, Nikula et al. (2009) conducted focus group discussions and interviews with 40 healthcare professionals, students, and clients and reported that vaccinators’ professional conduct, education, client conduct, and vaccination environment impact vaccination. Kalaij et al. (2021) analyzed 16 relevant research articles in three databases (PubMed, Scopus, and Cochrane) on factors associated with childhood vaccination in Southeast Asia. They identified that parental, personal-related, children and family status-related, socioeconomic, and healthcare-related factors strongly influence subjects’ immunization. Huang et al. (2018) adopted a multi-stage random sampling method to select 652 parents whose children were aged 3–10 years old in Nanhai District, Foshan City, China. They used a face-to-face questionnaire to analyze parents’ cognition of the varicella vaccination and its influencing factors. They concluded that children’s age, parents’ education level, children’s history of varicella, acceptable vaccine price, and vaccine information accessibility were the factors influencing the vaccination rate of children.

The above-mentioned studies have discussed the public’s willingness to vaccinate against various diseases and their influencing factors. However, previous studies have not yet addressed the public’s willingness to vaccinate against COVID-19. It is believed that meeting information needs can alleviate information and promote vaccination willingness in public health emergencies (Maire et al., 2021). Therefore, we will analyze the levels and features of the public’s COVID-19 vaccine information needs at different stages and divide such needs into several categories by constructing the public’s COVID-19 vaccine information need framework. Expected findings will contribute to research on factors influencing COVID-19 vaccination willingness.

Research questions

The public information needs of the COVID-19 vaccine are examined based on Chinese mainstream online Q&A communities and portals. The study mainly focuses on the following questions: (1) What is the classification framework of public COVID-19 vaccine information needs under Chinese online Q&A communities and portals? What is the proportion of each need category? (2) What are the distribution characteristics of public COVID-19 vaccine information needs (including topic distribution and temporal distribution)? What social phenomena do these characteristics reflect? (3) What about the distribution of COVID-19 vaccine information needs among different social groups? What insights could we gain from it? Through the above exploration, we can effectively grasp the patterns and characteristics of public information needs for the COVID-19 vaccine and provide reference and guidance for governments and relevant operators to understand the status quo better, optimize their service level, and impel the subsequent vaccination of COVID-19 booster shots.

Methods

In this study, we analyze the data on the public information needs of the COVID-19 vaccine from four online Q&A communities and portals by utilizing the text mining method. The specific analysis procedure is displayed in Figure 1. It includes five steps: data collection and examination, data processing, converting the raw data into a Document Term
Matrix (DTM) that can be recognized and processed by computers, analyzing the DTM by K-means clustering and LDA topic model, and synthesizing the results generated by both and identifying the public information needs of COVID-19 vaccine (Mi et al., 2021).

Data collection and processing

Data source and collection

We collected data on the COVID-19 vaccine information needs from the platforms of “Zhihu,” “Baidu Post,” “39Health.com,” and “Chunyu Doctor.” The “Zhihu” and “Baidu Post” platforms are the mainstream online Q&A communities, and “39Health.com” and “Chunyu Doctor” are two mainstream healthcare portals in China. The four platforms together have over 1.03 billion registered users, of which over 503 million are active. The data on these platforms can fairly reflect the information requirements of the Chinese public. Therefore, the questions on these platforms reflect the public information needs regarding the COVID-19 vaccine. We constructed the dataset as 14,296 questions along with their number of answers, the number of followers, and questioning time on these four platforms from 23 January 2020 to 15 July 2021. We searched these data using Octopus collector version V8.1 (https://www.bazhuayu.com) with the keyword “COVID vaccine” to crawl and de-duplicate the data. The data were then saved in a CSV file.

Data processing

We imported the data, professional thesaurus, and stop words list in the data processing step. We programmed in the R language, used the RStudio development environment, and imported the data through “read.csv()”. Baidu medical thesaurus (https://shurufa.baidu.com/dict_list?cid=217), which contains 65,462 words, was adopted. We also adopted the HIT (Harbin Institute of Technology) stop words list (https://github.com/goto456/stopwords/blob/master/hit_stopwords.txt), which contains 676 stop words. The Baidu medical thesaurus and HIT stop word list are widely used by Chinese scholars (Wu, 2013; Xu, 2013; Yang, 2013; Hu, 2018; Li, 2019; Zhu, 2020). Then, we carried out word segmentation. Chinese word segmentation methods mainly include lexicon-based and statistics-based word segmentation (Zong et al., 2019). We adopted the lexicon-based word segmentation method, realizing the word segmentation with the jiebaR package developed in the R language. Stop words had to be removed during this process to ensure that the textual features were extracted correctly. After word segmentation, the data were transformed into a corpus, and the corpus into a Document Term Matrix (DTM) by the “DocumentTermMatrix()” method. The DTM is a two-dimensional matrix in which the first row represents all feature words in the corpus, the first column represents the serial number of users’ question data, and each matrix value represents the number of times a feature word occurs in a document. The initial DTM has many dimensions. To improve the algorithm’s running speed and clustering accuracy, feature screening and extraction of the initial DTM are required. The standard methods are principal component analysis (PCA), singular value decomposition (SVD), and manual feature screening. In this study, we set thresholds for word frequency and word length in the DTM to filter features and retained 837 feature words with word frequency higher than 10 and word length longer than one.

K-means clustering

The clustering algorithm is an unsupervised learning algorithm that studies how to classify objects, including the K-means clustering algorithm, density-based clustering

FIGURE 1
Procedure for analyzing the public COVID-19 vaccine information needs.
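The data processing step described above (stop word removal, DTM construction, and screening by word frequency and word length) can be illustrated with a minimal sketch. It is written in Python rather than the R toolchain the authors report, it assumes the documents have already been segmented into tokens, and the function name `build_dtm`, its parameters, and the toy corpus are hypothetical. It is not the authors' implementation.

```python
from collections import Counter


def build_dtm(docs, stop_words, min_freq=10, min_len=1):
    """Build a document-term matrix from pre-segmented documents.

    docs is a list of token lists (segmentation is assumed done already).
    A term is kept only if its corpus frequency is higher than min_freq
    and its length is longer than min_len, mirroring the two thresholds
    described in the text.
    """
    # Drop stop words before any counting.
    cleaned = [[t for t in doc if t not in stop_words] for doc in docs]
    corpus_freq = Counter(t for doc in cleaned for t in doc)
    # Feature screening: keep terms that pass both thresholds.
    vocab = sorted(
        t for t, c in corpus_freq.items() if c > min_freq and len(t) > min_len
    )
    index = {t: j for j, t in enumerate(vocab)}
    matrix = []
    for doc in cleaned:
        row = [0] * len(vocab)
        for t in doc:
            if t in index:
                row[index[t]] += 1  # count of term t in this document
        matrix.append(row)
    return vocab, matrix


# Hypothetical toy corpus; thresholds lowered so that something survives.
docs = [
    ["vaccine", "appointment", "the", "vaccine"],
    ["vaccine", "side", "effects", "a"],
    ["appointment", "fees", "the"],
]
stop_words = {"the", "a"}
vocab, dtm = build_dtm(docs, stop_words, min_freq=1, min_len=1)
print(vocab)
print(dtm)
```

Each row of `dtm` corresponds to one question and each column to one retained feature word. With the thresholds reported in the study (`min_freq=10`, `min_len=1`) the same screening would be applied to the full corpus.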
curve is the optimal number of clusters. The u_k in the equation

LDA topic model probability diagram.
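The elbow heuristic mentioned in the K-means passage can be sketched as follows. This is an illustration under stated assumptions: the curve is taken to be the within-cluster sum of squared errors (SSE) plotted against the number of clusters, which is the usual choice but is not confirmed by the surviving text. The deterministic farthest-point initialisation, the function names, and the toy points are hypothetical and written in Python, not the authors' R code.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=50):
    """Plain K-means; returns the final centers and the SSE."""
    # Deterministic farthest-point initialisation (an assumption made
    # here so that the example is reproducible).
    centers = [points[0]]
    while len(centers) < k:
        centers.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        )
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster (keep it if empty).
        new_centers = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    sse = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, sse


def elbow_curve(points, max_k):
    """SSE for k = 1..max_k; the bend in this curve suggests k."""
    return {k: kmeans(points, k)[1] for k in range(1, max_k + 1)}


# Hypothetical toy data: three well-separated groups of three points.
points = [
    (0, 0), (0, 1), (1, 0),
    (10, 10), (10, 11), (11, 10),
    (20, 0), (20, 1), (21, 0),
]
curve = elbow_curve(points, 4)
for k, sse in curve.items():
    print(k, round(sse, 3))
```

On this toy data the SSE falls steeply up to k = 3 and only slightly afterwards, so the bend of the curve sits at three clusters. On a real DTM the rows of the matrix would take the place of `points`.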
reduction tool. After the LDA model is trained, the document can be represented in the topic space. Document processing in word space can thus be done in topic space using the LDA model. On the other hand, collaborative filtering, document similarity calculation, and text segmentation can be accomplished with the parameter estimates of the topic model. In this study, we applied the LDA topic model to analyze the textual data of the public information needs of the COVID-19 vaccine for the subsequent topic discovery. LDA assumes that a “topic-word” distribution parameter is generated for each topic: $\varphi_k \sim \mathrm{Dir}(\beta)$.

TABLE 1 Explanation of the parameters in LDA topic model probability diagram.
Symbols | Meaning
$M$ | Number of documents
$K$ | Number of topics
$V$ | Number of words (word list dimension)
$\alpha$ | The prior distribution hyperparameters of $\theta_m$
$\beta$ | The prior distribution hyperparameters of $\varphi_k$
$\theta_m$ | Topic distribution parameter of the m-th document
$\varphi_k$ | The word distribution parameters for the k-th topic
$N_m$ | Length of the m-th document
$z_{m,n}$ | The topic corresponding to the n-th word of the m-th document
$w_{m,n}$ | The lexical term corresponding to the n-th word of the m-th document
$z_m = \{z_{m,n}\}_{n=1}^{N_m}$ | The topic sequence corresponding to the m-th document
$w_m = \{w_{m,n}\}_{n=1}^{N_m}$ | The lexical word sequence corresponding to the m-th document
$w = \{w_m\}_{m=1}^{M}$ | The word sequence corresponding to the document set
$z = \{z_m\}_{m=1}^{M}$ | The topic sequence corresponding to the document set

terms that appear in document d. IDF refers to the inverse document frequency. Its calculation is illustrated in Equation (4), where $|D|$ indicates the total number of documents in the document set, and $|\{d \in D : t \in d\}|$ indicates the number of documents containing the word t in the document set (Zhang et al., 2019). TF-IDF means the product of TF and IDF, as detailed in Equation (5). The extracted keywords were analyzed, and a word cloud was drawn. In addition, it was essential to conduct a statistical analysis of the public COVID-19 vaccine information needs pattern. Therefore, the keyword co-word network and bar chart were drawn to visualize the results.

$$\mathrm{tf}(t, d) = \frac{f(t, d)}{\sum_{k} f(w_k, d)} \tag{3}$$

$$\mathrm{idf}(t, d) = \lg \frac{|D|}{|\{d \in D : t \in d\}|} \tag{4}$$

$$\mathrm{tf\_idf}(t, d) = \mathrm{tf}(t, d) \times \mathrm{idf}(t, d) \tag{5}$$

Results

Basic features of public information needs of COVID-19 vaccine

By crawling and examining the data on the information needs of COVID-19 vaccine from the platforms of “Zhihu,” “Baidu Post,” “39Health.com,” and “Chunyu Doctor” with the keyword of “COVID-19 vaccine”, a total of 14,296 questions, as well as their number of answers, number of followers, and questioning time, were collected. The basic statistics of these data are demonstrated in Table 2.

The statistical data show that the standard deviation of the number of followers is the largest. The standard deviation of the number of answers takes second place, reflecting that the public needs for different types of COVID-19 vaccine information varied greatly. To investigate the focus of the COVID-19 vaccine’s public information needs, we set a threshold value of 10 for the number of followers and answers of the question data to extract questions with more than 10 followers and answers. Then, we analyzed these data to obtain a word cloud diagram of the topics with serious concerns and answers, which contained the top 50 topic keywords, as displayed in Figure 3. We can find from the graph that the “COVID-19 vaccine” is the gist of social concerns, and “vaccination,” “research and development,” and “evaluation” are significant aspects of public information needs. In addition, the keyword “experiment” implied that the public follows the vaccine R&D progress by accessing information related to COVID-19 vaccine trials. The keyword “antibody” indicates that the public paid great attention to the concentration of antibodies produced by COVID-19 vaccines, which helps them judge the effectiveness of various vaccines. “AstraZeneca” is a world-renowned COVID-19 vaccine supplier. The public attaching importance to information about “AstraZeneca” meant that the Chinese public was inclined to compare domestic and foreign vaccines to choose the most suitable for them.

We classified the evolution of public vaccine information needs into three stages according to the critical time points of the pandemic development: the first pandemic outbreak period (23.01.2020–08.04.2020), the vaccine pre-listing period (09.04.2020–31.12.2020), and the vaccine post-listing period (01.01.2021–15.07.2021). The division between the first pandemic outbreak period and the vaccine pre-listing period is based on the date the lockdown in Wuhan, China, was lifted. The division between the vaccine pre-listing and post-listing periods relied on when Sinopharm, China’s first COVID-19 vaccine R&D and production company, received marketing authorization from the National Medical Products Administration. The public information needs for the COVID-19 vaccine were higher during the first pandemic outbreak period and significantly lower during the vaccine pre-listing period, with an average of only 15.44 questions per day. It indicates that people’s lives gradually returned to normal with the overall improvement of China’s pandemic prevention and control situation. Hence, the information needs for the COVID-19 vaccine decreased significantly. In contrast, public information needs increased again in the vaccine post-listing period, reaching a daily average of 32.53 items. This reflects that the availability of the vaccine, the government’s vigorous vaccination promotion, and the worsening global pandemic amplified public information needs for the COVID-19 vaccine.

To better explore the content features of public information needs of COVID-19 vaccine in each period, 15 keywords and their TF-IDF values were extracted from each period in this study. The results are displayed in Table 3. From the table, it can be seen that during the first outbreak period, the public
TABLE 2 Basic statistics of data on the public information needs of COVID-19 vaccine.
Basic feature Mean value Median value Mode Standard deviation Minimal value Maximum value
TABLE 3 Keywords of the public information need of COVID-19 vaccine in different periods.
FIGURE 4
Keyword co-word network.
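The TF-IDF weighting used for the keyword extraction reported in Table 3 can be sketched in a few lines. The example follows the definitions of term frequency, inverse document frequency, and their product given in the Methods, and reads "lg" as the base-10 logarithm. It is a Python illustration, not the authors' R code, and the function name `tf_idf` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf-idf score} dict per document.

    tf(t, d) = f(t, d) / sum_k f(w_k, d)
    idf(t)   = log10(|D| / number of documents containing t)
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(t for doc in docs for t in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = sum(counts.values())
        scores.append(
            {
                t: (c / total) * math.log10(n_docs / doc_freq[t])
                for t, c in counts.items()
            }
        )
    return scores


# Hypothetical, already segmented toy documents.
docs = [
    ["vaccine", "appointment", "appointment", "fees"],
    ["vaccine", "antibody"],
]
scores = tf_idf(docs)
top_keyword = max(scores[0], key=scores[0].get)
print(top_keyword, scores[0])
```

A term that occurs in every document, such as "vaccine" here, receives an IDF of zero and drops out of the ranking, which is why TF-IDF tends to surface period-specific keywords rather than the search term itself.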
Discussion

This study uses data mining to investigate the Chinese public information needs for the COVID-19 vaccine. We innovatively divided COVID-19 vaccine information needs into three periods: the first pandemic outbreak period (23.01.2020–08.04.2020), the vaccine pre-listing period (09.04.2020–31.12.2020), and the vaccine post-listing period (01.01.2021–15.07.2021). It was found that during the first pandemic outbreak period (23.01.2020–08.04.2020), the public information needs for the COVID-19 vaccine were high, mainly focusing on the vaccine safety, vaccination necessity, adverse effects of vaccines, and the vaccine potency of coping with the variation of novel coronavirus. With the increasing number of COVID-19 patients in China following the first infection with COVID-19 in Wuhan, Hubei Province, China, the public experienced varying degrees of pandemic panic (Wang et al., 2021). To dispel their panic, the public became increasingly concerned with the COVID-19 vaccine in the hope that the vaccine could alleviate this severe novel coronavirus pandemic. They actively expressed information needs related to COVID-19 vaccines on online Q&A communities and portals so that the public information needs of COVID-19 vaccine were at a high level during the outbreak. Chinese and American experts indicated that the first vaccine would be available for clinical use by August 2020. However, since no vaccine has been developed for SARS, the Chinese public was worried about the safety and adverse effects of a COVID-19 vaccine developed within a short time. The novel coronavirus is an RNA virus with a high variation rate (Esakandari et al., 2020). Many highly infectious and high viral load strains have emerged, such as Alpha, Gamma, and Delta (Tracking SARS-CoV-2 Variants, 2021). The public asked many questions about the coping capacity of COVID-19 vaccines for new coronavirus variations, implying that their information needs focused on the vaccine potency aspect. As the pandemic was gradually brought under control and the public gained confidence in the Chinese government’s ability to prevent and control the pandemic outbreak, questions related to the necessity for vaccination began to increase among the majority of the public who did not intend to go abroad.

In the pre-listing vaccine period (09.04.2020–31.12.2020), the public information needs regarding the COVID-19 vaccine mainly concentrated on vaccine listing, R&D, and its safety. Although the COVID-19 pandemic in Wuhan had been successfully controlled during this period, several small-scale aggregated outbreaks emerged in other Chinese provinces and cities (Beijing, Hebei, Heilongjiang, Jilin). This led to a great deal of renewed public interest in COVID-19 vaccine R&D and listing and to more questions about the COVID-19 vaccine on online Q&A communities and portals.

Main category | Subcategory | Description | Number of questions | Percentage
C1 Basic knowledge of the vaccine | C1.1 Vaccine mechanism | Asking for the preparation mechanism of COVID-19 vaccine. | 82 | 0.57%
 | C1.2 Vaccine effect | Asking how COVID-19 vaccine works. | 94 | 0.66%
 | C1.3 Difference in various COVID-19 vaccines | Asking for differences between different types of vaccines, such as adenovirus vector vaccines, inactivated vaccines, and recombinant protein vaccines. | 122 | 0.85%
 | Total | | 298 | 2.08%
C2 Vaccine R&D and listing | C2.1 Vaccine R&D process | Asking how COVID-19 vaccine R&D is going. | 1,016 | 7.11%
 | C2.2 Vaccine listing time | Asking when COVID-19 vaccine will go public. | 340 | 2.38%
 | Total | | 1,356 | 9.49%
C3 Vaccination | C3.1 Vaccination appointment | Asking for information on how, when, and where to make an appointment for COVID-19 vaccination. | 522 | 3.65%
 | C3.2 Vaccination fees | Asking for the cost of vaccination against novel coronavirus. | 371 | 2.60%
 | C3.3 Vaccine type | Asking what kind of vaccine will be administered. | 156 | 1.09%
 | C3.4 Vaccination population confirmation | Asking for the scope of vaccination population and if someone can receive COVID-19 vaccine when he or she has a past medical history, medication history, or physical discomfort symptoms. | 2,036 | 14.24%
 | C3.5 Preparation before vaccination | Asking what preparations must be made before vaccination. | 1,591 | 11.13%
 | C3.6 Cautions after vaccination | Asking what to look for after vaccination. | 2,183 | 15.27%
 | C3.7 Vaccination procedure | Asking what the procedure for COVID-19 vaccination is; asking for details of vaccination procedure. | 821 | 5.74%
 | Total | | 7,680 | 53.72%
C4 Vaccination necessity | | Asking for the necessity of vaccination against novel coronavirus. | 712 | 4.98%
C5 Vaccination effectiveness | | Asking about the effectiveness of COVID-19 vaccine and comparing the effectiveness of different types of COVID-19 vaccines. | 420 | 2.94%
C6 Vaccination side effects and countermeasures | C6.1 Side effects | Asking for any side effects or adverse reactions to the COVID-19 vaccine. | 694 | 4.85%
 | C6.2 Countermeasures | Asking what measures to take to deal with side effects or adverse reactions after vaccination. | 112 | 0.78%
 | Total | | 806 | 5.63%
C7 Vaccine patent exemption | | Asking about the patent exemption for COVID-19 vaccine. | 465 | 3.26%
C8 Evaluation and impact of other social events | C8.1 Evaluation of other social events | Asking for evaluating other social events regarding the COVID-19 vaccine (e.g., how do you rate the COVID-19 vaccine appointment at the University of Electronic Science and Technology in China?). | 2,084 | 14.58%
 | C8.2 Impact of other social events | Asking for the impact of other social events regarding COVID-19 vaccine (e.g., what would be the impact if China were to take the lead in developing the COVID-19 vaccine?). | 475 | 3.32%
 | Total | | 2,559 | 17.90%

FIGURE 5
The results of elbow method.

FIGURE 6
COVID-19 vaccine information needs of different social groups during the pandemic.

During the vaccine post-listing period (01.01.2021–15.07.2021), the public information needs of the COVID-19 vaccine focused on cautions before and after vaccination, evaluation of vaccine events, and adverse reactions. Due to individual differences, some people experience side effects of different degrees, such as local pain, rash, and dizziness. The public asked questions about this issue on online Q&A communities and portals to avoid side effects after vaccination. Furthermore, there were many social events related
to the COVID-19 vaccine (vaccine patent exemptions and the vaccination rate of the public in other countries) during this period. The public also asked questions about the evaluation of events, especially for events closely related to themselves. The public seeks information about the COVID-19 vaccine by asking questions and is eager to receive feedback to further their understanding. Therefore, online Q&A communities and portals should provide dynamic and tailored information services according to changes in public vaccine information needs at different times to enhance service quality. For example, it is a good choice for online Q&A communities and portals to invite medical experts to answer questions and dispel doubts, thereby meeting the current public information needs of the COVID-19 vaccine. We also constructed a general COVID-19 vaccine information need framework. It was found that the public information needs regarding vaccination were the most significant during the pandemic, accounting for more than half (53.72%) of all information needs. They included information on appointments, fees, vaccine type, population confirmation, preparation before vaccination, cautions after vaccination, and vaccination procedure. This reflects the public’s desire to access relevant knowledge to assist them in vaccination and minimize adverse reactions after vaccination. During the pandemic, the public information needs regarding the vaccine mechanism were the lowest. This indicated that the public was less concerned about the fundamental scientific principles of vaccines and how the vaccine works on the human body to acquire immunity.

By counting the information needs data of different groups, we found that the proportion of information needs regarding vaccination was high in all social groups. It was followed by relatively high information needs regarding vaccine R&D and listing among overseas students, airport workers, and healthcare workers, and relatively high information needs regarding vaccination necessity among teachers and domestic students. Overseas students, airport workers, and healthcare workers are susceptible groups that are easily exposed to COVID-19 and at a much higher risk of being infected than the general public. These groups are consequently more concerned about the COVID-19 vaccine R&D process and vaccine listing time, hoping to reduce their risk of infection and the chance of becoming seriously ill after vaccination (Lu et al., 2020). Therefore, the follow-up COVID-19 booster vaccination efforts should first be directed to these susceptible groups to ensure their safety at work. The relatively high need for information on vaccination necessity from teachers and domestic students reflects that with the successive local small COVID-19 pandemic outbreaks in China, the public is still at risk of contracting novel coronavirus. Therefore, government health departments should conduct vigorous promotion to improve the public’s recognition of the importance of vaccination and willingness to vaccinate.

This study can effectively help the government and stakeholders improve information service quality and make better health policies. At the beginning of the pandemic outbreak, the government and relevant organizations can grasp the public vaccine information needs in time through large-scale online forum data collection and analysis to get objective and accurate public opinion. It will enable them to formulate targeted health policies in a timely manner and solve the problems of greatest concern. For example, some fake news about mass adverse reactions to vaccination in China has sparked public concern, highlighting the need for public information about the COVID-19 vaccine’s side effects. Chinese health departments found such information in time and combated misinformation by disclosing the scientific evidence for very low adverse reactions. In the middle stage of the pandemic, the government cyberspace administration, think tanks, information centers, libraries, and other stakeholders should build coordination mechanisms to further analyze and track online public data on social media for mining public vaccine information needs. To address such needs, timely and targeted health information should be released and pushed to the public through press conferences, traditional media, government official websites, social media platforms, and other channels (Liu and Ding, 2022). As the epidemic worsens, the abovementioned organizations should work together to conduct a dynamic analysis of public data to track the evolution of the public’s vaccine information needs and adjust their health information service strategies and public health policies. For example, when the COVID-19 vaccine was licensed, considering the weak resistance of the elderly and infants, medical institutions did not provide vaccination for these groups. However, with the continuous improvement of vaccine safety and expansion of the scope of application, and more reports about infections among the elderly and infants, the vaccination information needs of such groups are increasing. Government departments and medical institutions have been aware of such emerging information needs and promptly informed these people to get vaccinated without delay, ensuring their health and safety.

Conclusion

This study investigated the topic and distribution features of the public COVID-19 vaccine information needs. In addition, the topic features of public COVID-19 vaccine information needs in the different periods were examined. Data from Chinese online Q&A communities and portals were analyzed using K-means clustering and the LDA topic model. As a result, a general COVID-19 vaccine information need framework was constructed, including 8 main categories and 16 subcategories. It has several implications. First, online communities and portals should provide dynamic and tailored information services to realize organizational value according to changes in public vaccine information needs at different time points (Zhang et al., 2012; Day and Montoya, 2019). Second, the public information need regarding vaccination is prominent and should be addressed first. Third, government health departments should
Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.zhihu.com/topic/19607469/hot.

Author contributions

LW: conceptualization, methodology, writing-review and editing, and funding acquisition. ZX: formal analysis, data curation, visualization, and writing-original draft. TD: writing-original draft. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022. doi: 10.1162/jmlr.2003.3.4-5.993
China Internet Network Information Center. (2021). The 48th Statistical Report on the Development of the Internet in China. Available online at: https://cit.buct.edu.cn/2021/0925/c7951a157922/page.htm
Day, R. E., and Montoya, R. D. (2019). What is (a) disease? Disease as events and access to information. Inform. Res. Int. Electron. J. 24, 36–44. Available online at: http://informationr.net/ir/24-4/colis/colis1920.html
Esakandari, H., Nabi-Afjadi, M., Fakkari-Afjadi, J., Farahmandian, N., Miresmaeili, S. M., Bahreini, E., et al. (2020). A comprehensive review of COVID-19 characteristics. Biol. Procedures Online 22, 1–10. doi: 10.1186/s12575-020-00128-2
Galvin, A. M., Garg, A., Moore, J. D., Litt, D. M., and Thompson, E. L. (2021). Quality over quantity: human papillomavirus vaccine information on social media and associations with adult and child vaccination. Human Vacc. Immunother. 17, 3587–3594. doi: 10.1080/21645515.2021.1932219
Hu, P. (2018). A Study on Construction and Application of Online Learner Aspect-Opinion Mining Model based on the Method of NLP [Doctoral dissertation, Central China Normal University]. Pudong: China National Knowledge Internet.
Huang, D. A., and Zhou, J. Y. (2020). Analysis of information needs for HPV vaccines among Zhihu community users. Cult. Commun. 9, 85–91.
Huang, K., Guo, M., He, X. J., and Li, L. (2020). An analysis of research on health information literacy during public health emergency and its implications. Libr. J. 39, 57–69. doi: 10.13663/j.cnki.lj.2020.07.007
Huang, Y. D., Lu, H. Y., Qiu, Z. Y., Qian, S. X., Li, J. Y., Lin, C. Q., et al. (2018). Analysis on the variation of varicella antibody and immunization strategy of varicella among school-age children in Nanhai District, Foshan City, Guangdong Province. J. Med. Pest Control 34, 1024–1027. doi: 10.7629/yxdwfz201811002
Jin, B. Y., and Xu, X. (2015). Research on theme features in online health community. Libr. Inf. Serv. 59, 100–105. doi: 10.13266/j.issn.0252-3116.2015.12.015
Kalaij, A. G. I., Sugiyanto, M., and Ilham, A. F. (2021). Factors associated with vaccination compliance in Southeast Asian children: a systematic review. Asia Pac. J. Public Health 33, 479–488. doi: 10.1177/10105395211014640
Kwon, Y., Cho, H. Y., Lee, Y. K., Bae, G. R., and Lee, S. G. (2010). Relationship between intention of novel influenza A (H1N1) vaccination and vaccination coverage rate. Vaccine 29, 161–165. doi: 10.1016/j.vaccine.2010.10.063
Li, X. L. (2019). Construction and Implementation of Knowledge Base of Maternal and Child Health Care [Master’s thesis, Hainan University]. Pudong: China National Knowledge Internet.
Liu, Y., and Ding, Z. (2022). Personalized recommendation model of electronic commerce in new media era based on semantic emotion analysis. Front. Psychol. 13, 952622. doi: 10.3389/fpsyg.2022.952622
Lu, Q., Zhu, A. Q., Zhang, J. Y., and Chen, J. (2019). Research on user information requirement in Chinese network health communities: Taking tumor-forum data of Qiuyi as an example. Data Anal. Knowl. Discov. 3, 22–32. doi: 10.11925/infotech.2096-3467.2018.1153
Lu, Y. Z., Li, T., Wang, Q. F., Liu, L., and Ni, S. G. (2020). Effect of resilience and expression suppression on the relationship between social support and posttraumatic growth among front-line medical workers in the epidemic situation of COVID-19. Chin. J. Clin. Psychol. 28, 743–746. doi: 10.16128/j.cnki.1005-3611.2020.04.019
Maire, A., Chapet, N., Laugier, M. L., Laffont-Lozes, P., Rigoni, M., Audurier, Y., and Castet-Nicolas, A. (2021). Pneumococcal and influenza vaccination coverage of heart failure patients: still a long way to go. Eur. Heart J. 42, ehab724-0968. doi: 10.1093/eurheartj/ehab724.0979
Mi, G. W., Xian, Z. Q., Wang, L., and Lu, D. S. (2021). Public psychological health information needs during the COVID-19 pandemic–Take the social Q&A platform “Zhihu” as an example. Modern Inf. 41, 108–117. doi: 10.3969/j.issn.1008-0821.2021.06.010
National Health Commission (2021). The Number of People Who Completed the Full Course of Vaccination with the COVID-19 Vaccine Exceeded 770 Million. Available online at: http://www.199it.com/archives/1295404.html (accessed on August 13, 2021).
Nikula, A. E., Rapola, S. P., Hupli, M. I., and Leino-Kilpi, H. T. (2009). Factors strengthening and weakening vaccination competence. Int. J. Nurs. Pract. 15, 444–454. doi: 10.1111/j.1440-172X.2009.01781.x
Oh, S., Zhang, Y., and Park, M. S. (2016). Cancer information seeking in social question and answer services: Identifying health-related topics in cancer questions on Yahoo! Answers. Inf. Res. 21, 27–34. Available online at: http://informationr.net/ir/21-3/paper718.html#.YzRa2xpBxPY
Özbayir, T., Malak, A. T., Bektas, M., Ilce, A. O., and Celik, G. O. (2011). Information needs of patients with meningiomas. Asian Pac. J. Cancer Prevent. 12, 439–441. Available online at: http://journal.waocp.org/article_25539_fa26f627ebfaf48673c1600378986a0a.pdf
Price, L., and Robinson, L. (2021). Tag analysis as a tool for investigating information behaviour: comparing fan-tagging on Tumblr, Archive of Our Own and Etsy. J. Document. 77, 320–358. doi: 10.1108/JD-05-2020-0089
Rutten, L., Arora, N. K., Bakos, A. D., Aziz, N., and Rowland, J. (2005). Information needs and sources of information among cancer patients: a systematic review of research (1980–2003). Patient Educ. Counsel. 57, 250–261. doi: 10.1016/j.pec.2004.06.006
Tang, X. B., and Li, J. (2019). Analysis on the topic and sentiment of information needs in online health community. Digital Library Forum 2, 12–17. doi: 10.3772/j.issn.1673-2286.2019.02.002
The Novel Coronavirus Pandemic: The Extent of Vaccination Process Around the World and in Your Region (2021). Available online at: http://www.bbc.com/zhongwen/simp/science-56084055
Thelwall, M. (2021). Can Twitter give insights into international differences in Covid-19 vaccination? Eight countries’ English tweets to 21 March 2021. Professional De La Informacion 30, 11–18. doi: 10.3145/epi.2021.may.11
Tracking SARS-CoV-2 Variants (2021). Available online at: http://www.who.int/activities/tracking-SARS-CoV-2-variants/tracking-SARS-CoV-2-variants
Wang, X., Xiao, C. Q., and Zhu, H. (2021). Impact of panic from coronavirus disease on Chinese coping style: The moderation of perceived risk. Chin. J. Health Psychol. 29, 1445–1449. doi: 10.13342/j.cnki.cjhp.2021.10.002
Williamson, K., Qayyum, A., Hider, P., and Liu, Y. H. (2012). Young adults and everyday-life information: The role of news media. Libr. Inf. Sci. Res. 34, 258–264. doi: 10.1016/j.lisr.2012.05.001
Wu, D., and Liu, Z. J. (2018). Research on the application and the trend of intelligent information services from big data perspective. J. Inf. Resour. Manage. 23, 28–39. doi: 10.13365/j.jirm.2018.02.028
Wu, H. D. (2013). Development of Psychological Stress Rating Scale for Clinicians [Master’s thesis, Ludong University]. Pudong: China National Knowledge Internet.
Xu, S. T. (2013). Ontology Based Knowledge Representation [Master’s thesis, Xiangtan University]. Pudong: China National Knowledge Internet.
Yang, S. C. (2013). Research on Question Classification for Chinese Question Answering Systems [Doctoral dissertation, Nanjing University]. Pudong: China National Knowledge Internet.
Zhang, L. J., Xie, J. B., Yang, T., and Xiao, G. (2019). R Language and Data Mining. Beijing, China: Mechanical Industry Press. Available online at: http://informationr.net/ir/17-2/paper515.html
Zhang, X., Majid, S., and Foo, S. (2012). Perceived environmental uncertainty, information literacy and environmental scanning: towards a refined framework. Inform. Res. Int. Electron. J. 17, 86–98.
Zhao, J., and Li, B. (2022). Regional private financing risk index model based on private financing big data. Front. Psychol. 13, 874412. doi: 10.3389/fpsyg.2022.874412
Zhou, L. Y. (2020). The impact of negative information on medical risk communication: Taking HPV vaccination as an example. Stat. Manage. 35, 94–98.
Zhu, L. L. (2020). Research on Information Quality Perception of Online Reviews [Doctoral dissertation, Jilin University]. Pudong: China National Knowledge Internet.
Zong, C. Q., Xia, R., and Zhang, J. J. (2019). Text Data Mining. Beijing, China: Tsinghua University Press. doi: 10.16722/j.issn.1674-537x.2020.09.018