Professional Documents
Culture Documents
https://doi.org/10.1007/s13278-021-00779-3
ORIGINAL ARTICLE
Received: 1 November 2020 / Revised: 5 July 2021 / Accepted: 20 July 2021 / Published online: 21 August 2021
© The Author(s) 2021
Abstract
Social media allow to fulfill perceived social needs such as connecting with friends or other individuals with similar inter-
ests into virtual communities; they have also become essential as news sources, microblogging platforms, in particular, in a
variety of contexts including that of health. However, due to the homophily property and selective exposure to information,
social media have the tendency to create distinct groups of individuals whose ideas are highly polarized around certain top-
ics. In these groups, a.k.a. echo chambers, people only "hear their own voice,” and divergent visions are no longer taken into
account. This article focuses on the study of the echo chamber phenomenon in the context of the COVID-19 pandemic, by
considering both the relationships connecting individuals and semantic aspects related to the content they share over Twit-
ter. To this aim, we propose an approach based on the application of a community detection strategy to distinct topology-
and content-aware representations of the COVID-19 conversation graph. Then, we assess and analyze the controversy and
homogeneity among the different polarized groups obtained. The evaluations of the approach are carried out on a dataset of
tweets related to COVID-19 collected between January and March 2020.
Keywords Echo chambers · Social media · Social network analysis · Community detection · Sentiment analysis · Topic
modeling · COVID-19
13
Vol.:(0123456789)
78
Page 2 of 17 Social Network Analysis and Mining (2021) 11:78
13
Social Network Analysis and Mining (2021) 11:78 Page 3 of 17 78
Sasahara et al. (2019) have also studied the conditions for the identification of echo chambers the one constituted
in which echo chambers emerge. The proposed model’s by the following three phases:
dynamics indicate that the online community rapidly evolves
into isolated, homogeneous groups, even with small amount (1) Modeling the conversation graph (i.e., the graph of
of influence (related to the concept of involvement previ- interactions among users around a topic of interest);
ously discussed), and unfriending. Moreover, their study (2) Partitioning the graph into two groups by selecting/
shows that social media debates tend to polarize individuals defining a suitable strategy;
in exactly two opinion groups. They also find empirical evi- (3) Quantifying the polarization of the members of the
dence that, in many cases, the presence of users with many groups obtained in the previous step by assessing con-
followers (hub nodes) affects the dissemination of the same troversy/homogeneity.
messages. Furthermore, their study suggests that triadic clo-
sure connects individuals to friends of their friends, facili- 2.2.1 Modeling the conversation graph
tating repeated exposure to the same opinion. Such echoes
are a powerful reinforcing mechanism for the adoption of Online communities are complex networks whose members
beliefs and behaviors. and relationships are usually represented and analyzed by
Recently, in the Twitter scenario (the same considered in means of graph theory and Social Network Analysis (SNA)
this article), Baumann et al. (2020) have proposed a model techniques (Knoke and Yang 2019). From a formal point of
introducing the dynamics of radicalization as a reinforc- view, a graph G = (V, E) consists of a pair of sets: a set V,
ing mechanism driving the evolution to extreme opinions which is called the set of vertices, i.e., V = {v1 , … , vn } , and
from moderate initial conditions. The outcome of the work a set E, which is called the set of edges. E is a subset of the
illustrates that the transition between a global consensus Cartesian product V × V , i.e., it is a set of pairs ej = (vk , vl ) ,
and emerging radicalized states is mostly governed by with j = 1, … , m , and vk , vl ∈ V .
social influence and by the “controversialness” of the topic In general, in the graph-based representation of a virtual
discussed. community, the vertices represent the members of the com-
munity, and the edges possible social interactions between
them. Different social media platforms can lead to the gen-
2.2 Echo chamber detection eration of many different kinds of relationships among their
users (Carminati et al. 2012), e.g., a friendship in Facebook,
Often, a virtual community is formed around a specific a followee/follower relationship in Twitter, a comment to
topic of interest. This is especially true in those social media someone’s post in Instagram, etc.; this happens also within
which are mainly focused on the exchange of textual content the same social media platform, e.g., the followee/follower,
and the generation of the so-called conversational threads the retweet, and the mention relationships in Twitter, which
(Aragón et al. 2017). In this scenario, there are specific will be better detailed in Sect. 3.1.1.
topics that lend more than others to the formation of echo As previously illustrated, recent works investigating the
chambers (e.g., politics, health, religion, etc.) (Baumann echo chamber phenomenon mainly focus on those social
et al. 2020), which are usually polarized around two main platforms, such as Twitter, and social relationships that allow
points of view (Sasahara et al. 2019). Here, the concepts of the construction of conversation graphs. In this context, for
controversy (Garimella et al. 2018b) and homogeneity (Quat- example, the work by Garimella et al. (2018b) proposes to
trociocchi et al. 2016; Sasahara et al. 2019) become funda- model the three conversation graphs built around two con-
mental for the recognition of echo chambers, interpreted as troversial (i.e., #Ukraine and #BeefBan) and one non-
polarized groups of individuals. The higher the controversy controversial (i.e., #NationalKissingDay) hashtags,
between members of different groups and the homogeneity collected on Twitter, via an undirected graph representation
between members of the same group, the higher the prob- where vertices are Twitter users and edges represents both
ability to be in the presence of echo chambers. a retweet or a followee/follower relationship between them.
Recent works in the literature whose aim is to identify Also in the work by Coletto et al. (2017), different graphs
binary polarization (i.e., two strongly polarized groups are constructed with respect to different controversial and
of individuals) in virtual communities mostly rely on the non-controversial topics on Twitter. In this case, for each
quantification of controversy/homogeneity after perform- tweet collected with respect to a topic, the discussion thread
ing graph partitioning on the graph-based representation is reconstructed by performing a crawling operation on the
of social interactions around a given topic, also known as replies to the initial tweet. In this case, a graph relating to
conversation graph. the followee/follower relationship between two users is con-
Therefore, referring in particular to the work of Garimella structed, as well as a graph relating to the fact that a tweet is
et al. (2018b), in this paper we present as a possible scenario the reply to another tweet; hence, by using these two graphs,
13
78
Page 4 of 17 Social Network Analysis and Mining (2021) 11:78
a further graph is generated, in which an edge between two representations of conversation graphs (Yuan and Crooks
users represents the reply of a user to another user’s tweet. 2018). In this work, on the contrary, our aim is to consider
Also in the work by Kumar et al. (2018), the community these aspects together in the graph modeling and partition-
is modeled through the use of a reply tree which involves, ing phases.
given a tweet, all the related interactions.
2.2.3 Quantifying polarization
2.2.2 Partitioning the graph
The last echo chamber detection phase consists in quantify-
Recent works of the literature addressing the problem of ing the polarization of the identified partitions, by assessing
echo chamber detection over a conversation graph have pro- controversy and/or homogeneity within them. This activity
ceeded to use community detection algorithms for partition- serves to verify the effective presence of echo chambers in
ing the graph into two distinct groups, a.k.a. communities. the virtual community.
As stated in (Papadopoulos et al. 2012): In the work by Coletto et al. (2017), structural aspects of
the network are mainly taken into account to assess polariza-
Community detection constitutes a significant tool
tion (e.g., the number of vertices, edges, and degree distribu-
for the analysis of complex networks by enabling the
tion), as well as propagation information (e.g., the reply tree
study of mesoscopic structures [...] often associated
depth), temporal information (e.g., the time elapsed between
with organizational and functional characteristics of
one reply and the next one), and local information (e.g., how
the underlying networks.
many replying users are connected by a followee/follower
The first literature works that have addressed the problem of relationship). Del Vicario et al. (2016) focus mainly on the
community detection in complex networks (regardless of the concept of homogeneity in diffusing specific kinds of con-
echo chamber phenomenon) have often focused on methods tents, while both structural and content aspects to assess
that analyze only the topological structure of the network controversy and homogeneity are discussed by Sasahara
(Fortunato and Hric 2016). In these approaches, usually the et al. (2019). However, one of the most complete work from
number of intra-group connections compared to the number this point of view is the one proposed by Garimella et al.
of extra-group connections is considered in some way. (2018b), which illustrates specific metrics for evaluating
However, considering only topological aspects is prob- the level of controversy between user groups, which will be
lematic mainly for two reasons, particularly in online com- detailed in Sect. 3.3.
munities: (i) the graph-based modeling of the community
depends on the type of relationships that are taken into
account, as previously illustrated; (ii) the content associated 3 A topology‑ and content‑based approach
with social interactions is ignored, not allowing to “enrich” for echo chamber detection and analysis
the topological representation of the conversation graph with
semantic information. As stated by Natarajan et al. (2013), The approach proposed in this work and detailed in this sec-
a link between two users increases the chances that the two tion, aimed at detecting and analyzing echo chambers, is
share common interests, but does not necessarily imply it. framed in the context of the COVID-19 pandemic discus-
Furthermore, two users who do not share a link might still sions on the Twitter microblogging platform at the begin-
have common interests. ning of 2020 (roughly in the first months of its diffusion).
Therefore, for some years now, the works that deal with First, the modeling of the conversation graph built on
community detection have been trying to consider some the available COVID-19 data is illustrated, leading to four
semantic information such as the sentiment linked to the distinct graph representations based on both topological and
exchanged content and topics of interest (within the more semantic aspects; then, the chosen partitioning algorithm
general topic around which the community is generated) that and its application to the four above-mentioned represen-
the community members discuss. This is the case, among tations of the conversation graph are detailed; finally, the
others, of the approaches proposed by Pathak et al. (2008), measures and other qualitative methods employed to assess
Sachan et al. (2012), Natarajan et al. (2013), Zhang et al. the level of controversy and homogeneity with respect to the
(2015), Sawhney et al. (2017). groups identified by the partitioning algorithm (under the
As for the community detection algorithms employed four representations) are presented.
in the identification of echo chambers by recent literature
approaches, they are mostly topology-based, not taking suf- 3.1 Modeling the conversation graph
ficiently into account specific semantic aspects (Garimella
et al. 2018a, b; Dokuka et al. 2018; Cossard et al. 2020), or The modeling phase of a conversation graph is strongly data-
they are applied separately to topology- and content-based set-dependent; for this reason, we first introduce the data that
13
Social Network Analysis and Mining (2021) 11:78 Page 5 of 17 78
1
https://help.twitter.com/en/using-twitter/types-of-tweets.
2
This choice has been undertaken to better concentrate on a single 3
semantic aspect of the problem and better identify the echo cham- The size of this graph is comparable to those built on several data-
ber phenomenon. However, this choice does not limit the proposed sets used for the study of the controversy in conversation graphs, as
approach to the use of other types of relations (even together), as dis- illustrated in (Garimella et al. 2018b).
4
cussed in Sect. 5. https://gephi.org/.
13
78
Page 6 of 17 Social Network Analysis and Mining (2021) 11:78
3.1.3 Enriching the graph with semantic information score with respect to users with a “mixed” or an opposite
sentiment.
As briefly illustrated in Sect. 2.2.2, in the research area of Finally, the sentiment similarity score is employed to
community detection algorithms, various approaches have compute, between the considered couples of users, a new
tried to take into account distinct semantic aspects related to weight wss replacing the wt weight of the topology-based
content. By considering these approaches, in this work we modeling, according to the following equation:
propose to “enrich” the topology-based representation of the
conversation graph with this semantic information, acting in
wss (vi , vj ) = 1 + ss(vi , vj ). (3)
particular on the weights of the edges connecting vertices. According to this formulation, in the event of a completely
The idea is to replace the original wt weights on the edges in discordant sentiment between two users vi and vj , i.e.,
the topology-based representation by new weights that are ss(vi , vj ) = 0 , the weight of the edge will be equal to 1, there-
designed to take into account content-related semantic aspects fore only the topological component will be considered (the
in addition to topological ones. In particular, taking inspira- presence of an explicit link between users, regardless of the
tion from (Sawhney et al. 2017), we consider: (i) the sentiment number of mentions between them). Conversely, if they have
of the tweets, and (ii) the topics discussed within the tweets a similar sentiment, there will be an increase in the weight of
(for example, in the context of the general topic of COVID-19, the edge that connects them, even if the number of mentions
the topics related to negationism and conspiracy theories, to between them is low. The rationale behind this sentiment-
the need of maintaining social distancing, to the usefulness based representation is that we do not want to completely
of wearing masks, etc.); furthermore, (iii) we also propose to ignore the topological component in the community detec-
consider both aspects together in a hybrid fashion. Such three tion phase, but we want to emphasize that similar groups of
semantic-based modelings are detailed in the following. users should be identified also by considering the similarity
among their sentiment.
Sentiment-based modeling. In this modeling, a sentiment In this work, the sentiment of the tweets has been assessed
score to be associated with each user based on their user- by employing VADER (Valence Aware Dictionary and
generated tweets is first obtained; then, such score is used to sEntiment Reasoner) (Hutto and Gilbert 2014), a domain-
compute a new weight on each edge between users already free, lexicon-based model, which is particularly suitable for
connected in the topology-based modeling of the graph. the social and microblogging context like that of Twitter
Formally, let us consider a vertex v and the sentiment (Elbagir and Yang 2020). VADER does not require training,
scores x1 , … , xt associated with its t tweets computed in a and has defined rules for evaluating emojis, slang, upper-
discrete interval of integers [−𝛼, +𝛼],5 according to a given case words, and grade modifiers, i.e., words such as very,
sentiment analysis technique; a user sentiment score s(v) rather, fairly, and quite that impact the sentiment intensity
associated with v is computed as: by either increasing or decreasing it (e.g., “I am very scared
t of the pandemic” is more intense than “I am scared of the
1 ∑
s(v) = ⋅ x. (1) pandemic”). When a string needs to be evaluated, VADER
t j=1 j
returns a dictionary of scores in four distinct categories, i.e.,
(i) positive, (ii) negative, (iii) neutral, and (iv) compound.
Once each user has been assigned a sentiment score s(v),
For the first three categories, a value between 0 and 1 is
which, according to its formulation, may vary again between
generated, which represents the proportion of text that falls
the range values +𝛼 and −𝛼 (included), it is possible to
into each category. The compound score is the aggregation
compute a sentiment similarity score, denoted as ss(vi , vj ) ,
of the scores associated with the previous three categories,
between any couple of users vi and vj , already connected by
normalized in an interval ranging from −1 (extremely nega-
an edge in the topology-based modeling, as follows:
tive) to +1 (extremely positive). In this work, it was decided
ss(vi , vj ) = 2𝛼 − |s(vi ) − s(vj )|. (2) to use such compound metric, which provides a one-dimen-
sional evaluation of sentiment within a text.6 Considering
This way, a value from 0 to 2𝛼 is attributed to such couples the compound score, a text is defined as neutral if its value
of users, depending on their sentiment similarity; users with falls within the interval [−0.05, 0.05].
a similar sentiment will have a higher sentiment similarity To be employed in the sentiment-based modeling described
above, it has been necessary to scale the compound scores
5
Considering a discrete interval of values is functional to the fact produced by VADER with respect to the discrete interval
that in the topology-based undirected and weighted representation of
the conversation graph, the weights on the edges are integer values [−𝛼, +𝛼] previously mentioned. In this work, the value of
representing the number of mentions among users’ tweets, and to the
usage of the community detection algorithm that will be described in
6
the next section. https://github.com/cjhutto/vaderSentiment.
13
Social Network Analysis and Mining (2021) 11:78 Page 7 of 17 78
𝛼 has been chosen heuristically, based on some preliminary of tokens by considering the specificity of short texts such as
experiments, and its value has been set to 30. Such scaling was tweets. Then, we eliminated all those tokens that appeared in
based on a simple linear conversion, formally: less than 2% and in more than 50% of the documents, consider-
ing, at the end, the first 100,000 tokens given their frequency.9
(x − oldMin) ⋅ newRange
scaling(x) = + newMin. (4) After a phase of stop-words removal and stemming, the Gen-
oldRange sim library (Rehurek and Sojka 2010) was used to implement
Specifically, scaling(x) represents the scaled value of the the LDA MALLET model. Given the considered vocabulary,
original compound score x; oldRange = (oldMax − oldMin) , the LDA model has been trained on a number of topics rang-
where oldMin and oldMax are the minimum and maximum ing from 2 to 30, to identify the best number of topics to fit the
compound scores from the old distribution; newRange = model (Sbalchiero and Eder 2020). For each number of top-
(newMax − newMin) , where newMin and newMax corre- ics in the range, the topic coherence metric was evaluated to
spond to −𝛼 and +𝛼 , respectively. assess the quality of the obtained topics, and the corresponding
trained model saved. Specifically, topic coherence has been
Topic-based modeling. In this graph modeling, to each computed as the CV measure discussed in (Röder et al. 2015)
user v in the graph, is associated a set of topics denoted and implemented in the models.coherencemodel pipe-
as T(v), i.e., the subset of topics discussed by v within the line in the Gensim library.10 The quality of the topics was
global set of topics, namely T, discussed by all users (in our also evaluated by human assessors; such double evaluation of
work, extracted from the COVID-19 conversation graph, see topics (more details regarding these evaluation aspects will be
Sect. 4.3.2). The topic similarity ts(vi , vj ) between any cou- illustrated in Sect. 4.3.2), have been necessary given the fact
ple of users vi and vj , already connected by an edge in the that a good topic selection phase is strictly related to the cor-
topology-based representation of the graph, is computed by rect semantic enrichment of the conversational graph.11
considering the overlap of their topics as follows. Formally:
Hybrid modeling. Having available the similarity values
ts(vi , vj ) = |T| − |T(vi )ΔT(vj )| (5) between users with respect to the sentiment and topics dis-
cussed in their tweets, it was decided to develop a hybrid
where Δ denotes the symmetric difference between the two modeling of the conversation graph that took both aspects
sets T(vi ) and T(vj ) . In this way, we assign a topic similarity into consideration at the same time. The objective behind this
value that is maximal, and equal to |T|, for users with a total modeling is to “distance” two users who, speaking in general
overlap of topics discussed, and it is minimal (i.e., equal to of the same topics, show a different sentiment, and, vice versa,
0) when |T(vi ) ∪ T(vj )| = |T| and |T(vi ) ∩ T(vj )| = 0 . Such to “approach” users discussing the same topics and having a
formulation favors the decrease in topic similarity among similar sentiment. Hence, the hybrid modeling has seen the
users as the number of different topics they discuss increases, definition of a further new weight wh , which combines the two
while it considers equally similar users who discuss exactly previously defined similarity measures. Formally:
the same topics, whether they are more or less numerous.7
As in the case of the sentiment-based modeling of the wh (vi , vj ) = 1 + ss(vi , vj ) + ts(vi , vj ). (7)
graph, the new weight wts replacing wt in the topology-based
representation on the edge connecting vi and vj is expressed Again, with this simple weight modeling, if vi and vj are
as: totally dissimilar from the point of view of the sentiment
expressed by their tweets and the topics they treat over the
wts (vi , vj ) = 1 + ts(vi , vj ). (6) global set of topics (i.e., ss(vi , vj ) = 0 and ts(vi , vj ) = 0 ), the
wh weight (which in this graph representation replaces the
According to this formulation, in the borderline case in wt weight in the topology-based modeling) will be equal to
which ts(vi , vj ) = 0 , a weight equal to 1 will be assigned to 1, which indicates the presence of a topological link. The
the edge, which indicates the presence of a topological link.
To associate topics with users, an LDA (Latent Dirichlet
Allocation) model developed and maintained by the MALLET 8
https://www.nltk.org/api/nltk.tokenize.html.
(MAchine Learning for LanguagE Toolkit) project by McCal- 9
According to Luhn (1958), terms with medium frequencies are
lum (2002) has been employed. For each tweet belonging to more important concerning their significance than terms that have
the users in the conversation graph, a tokenization phase has low or high frequencies; according to Salton et al. (1983), it is neces-
sary to mitigate the effect of such terms to be significant in relevance
been performed by using the TweetTokenizer offered by NLTK
determination.
(Bird Steven and Klein 2009);8 this allows a better definition 10
https://radimrehurek.com/gensim/models/coherencemodel.html.
11
However, it may be useful to consider and test more than one
7
Different solutions based on increasing similarity as common top- configuration of topics in the future refinement of the approach. The
ics increase can be investigated in future research. same can be said for the token selection phase.
13
78
Page 8 of 17 Social Network Analysis and Mining (2021) 11:78
3.2 Partitioning the graph
13
Social Network Analysis and Mining (2021) 11:78 Page 9 of 17 78
to a certain community (again, selected at random), can be of length equal to the twice average shortest path of the
exposed to the content expressed by an authoritative node graph, no authoritative node is reached, the random selec-
of the opposite community. It is assumed that the degree tion of another node is performed. The number of nodes
centrality of a vertex can be used as an index of its authority, randomly selected for both partitions is equivalent, around
and that a random walk ends when reaching an authorita- the 60% of the nodes belonging to each partition; a node is
tive node (regardless of community affiliation). The third considered as authoritative if its degree is positioned in the
measure is also based on the random walk concept, but this top-15% given the community to which it belongs; partitions
time considering the number of community changes of a are selected at random (each with probability 0.5).
node during a random walk of fixed length. The last measure Displacement Random Walk Controversy. This metric,
presented is based on the so-called boundary connectivity proposed in this work, and denoted as Displacement Random
concept, which measures the degree of connection between Walk Controversy (DRWC), aims at considering the ratio
so-called internal community and boundary vertices. Details between the number of steps during a fixed-length random
are provided below. walk leading to a change of community, and the total length
Random Walk Controversy. This measure, defined by of the walk. Formally:
Garimella et al. (2018b), considers two partitions X and Y of ∑ � � ��
n(v)cc
the graph G = (V, E) (such that X ∪ Y = V , and X ∩ Y = � ), ∀v∈N 1 − lrw (10)
and two random walks, one ending in partition X and the DRWC =
�N�
other ending in partition Y. The Random Walk Controversy
(RWC) measure is defined as the difference of the probabili- where N is the set of randomly selected vertices to be con-
ties of two events: (i) both random walks start and end up in sidered in computing the measure, lrw is the length of the
the same partition, and (ii) both random walks start from a random walk (the number of edges in the walk), and n(v)cc
partition and end up in the other one. Formally: is the number of steps, in the random walk of v, where the
node has changed community. The value of this measure
RWC = PXX PYY − PXY PYX (8)
ranges in the [0, 1] interval. If a node, during its walk, has
where PAB with A, B ∈ {X, Y} is a conditional probability never changed community, it means that it is closely con-
defined as follows: nected to its own community, and, therefore, there is contro-
versy between the two communities. If, on the other hand,
PAB = P[starts in A|ends in B]. (9) it crosses the two communities many times, it means that
they do not present a high degree of controversy between
The value of this metric ranges between 0 and 1. The closer
them. Therefore, higher values of this measure correspond
it is to 0, the more likely it is to switch to the other partition
to higher controversy between communities and vice versa.
(no controversy); the closer it is to 1, the more likely it is to
In this work, for each partition, a number corresponding
stay in the original partition (presence of controversy).
to about the 60% of the vertices have been randomly selected
Authoritative Random Walk Controversy. This measure,
to compute the metric; the length of the random walk has
proposed in this work, constitutes a slightly modification
been defined as twice the average shortest path in the graph.
of the RWC measure; it is denoted as Authoritative Ran-
Boundary Connectivity. This metric, also employed in
dom Walk Controversy (ARWC). This name derives from
(Garimella et al. 2018b) to evaluate controversy between
the fact that, if in RWC the selection of starting nodes was
communities, is taken from (Guerra et al. 2013); the measure
completely random between the users of the two different
is based on the concepts of internal and boundary vertices.
partitions, in ARWC we start only from the nodes defined
Given a graph G, let us consider u ∈ X as a vertex in parti-
as authoritative. In a completely similar way to what has
tion X; u belongs to the boundary of X if and only if it is con-
already been seen for RWC, the random walk ends once a
nected to at least one vertex in partition Y and to at least one
vertex that is part of the set of authoritative nodes of one or
vertex in partition X that is not connected to any vertex in
the other partition is reached. In this way, we aim at quan-
partition Y. The set of boundary vertices is therefore defined
tifying how much the authoritative nodes of a partition are
as B = BX ∪ BY ; conversely, the set IX = X − BX is the set
exposed to similar individuals, but belonging to the opposite
of internal vertices of partition X (in a completely similar
partition. The hypothesis behind the definition of ARWC is
way we define the set IY for partition Y). The set of inter-
that, if an authoritative node is reached by an authoritative
nal vertices is therefore defined as the set I = IX ∪ IY . If the
node of the other community, it can then more easily influ-
two partitions would constitute echo chambers, the whole B
ence also the non-authoritative nodes of its own community,
should be made up of vertices that are more strongly con-
thus reducing controversy.
nected with the elements of I rather than with elements of B.
In this work, for both RWC and ARWC, a “restart”
The following equation, representing Boundary Connec-
mechanism has been implemented: if after a random walk
tivity (BC), formalizes the concept just expressed:
13
78
Page 10 of 17 Social Network Analysis and Mining (2021) 11:78
[ ]
1 ∑ di (u)
BC = − 0.5 (11)
|B| u∈B db (u) + di (u)
13
Social Network Analysis and Mining (2021) 11:78 Page 11 of 17 78
Table 2 Results of the measures considered to evaluate the controversy between the communities identified by the community detection algo-
rithm on the four representations of the conversation graph
Type Mod. Cov. RWC ARWC DRWC BC
4.3 Homogeneity results This section provides an analysis of the topics that have been
extracted by means of the topic modeling task presented in
This section is dedicated to the presentation of some Sect. 3.1.3. As previously mentioned, we had to select the
qualitative analyzes that have been carried out on the two LDA model that best performed with respect to the num-
ber of topics to be extracted. The optimal results have been
obtained with the choice of considering 25 topics. In fact,
14
Computed as the Louvain modularity detailed in (Newman and with this number of topics, we obtain one of the highest
Girvan 2004), and as detailed in (Fortunato 2010) for coverage. topic coherence scores, as shown in Fig. 5.
13
78
Page 12 of 17 Social Network Analysis and Mining (2021) 11:78
Fig. 4 Intra-community senti-
ment distribution given the
different representations used.
On the x-axis the sentiment
scores, while on the y-axis their
probability
The choice of 25 topics was also confirmed by an analysis seems to talk about the history of the doctor who first tried to
carried out by human assessors, based on which it seemed alert the authorities about the presence of a new virus; top-
that this number produced the best results in recognizing sig- ics 4 and 15 seem to encompass predictions and fears about
nificant keywords related to the topics considered. Hence, in the impact of the virus on the economy; topic 6 deals with
Table 3 we report, for each topic defined, the most probable, the possible ways of treating this disease; topic 13 concerns
i.e., the top-10, (stemmed) keywords that appear in it. As it Italy and the preventive measures adopted (e.g., lockdown,
can be observed from the table, the topics defined, in prin- closure of schools and universities); topic 19 is about the
ciple, tend to be characterized by keywords clearly related cruise ship case Diamond Princess; topic 21 seems to speak
to specific COVID-19 events, people, lifestyles, etc.: topic 1 of new hygiene habits; topic 22 has political connotations.
13
Social Network Analysis and Mining (2021) 11:78 Page 13 of 17 78
From the analysis of the topics discussed within the two information community; words like research, health, sci-
different communities, under the four graph representa- ence, healthcare, public health, and journalist, can actu-
tions, no significant differences were found with respect to ally be referred to this area and, more generally, to an
the number of specific topics dealt with more by one or the informed discussion on the COVID-19. On the other hand,
other community. Most likely what changes is the feeling Fig. 6b, referring to Community B, contains words that
linked to the discussion with respect to individual topics. clearly recall a political orientation aimed at supporting
Therefore, to get an idea of whether there are substantial the candidate Donald John Trump in the 2020 US presi-
differences with respect to specific themes, we have made a dential election; acronyms like MAGA (Make America
further qualitative assessment, described in the next section. Great Again) or KAG (Keep America Great), recall the
candidate’s election spots. Furthermore, words like con-
4.3.3 Wordclouds servative, patriot, christian, and God recall certain core
values on which the Republican Party in the United States
In this qualitative analysis, we have considered the is based.
descriptions associated with the Twitter accounts of the Given these wordclouds, and taking into considera-
users belonging to the two communities. For each account tion the graph illustrated in Fig. 4b, about the distribu-
description, the most important keywords employed by tion of average user sentiment given the sentiment-based
the user to define their beliefs and thoughts have been representation of the COVID-19 conversation graph, it
extracted, and illustrated by means of wordclouds. Fig- can be concluded that Community B, which seems more
ure 6 presents these wordclouds related to members of politically oriented, presents a much more negative senti-
Community A and Community B under the sentiment- ment than Community A, which seems more scientifically
based graph representation, i.e., the most effective in cap- oriented.
turing controversy (Sect. 4.2).
Figure 6a, referring to Community A, appears to be
endowed with words recalling the scientific and the
13
78
Page 14 of 17 Social Network Analysis and Mining (2021) 11:78
4.3.4 Verified VS unverified accounts In this article, we have addressed the problem of detecting
and analyzing echo chambers in social media, in particu-
Another interesting qualitative evaluation related to homo- lar in the Twitter microblogging platform, relating to the
geneity, concerns the possible contrast regarding the reli- first phase of the spread of the 2019 Coronavirus disease,
ability of the information exchanged on the conversation between January and March 2020.
graph. One possibility to evaluate such reliability concerns
the identification of the number of users, for each commu-
nity, with a verified account. In Twitter, in fact, a blue badge 15
https://help.twitter.com/en/managing-your-account/twitter-verif
ied-accounts
13
Social Network Analysis and Mining (2021) 11:78 Page 15 of 17 78
Specifically, the conversation graph built around the Additional investigations could be related to graph
tweets discussing the COVID-19 pandemic has been modeling and community detection algorithms. In this
modeled, by considering both topology- and content- work, we focused on the use of mentions to build the
based aspects. In this way, we have obtained four different topology-based graph; taking into consideration different
representations of the graph, one based only on explicit types of relationships is certainly an aspect to be further
relationships (i.e., mentions) between tweets, and the oth- deepened. In addition to this, the virtual community has
ers considering also the sentiment of tweets, the topics been modeled as an undirected graph, as performed in
discussed, and both aspects together. Then, on these con- other works in the literature. This is a commonly adopted
versation graph modelings, we have applied a well-known solution because effective community detection algorithms
community detection algorithm (i.e., METIS) to partition are mostly developed for this type of representation. It
the graph in two distinct communities. To verify that the would be worthy of investigation the study of the problem
obtain communities were indeed echo chambers, they have even in the presence of an oriented representation of the
been evaluated: (i) with respect to some classic graph par- graph. This could be useful not only in the case of binary
titioning measures (i.e., modularity and coverage), (ii) to partitioning of the graph, but also with respect to the iden-
some controversy measures proposed both in the litera- tification of a higher number of sub-communities within
ture and in this article, and (iii) with respect to their level the conversation graph.
of homogeneity, by means of a qualitative analysis. The As for taking semantic aspects into consideration, in
results obtained in relation to these evaluations have made this work we have shown how the use of sentiment linked
it possible to highlight, first of all, that taking into account to tweets can be particularly effective in the study of echo
semantic aspects in modeling the conversation graph is chamber detection. Regarding the results obtained con-
certainly useful for echo chamber detection with respect to sidering topic-based and hybrid representations, even
considering only topological aspects. In this sense, it will if they too have led to obtain better results than those
certainly be interesting to also consider possible modeling obtained with the topology-based representation alone,
of the graph that add or remove links on the basis of con- however, new studies should be performed from different
siderations exclusively related to the exchanged contents. perspectives.
However, while it has become fairly clear that the addi- As for the representation that takes into account topics,
tion of semantic information may have a contribution with other tokenization rules and topic configurations should be
respect to the problem under consideration, further inves- investigated to verify that these choices do not affect the
tigation is needed to clarify whether those identified are detection of echo chambers. Regarding the hybrid repre-
strongly or rather weakly formulated echo chambers. In sentation, its current formulation could lead to borderline
fact, although most of the controversy measures have con- situations where one aspect (sentiment or topic) dominates
firmed the presence of echo chambers in a pretty strong the other; furthermore, it has been observed that it led to
way (and this has also been confirmed by the qualitative a near reversal of community members. The above-men-
analyzes, at least for the sentiment-based modeling), mod- tioned issues, and the possible interaction between them,
ularity and boundary connectivity have not confirmed this need an in-depth study and statistical analyzes to verify
outcome so clearly. It is true that in the proposed approach if the identified echo chambers are not affected by this
we considered a binary partitioning of the conversational formulation.
graph and this may have affected modularity, a measure Connected to the above-mentioned aspect, there is the
that is not targeted on the echo chamber problem; however, need of studying in more detail the relationship between the
the results obtained through its use and those obtained topics discussed and the sentiment linked to these topics, an
through boundary connectivity should be further inves- aspect that in this work has been only partially considered.
tigated, to assess the effectiveness of such measures with
respect to the considered problem.
Related to the above-mentioned issue, another interest- Author Contributions Both GV and MV contributed to the study con-
ceptualization and design, formal analysis, and validation. Material
ing aspect to be tackled concerns the definition of new preparation, software development, data collection, analysis, and visu-
measures for the assessment of controversy. Some of the alization were performed by GV. The first draft of the article was writ-
state-of-the-art measures are based on the concept of ran- ten by MV. All authors reviewed and commented on further versions
dom walk, but the limit of this modeling is to be found in of the article. All authors read and approved the final article.
the fact that each edge has the same probability of being
Funding Open access funding provided by Università degli Studi
followed along the walk. In reality, this does not hap- di Milano - Bicocca within the CRUI-CARE Agreement. This work
pen; users are encouraged to choose certain paths with received no financial support.
respect to others based on psychological and technological
reasons. Data availability Data are available upon request.
13
78
Page 16 of 17 Social Network Analysis and Mining (2021) 11:78
13
Social Network Analysis and Mining (2021) 11:78 Page 17 of 17 78
Papadopoulos S, Kompatsiaris Y, Vakali A, Spyridonos P (2012) Sawhney K (2017) Community detection using graph structure and
Community detection in social media. Data Min Knowl Discov semantic understanding of text. SNAP Stanford University, Mar-
24(3):515–554 cella Cindy Prasetio and Suvadip Paul
Parés F, Gasulla DG, Vilalta A, Moreno J, Ayguadé E, Labarta J, Cor- Sbalchiero S, Eder M (2020) Topic modeling, long texts and the best
tés U, Suzumura T (2017) Fluid communities: a competitive, scal- number of topics. Some problems and solutions. Qual Quant 1–14
able and diverse community detection algorithm. International Del V Michela, Vivaldo G, Bessi A, Zollo F, Scala A, Caldarelli G,
conference on complex networks and their applications. Springer, Quattrociocchi W (2016) Echo chambers: emotional contagion
New York, pp 229–240 and group polarization on Facebook. Sci Rep 6:37825
Pathak N, DeLong C, Banerjee A, Erickson K (2008) Social topic Viviani Marco, Pasi Gabriella (2017) Credibility in social media: opin-
models for community extraction. In: The 2nd SNA-KDD work- ions, news, and health information-a survey. Wiley Interdiscip Rev
shop, vol 8. pp 2008 Data Min Knowl Discov 7(5):e1209
Prasetyo Putri DG, Viviani M, Pasi G (2020) Social search and task- Wardle C, Derakhshan H (2017) Information disorder: toward an inter-
related relevance dimensions in microblogging sites. In: Inter- disciplinary framework for research and policy making. Counc
national conference on social informatics. Springer. pp 297–311 Eur Rep 27
Quattrociocchi W, Scala A, and Sunstein CR (2016) Echo chambers on Waszak PM, Kasprzycka-Waszak W, Kubanek A (2018) The spread
Facebook. Available at SSRN 2795110 of medical fake news in social media-the pilot quantitative study.
Rehurek R, Sojka P (2010) Software framework for topic modelling Health Policy Technol 7(2):115–118
with large corpora. In: Proceedings of the LREC 2010 workshop Yuan X, Crooks AT (2018) Examining online vaccination discussion
on new challenges for NLP frameworks. Citeseer and communities in Twitter. In: Proceedings of the 9th interna-
Röder M, Both A, Hinneburg A (2015) Exploring the space of topic tional conference on social media and society, pp 197–206
coherence measures. In: Proceedings of the eighth ACM interna- Zhang F, Li J, Li F, Xu M, Xu R, He X (2015) Community detection
tional conference on Web search and data mining. pp 399–408 based on links and node features in social networks. In: Interna-
Sachan M, Contractor D, Faruquie TA, Venkata Subramaniam L (2021) tional conference on multimedia modeling. Springer, pp 418–429
Using content and interactions for discovering communities in Zubiaga A, Liakata M, Procter R, Hoi GWS, Tolmie P (2016) Ana-
social networks. In: Proceedings of the 21st international confer- lysing how people orient to and spread rumours in social media
ence on World Wide Web. pp 331–340 by looking at conversational threads. PloS one 11(3):e01509898
Salton G, Fox EA, Wu H (1983) Extended boolean information
retrieval. Commun ACM 26(11):1022–1036 Publisher's Note Springer Nature remains neutral with regard to
Sasahara K, Chen W, Peng H, Ciampaglia GL, Flammini A, Menczer jurisdictional claims in published maps and institutional affiliations.
F (2019) On the inevitability of online echo chambers. arXiv pre-
print arXiv:1905.03919
13