You are on page 1of 43

Networks

L1: an introduction to networks and communities

Michele Tumminello,
Department of Economics, Business and Statistics
University of Palermo

Data Science and Big Data Analytics


Networks or Graphs?
Seven Bridges of Königsberg (Euler 1735):

A D

C
Is there a walk through the city
that would cross each bridge
once and only once? A D

NO
13/09/20 MT B 2
Networks or Graphs?
The Zachary’s Karate Club (Zachary 1977):
At some point, a conflict arose
between the club’s administrator
(vertex 1) and one of the teachers
(vertex 33), which led to the split
of the club in two smaller clubs,
with some members staying with
the administrator and the others
following the instructor.

Is there a “natural” way to break


the network in two?

YES

13/09/20 MT 3
It’s the question that really matters!

13/09/20 MT 4
Types of networks

Newman (2003)
13/09/20 MT 5
Networks modeling Social Systems are typically
Complex Networks

What does it mean?

13/09/20 MT 6
Between Regular and Random Networks

Maybe complicated… but still NOT COMPLEX

13/09/20 MT 7
Sparse Networks

The probability that any node on the network will be very highly
connected to many others is VERY LOW

The probability that a very large number of nodes will be connected


very loosely or not at all is VERY HIGH

Complex networks are typically sparse networks

Barabasi and Albert (1999)

13/09/20 MT 8
Vertex degree
K=1 K=2 K=3

The vertex degree “K” is the number


of edges incident to the vertex

13/09/20 MT 9
Normal and power-law distribution

Benkler (2006): 244

13/09/20 MT 10
The function P(X>x)
P(X>x) is the probability to observe a value
of X that is larger than x.

• Example: If x=1 link then P(X>1) is the


probability that a site has more than 1 link
pointing to it

13/09/20 MT 11
Normal and power-law distribution
Linear Plot Log-Linear Plot

Normal distribution: Power-law distribution:

Very small # of vertices – few links Very large # of vertices – few links

Very small # of vertices – large # of links Small # of vertices – very large # of links

13/09/20 MT 12
Complex Networks are scale free

A power-law distribution of the vertex degree is


often seen as a mark of complex networks…

Why?

13/09/20 MT 13
Star Alliance Footprint

Pittsburgh Los Angeles Tokyo Okinawa

Pittsburgh New York Amsterdam

Pittsburgh Detroit Buffalo

Would that be possible with a Normal distribution of the degree?


13/09/20 MT 14
Communities
Zachary’s
Karate club

Lusseau’s
network of
bottlenose
dolphins
Scientific
collaborations
at SFI

S. Fortunato, Physics Reports 486 (2010) 75–174


13/09/20 MT 15
Concepts to take home
Complex networks
• Are (typically) sparse
• Are scale free
• Display emergent structures: communities

• Nodes are (typically) interdependent


• Are robust, but cascading phenomena are
possible
• Are evolving systems

13/09/20 MT 16
Concepts to take home
Complex networks
• Are (typically) sparse
• Are scale free
• Display emergent structures: communities

• Nodes are (typically) interdependent


• Are robust, but cascading phenomena are
possible
• Are evolving systems

13/09/20 MT 17
Communities

13/09/20 MT 18
Learning outcomes

• To learn about one of the fundamental mechanisms


driving the evolution of social networks: the triadic closure

• To understand how the mechanism of triadic closure relates


to the formation of communities in social networks

• To learn about the basic quantities used to describe the


degree to which nodes in a network tend to cluster together

13/09/20 MT 19
Levels of description of networks

Macroscopic level: degree distribution, total clustering coefficient, etc.

Microscopic level: degree, centrality of individual nodes, etc.

Mesoscopic level: communities

13/09/20 MT 20
People tend to aggregate…

13/09/20 MT 21
…selectively

13/09/20 MT 22
WARNING

A community is NOT necessarily identified by a


single connected component of the network

13/09/20 MT 23
Map of science based on citation patterns

M. Rosvall and C.T. Bergstrom, PNAS 105 (4), 1118-1123 (2008)


13/09/20 MT 24
An intuitive definition of community

In many real networks, nodes appear to group in sub-networks in which the


density of internal links is larger than the connections with the rest of vertices
in the network.

13/09/20 MT 25
An intuitive definition of community

13/09/20 MT 26
An intuitive definition of community

13/09/20 MT 27
What happens with directed links?

13/09/20 MT 28
What happens with directed links?

13/09/20 MT 29
The definition of “community” might depend on

• the properties of the network (weighted, directed, etc.)

• the “processes” running on it (spread of information, etc.)

13/09/20 MT 30
Communities naturally appear in social
networks

How do they form?

13/09/20 MT 31
The building block of a community

13/09/20 MT 32
Triadic Closure
Consider a network of friendship:

If two people have a friend in common, then there


is an increased likelihood that they will become
friends themselves at some point in the future.

Triple Triangle

A. Rapoport, Bulletin of Mathematical Biophysics 15(4), 523-533 (1953)

13/09/20 MT 33
Reasons for Triadic Closure

•Opportunity: If vertices B and C have a common friend A then the probability


that they meet (and potentially become friends) increases.

•Trusting: the fact that node B and C have a common friend A is a basis for trusting
each other.

•Incentive: If node A is friends with B and C, then it is a source of latent stress


for A if B and C are not friends with each other. This is an incentive for A to
bring B and C together.

13/09/20 MT 34
The building block of a community

Triadic closure favors


the formation of triangles
in an evolving network

13/09/20 MT 35
Clustering coefficient of a vertex (local)

Theoretical definition: The clustering coefficient of a vertex A is the probability


that two randomly selected friends (first neighbors) of A are friends with each other.

Operative definition: The clustering coefficient of a vertex A is the fraction of pairs


of A’s first neighbors that are linked to each other.

13/09/20 MT 36
Clustering coefficient of a vertex (local)

Operative definition: The clustering coefficient of a vertex A is the fraction of pairs


of A’s first neighbors that are linked to each other.

13/09/20 MT 37
Clustering coefficient of a vertex (local)

# of pairs A’s friends that are linked to each other = number of triangles involving A=2
(CD, BC) (ABC, ACD)

Operative definition: The clustering coefficient of a vertex A is the fraction of pairs


of A’s first neighbors that are linked to each other.

13/09/20 MT 38
Clustering coefficient of a vertex (local)

# of pairs of A’s friends = number of triples rooting in A = 6


(BC,BD,BE,CD,CE,DE) (ACD, ABC,ABD,ABE,ACE,ADE)

Operative definition: The clustering coefficient of a vertex A is the fraction of pairs


of A’s first neighbors that are linked to each other.

13/09/20 MT 39
Question

Suppose that a vertex A, with degree(A)>1, fully satisfies the triadic closure property.

What is the clustering coefficient of A?

13/09/20 MT 40
Total Clustering coefficient (global)
Triple Triangle

A triangle
contains
3 triples

3 ⇥ total number of triangles


C=
total number triples

13/09/20 MT 41
Question
What is the total clustering coefficient of this network?

3 ⇥ total number of triangles


C=
total number triples

13/09/20 MT 42
How is it possible to detect
communities in complex networks?

MODULARITY OPTIMIZATION

13/09/20 MT 43

You might also like