Professional Documents
Culture Documents
WIW Random
V 12388
E 74495
具k典 12.0
具C典 0.2 0.001
具l典 4.5 3.8
CLUSTERING
036131-2
STRUCTURE OF A LARGE SOCIAL NETWORK PHYSICAL REVIEW E 69, 036131 共2004兲
FIG. 3. The correlation between the local clustering coefficient FIG. 4. The time development of the number of nodes and edges
C(k) and the node degree k for the WIW graph, showing a power of the WIW network. Note that the number of nodes is multiplied
law C(k)⬃k ⫺0.33. by 3 for better visibility.
A relationship C(k)⬃k ⫺ ␣ was observed before in Refs. edge creation activity of members necessarily decreased with
关19,20兴, but with significantly larger exponents. Such power time.
laws hint at the presence of hierarchical architecture 关20兴: The fact that the edge/node ratio changes little over time
when small groups organize into increasingly larger groups is consistent with the observed stationary nature of the de-
in a hierarchical manner, the local clustering decreases on gree distribution. To see this, consider a growing network
different scales according to such a power law. with V(t) nodes at time t and a time-independent degree
The average clustering coefficient and average diameter distribution P(k) with 兺 k P(k)⫽1 and finite first moment
of the WIW network are listed in Table I. The data show that 兺 k k P(k). At time t, there are
it is a ‘‘small-world’’ network in the sense of Ref. 关1兴: the
average path length is almost as short as in a random net- n 共 k,t 兲 ⫽V 共 t 兲 P 共 k 兲
work 关13兴 of the same size, but the clustering coefficient is
two orders of magnitude larger. nodes of degree k, and hence the number of edges is
1 V共 t 兲
TIME DEVELOPMENT E共 t 兲⫽
2 兺k kn 共 k,t 兲 ⫽ 2 兺k k P 共 k 兲 .
As Fig. 4 shows, the number of nodes of the WIW net-
work grew approximately linearly with time. This appears to Consequently the edge/node ratio E(t)/V(t) is essentially
be related to the fact that the WIW network develops as a constant, and it only changes because the maximal degree
subgraph of the underlying social network, and thus the increases. This argument applies to the WIW network with
availability of new members is constrained by high cluster- stationary distribution
ing of the existing social links: a substantial proportion of the
冦
acquaintances of a newly invited member will have been c1
if k⬍k crit
invited already. This conclusion is supported by noting that k ␥1
the average number of successful invitations is very close to P共 k 兲⫽
one 共except for the very first nodes, and the latest ones whose c2
if k⬎k crit
invitations presumably have not yet all been sent out or acted k ␥2
upon兲. The average number of invitations sent out is about
two. Similar observations are noted in Ref. 关21兴 for systems with ␥ 1 ⬇⫺1, ␥ 2 ⬇⫺2. This distribution is on the boundary
growing on a background network. of distributions with finite first moment: the first moment
Furthermore, the number of edges also grew linearly with exists for ␥ 2 ⬍⫺2 but not otherwise.
time, and thus the edge/node ratio only grew moderately dur-
ing the existence of the network. This observation is in con- NETWORK MODELING
tradiction with any purely local time-independent edge cre-
ation mechanism. If every member of the network actively A random graph process based on linear preferential at-
participates in edge creation independently of its age in the tachment for the creation of new edges was proposed in Ref.
network, the edge/node ratio would also increase linearly 关5兴 to account for the observed power laws in natural net-
with time. This linear growth of the edge/node ratio was not works. Such a process leads indeed to a graph with a power
observed in the network, and hence we conclude that the law degree distribution 关5,22兴. However, this model is by
036131-3
G. CSÁNYI AND B. SZENDRŐI PHYSICAL REVIEW E 69, 036131 共2004兲
FIG. 5. The dependence of the degree distribution of our model FIG. 6. The degree distribution of the invitation tree in the
graph on the parameter q, with m⫽15 and V⫽2⫻105 (q⫽0.5,1) model, for various values of q.
and V⫽5⫻104 (q⫽0), averaged over 50 graphs; q crit is ⬇0.5.
definition ‘‘macroscopic,’’ requiring global information the edge/node ratio without significantly affecting any other
about the entire network in every step. This assumption is property of the graph.
realistic for the World Wide Web or some collaboration net- The precise description of our process is as follows.
works, where all nodes are ‘‘visible’’ from all others. For 共1兲 We begin with a small sparse graph.
human social networks however, it is reasonable to assume 共2兲 New nodes arrive at a rate of one per unit time and are
some degree of locality in the interactions. Also, the original attached to an earlier node chosen with a probability distri-
scale-free models are not applicable to networks with high, bution giving weight k q to a node of degree k, where q⭓0 is
degree-dependent clustering coefficients. These problems a parameter.
motivated the introduction of models which use local tri- 共3兲 Internal edges are created as follows: we select two
angle creation mechanisms 关15,23–26兴, which increase clus- random neighbors of a randomly chosen node v , and if they
tering in the network. These models have degree-dependent are unconnected, we create an edge between them. Other-
local clustering, and can also lead to power law degree dis- wise, we select two new neighbors of the same node v and
tributions, though no existing model of this kind shows a try again. If all neighbors of v are already connected to each
double power law. other, we pick a new v .
We now present a model to account for the observed 共4兲 A constant number of internal edges is created per unit
properties of the WIW network. As mentioned above, the time, so that the edge/node ratio equals a constant m after
WIW can be viewed as a growing subgraph of the underlying each time step.
social acquaintance graph. This suggests a model obtained The degree distribution of graphs generated by our pro-
by a two-step process, first modeling the underlying graph, cess is shown in Fig. 5. We found a very robust large-k
and then implementing a growth process. The lack of avail- power law of exponent ␥ 2 ⬇⫺2, essentially independently
able data on the underlying graph, however, prevents us from of the invitation mechanism. We measured the joint probabil-
following this program directly. We build instead a growing ity distribution of the degrees k,k ⬘ of nodes connected by
graph in a single process, choosing the local triangle mecha- new internal edges, and found that for large values, it was
nism as our basic edge creation method. This models the proportional to kk ⬘ . This directly leads to a power law ex-
social introduction of two members of the WIW network by ponent of ⫺2 via standard mean-field arguments 关7兴. On the
a common friend some time in the past, such edges being other hand, the small-k behavior was found to be sensitive to
recorded in the WIW network itself gradually over time. The the invitation mechanism; Fig. 5 shows that a second power
invitation of new members is modeled by sublinear prefer- law only appears at a critical q⫽q crit . The value of q crit
ential attachment 关5兴, motivated by experimental results on depends on the edge/node ratio.
scientific collaboration networks 关7,27兴, where the data per- To explore the hypothesis that the low degree power law
mit the analysis of initial edge formation. We also impose the is indeed related to the invitation mechanism, we plot in Fig.
constant edge/node ratio to be consistent with the observed 6 the degree distribution of the invitation tree of the model
stationary degree distribution. Note that this constant has to network for various values of the parameter q. For q
be tuned from the shape of observed distributions, and can- ⫽q crit , we obtain a scale-free distribution with exponent
not be inferred from the WIW data directly. The reason for ⫺3, similar to what was observed for the real network. De-
this is that the WIW network has a disproportionate number creasing q leads to a much sharper drop in the curve, with an
of nodes of degree 1 共Fig. 1兲, representing people who once exponential tail for q⫽0, whereas increasing q above the
responded to the invitation but never returned, which distorts critical value leads to a gelation-type behavior: new nodes
036131-4
STRUCTURE OF A LARGE SOCIAL NETWORK PHYSICAL REVIEW E 69, 036131 共2004兲
关1兴 D.J. Watts et al., Nature 共London兲 393, 440 共1998兲. 关14兴 S. Milgram, Psychol. Today 2, 60 共1967兲.
关2兴 J.M. Montoya and R.V. Sole, J. Theor. Biol. 214, 405 共2002兲. 关15兴 D.J. Watts, Small Worlds 共Princeton University Press, Prince-
关3兴 H. Jeong et al., Nature 共London兲 407, 651 共2000兲. ton, NJ, 1999兲.
关4兴 F. Liljeros et al., Nature 共London兲 411, 907 共2001兲. 关16兴 I. de Sola Pool and M. Kochen, Soc. Networks 1, 1 共1978兲.
关5兴 A.-L. Barabási and R. Albert, Science 286, 509 共1999兲. 关17兴 S. Wassermann and K. Faust, Social Network Analysis 共Cam-
关6兴 M.E.J. Newman, Proc. Natl. Acad. Sci. U.S.A. 98, 404 共2001兲. bridge University Press, Cambridge, 1994兲.
关7兴 A.-L. Barabási et al., Physica A 311, 590 共2002兲. 关18兴 M.E.J. Newman, Phys. Rev. E 64, 016131 共2001兲.
关8兴 L.A.N. Amaral, A. Scala, M. Barthelemy, and H.E. Stanley, 关19兴 A. Vazquez et al., Phys. Rev. E 65, 066130 共2002兲.
Proc. Natl. Acad. Sci. U.S.A. 97, 11 149 共2000兲. 关20兴 E. Ravasz and A.-L. Barabási, Phys. Rev. E 67, 026112 共2003兲.
关9兴 M. Faloutsos et al., Comput. Commun. Rev. 29, 251 共1999兲. 关21兴 R. Pastor-Satorras et al., Phys. Rev. E 63, 066117 共2001兲.
关10兴 R. Albert et al., Nature 共London兲 401, 130 共1999兲. 关22兴 B. Bollobás, O. Riordan, J. Spencer, and G. Tusnády, Random
关11兴 R. Kumar, P. Raghavan, S. Rajalopagan, and A. Tomkins, in Struct. Algorithms 18, 279 共2001兲.
Proceedings of the Ninth ACM Symposium on Principles of 关23兴 E.M. Jin et al., Phys. Rev. E 64, 046132 共2001兲.
Database Systems 共Association for Computing Machinery, 关24兴 A. Vazquez, Europhys. Lett. 54, 430 共2001兲.
New York, 2000兲, p. 1. 关25兴 P. Holme and B.J. Kim, Phys. Rev. E 65, 026107 共2002兲.
关12兴 A. Broder et al., Comput. Netw. 33, 309 共2000兲. 关26兴 J. Davidsen et al., Phys. Rev. Lett. 88, 128701 共2002兲.
关13兴 P. Erdős and A. Rényi, Publ. Math. 共Debrecen兲 6, 290 共1959兲. 关27兴 M.E.J. Newman, Phys. Rev. E 64, 025102 共2001兲.
036131-5