Probabilistic Network Theory
Probabilistic Network Theory
Introduction
The aim of this chapter is to describe the foundations of probabilistic network theory. We review
the development of the field from an early reliance on simple random graph models to the
construction of progressively more realistic models for human social networks. We hence show
how developments in probabilistic network models are increasingly able to inform our
understanding of the emergence and structure of social networks in a wide variety of settings.
Social networks
Growing numbers of social scientists from an increasingly diverse set of disciplines are turning their
attention to the study of social networks. The precise reasons vary, but almost certainly there are at
least two key factors at work. The first is an increasing recognition that networks matter in many
realms of social, political and economic life. Networks both potentiate and constrain the social
interactions that, for instance, underpin the dissemination of knowledge, the exercise of power and
influence, and the transmission of communicable diseases. Ignoring the structured nature of these
interactions often leads to erroneous conclusions about their consequences, as social scientists from
a number of disciplines have repeatedly pointed out (e.g., Bearman, Moody & Stovel, 2004;
Kretzschmar & Morris, 1996). The second reason for a heightened focus on social networks is our
increasing capacity to measure, monitor and model social networks and their evolution through
time, and hence to draw social networks into a more general program for a quantitative social
science. Probabilistic models for social networks have played – and will likely increasingly play –
a vital role in these developments. In this chapter we review progress in attempts to develop
probabilistic network models, and point to areas of ongoing development.
At the outset it is important to recognise that social networks pose particular challenges as far as
probability modelling is concerned. Unlike observations on a set of distinct actors, where an
1
We are grateful to Peng Wang and Galina Daraganova for helpful comments on this chapter.
assumption of independent observations may often seem reasonable, social relationships are much
less plausibly regarded as independent. Relational observations may share one or more actors and
hence be subject to such influences as the goals and constraints of a particular actor. Alternatively,
they may be linked by other relationships (eg the relationship between actors i and j may be linked
with the relationship between actors k and l by a relationship involving actors j and k) and hence
dependent, for example, by virtue of competition or cooperation regarding relational resources
involving actors j and k. As we see below, the development of probabilistic network models began
with simple models that assumed independent relational ties, but empirical researchers quickly
confronted the problem that social networks appeared to deviate from simple random structures in
seemingly systematic ways. Hence, the story of the development of probabilistic network models is
a story of alternatively probing and parameterising progressively more complex systematicities in
network structure.
We begin with some notation and some important definitions, referring the reader to Wasserman
and Faust (1994; also Bollobás, 1998) for a fuller exposition of key concepts.
2
Graphs and directed graphs can be conveniently represented by a graph drawing. The elements of
the node set N are represented by points in the drawing, and a nondirected line connects node i and
node j if (i,j) is an edge in the edge set E. In the case of a directed graph, an arc represented by a
directed arrow is drawn from node i to node j if (i,j) is in the arc set E. Figures 1 and 2 show
examples of a graph and directed graph, respectively.
-----------------------------------------
Insert Figures 1 and 2 about here
-----------------------------------------
Adjacency matrix
Graphs and directed graphs can be represented by a binary adjacency matrix. For example, in the
case of a graph, we can define x to be an n × n matrix with entries: xij = 1 if there is an edge
between i and j; and xij = 0, otherwise. Since xij = 1 if and only if xji = 1, x is necessarily a
symmetric matrix. In the case of a directed graph, unit entries in x correspond to arcs in E (that is,
xij = 1 if and only if there is an arc from i and j; and xij = 0, otherwise). In this case, symmetry is not
necessarily implied. The adjacency matrix corresponding to the graph of Figure 1 is shown in
Table 1. Note that the size of a graph is x++/2 whereas the size of a directed graph is x++, where x++
= ∑i,jxij is the sum of entries in the adjacency matrix.
3
degree is more complex, since there may be an arc directed from node j towards a given node i or
away from node i towards node j, or there may be arcs in both directions between nodes i and j. We
therefore characterise each node i in a directed graph by its: outdegree outi = xi+ = ∑jxij; indegree ini
= x+i = ∑jxji; and mutual degree muti = ∑jxijxji.
-----------------------------------------------
Insert Figure 3 and Table 1 about here
-----------------------------------------------
A useful set of descriptive statistics for a graph or directed graph is a summary of the form of all of
its small subgraphs. For example, the dyad census is a count of the number of each possible type of
2-node induced subgraphs and the triad census is the set of counts of 3-node induced subgraphs.
For example, the graph of Figure 1 has 102 null dyads and 34 linked dyads; its triad census
comprises 279, 322, 49 and 30 induced 3-node subgraphs with 0, 1, 2 and 3 edges, respectively.
The dyad census and the triad census for directed graphs are defined similarly, but the number of
forms of 2- and 3-node subgraphs is greater in the directed graph case.
Implicit in the description of the dyad and triad census is the notion that two graphs (or subgraphs)
can have the same form. We can make this notion more explicit by defining an isomorphic
mapping between graphs. Specifically, two graphs G =(N,E) and H =(N′,E′) are isomorphic if there
is a one-to-one mapping ϕ from N onto N′ such that (i,j) is an edge in E if and only if (ϕ(i),ϕ(j)) is
an edge in E′.
4
ties. Not surprisingly, therefore, path-like structures in networks that might be associated with the
flow of social processes are important concepts. A path from node i to node j is an ordered
sequence i = i0, i1, …, il = j of distinct nodes in which each adjacent pair (ij-1,ij) is linked by an edge
or an arc. The length of the path is l. If there is a path from a node i to a node j, then j is said to be
reachable from i. If node j is the same as node i, then the path is termed a cycle of length l.
A geodesic from node i to another node j is a path of minimum length and the geodesic distance dij
from node i to node j is the length of the geodesic. If there is no path from i to j, the geodesic
distance is infinite. The geodesic distance dij for distinct nodes i and j is either an integer in the
range from 1 to n-1 or is infinite. For a graph, geodesic distances are symmetric, that is dji = dij; this
is not necessarily the case, however, for directed graphs. The geodesic distribution of a graph or
directed graph is the distribution of frequencies of geodesic distances, that is, the distribution of
counts of the number of ordered pairs of nodes having each possible geodesic distance. The
geodesic distribution of the graph of Figure 1 is presented in Figure 4 in the form of a histogram.
The geodesic distribution can be seen as a useful summary of inter-node distances. Later, we refer
to the quartiles of this distribution as simple summary statistics for inter-node distances.
--------------------------------
Insert Figure 4 about here
--------------------------------
If each node in a graph G is reachable from each other node, then G is connected. A component of
G is a maximal connected subgraph, that is, a connected subgraph with vertex set W for which no
larger set Z containing W is connected. The graph of Figure 1 is clearly connected.
In the case of a directed graph, we may also define a semipath from node i to node j as an ordered
sequence i = i0, i1, …, il = j of distinct nodes in which either (ij-1,ij) and/or (ij,ij-1) is an arc. The
length of the semipath is m. If each node in a directed graph G is reachable from each other node,
then G is strongly connected. If there is a semipath from each node in G to each other node, then G
is said to be weakly connected.
The Hungarian mathematicians Paul Erdös and Alfréd Rényi initiated an important approach to the
study of random graph structures with a foundational series of papers beginning in 1959 (Erdös &
Rényi, 1959). They introduced two primary random graph distributions on a fixed node set N =
{1,2,…,n}. The probability distributions are defined on the set of all graphs on n distinct nodes;
5
this set contains 2n(n-1)/2 graphs, since each of the n(n-1)/2 pairs of nodes may or may not be linked
by an edge. Each of the two random graph distributions that we introduce below associates a
probability with every graph in this set.
G(n,p)
In the first case, the edges of a graph are regarded as a set of independent Bernoulli variables. If we
let Xij denote the edge variable for the pair of nodes i and j, and p be the (uniform) probability that
the edge between i and j is present, then we can write Pr(Xij = 1) = p and hence Pr(Xij = 0) = 1- p.
Since the edge variables are independent, it is then easy to write down the probability of any
particular graph H of order n and size m:
Pr (G = H) = pm(1 - p)m*-m
where m* = n(n-1)/2 is the maximum number of edges in a graph of order n. The set of all possible
graphs of order n and their corresponding probabilities is the random graph distribution G(n,p).
In the special case where p = 0.5, the probability of each graph H on n nodes is:
Pr (G = H) = (0.5)m(0.5)m*-m = (0.5)m*
and hence every graph on n nodes is equiprobable. This distribution is often termed the uniform
random graph distribution, and denoted by U.
G(n,m)
The second random graph distribution associates nonzero probabilities only with graphs of order n
and size m; further, every such graph is assumed to be equiprobable. Since there are n!/(m!(n-m)!)
distinct graphs in the class, the probability of any particular graph of order n and size m is:
Pr (G = H) = m!(n-m)!/n!
The distribution G(n,m) may be regarded as the uniform random graph distribution on n nodes,
conditional on the property of having m edges. It may also be designated U│x++=m. More
generally, we can define a conditional uniform random graph distribution in terms of any graph
property Q (Bollobás, 1985). If Q is a subset of all possible graphs on n nodes, then the distribution
U│Q assigns equal probabilities (namely, 1/|Q|) to all graphs in the subset Q and zero probability to
all graphs not in Q. We term U│Q the uniform random graph distribution conditional on Q.
6
an excellent introduction to the field, as does the review by Albert and Barabási (2002); here we
illustrate just two aspects of this literature by considering the expected values of some statistics in
G(n,p) and by reviewing properties of a graph as a function of p in G(n,p).
We begin by determining the expected number of cliques in a graph. Let Ys = Ys(G) be the number
of cliques of order s in the graph G. Then the expected value of Ys can readily be computed as:
E(Ys) = (n! /[s!(n-s)!]) pu
where u = s(s-1)/2 is the number of edges in a clique of order s (for example, see Bollobás, 1998).
In G(17,.25), for example, the expected number of cliques of order 3 is 10.6, and the expected
number of cliques of order 4 is 0.58. The graph of Figure 1 has 17 nodes and a density of 0.25 and
it is therefore interesting to compare the expected values for G(17,.25) with those observed for the
graph of Figure 1, namely, 34 cliques of order 3 and 18 cliques of order 4.
Indeed, if F is any subgraph with s nodes and t edges, and YF is the number of subgraphs of G that
are isomorphic to F, then the expected value of YF can readily be shown to be:
E(YF) = (n! /[(n-s)!a]) pt
where a is the number of distinct ways in which the nodes of the graph F can be labeled with the
integers 1,2, …, s to yield the same graph. For example, in the case of a cycle of order s, F has s
edges and s nodes and there are 2s distinct ways in which nodes can be labeled to yield the same
graph; hence:
E(YF) = (n! /[(n-s)!2s]) ps
The expected number of induced subgraphs isomorphic to F may be similarly derived:
E(YF) = (n! /[(n-s)!a]) pt(1-p)s(s-1)/2 - t.
The latter formula may be used, for example, to compute the expected triad census for a graph of
given order and density.
The expression for the expected number of subgraphs isomorphic to F in G(n,p) allows us to
explore features of random graphs as n tends to infinity. Since
E(YF) = (n! /[(n-s)!a]) pt ≈ nspt/a
it is clear that if p = cn-s/t then E(YF) ≈ ct/a and the expected number of subgraphs isomorphic to F,
denoted by λ = ct/a, is a finite number. If, however, pns/t tends, respectively, to 0 or ∞ as n tends to
∞, then the probability that a random graph in G(n,p) contains at least one subgraph F converges to
0 or 1, respectively (e.g., Albert & Barabási, 2002). Thus p = cn-s/t is a critical probability, below
which graphs with large n rarely contain F, above which they almost certainly do. An important set
7
of results in random graph theory document the many graph properties that show this form of rapid
transition from being very unlikely to very likely as a function of the edge probability p.
Application of this approach allows us to infer for large n some expected features of random graphs
in G(n,p) as a function of p, relative to n. Thus, for example, if p < 1/n then almost every graph in
G(n,p) comprises a number of components, each without any cycles; if p lies between 1/n and (ln
n)/n, then almost every graph has a so-called giant component (that is a component including a
large proportion of the nodes in N); and if p > (ln n)/n, then almost every graph is connected (e.g.,
see Albert & Barabási, 2002).
Random directed graph distributions may be similarly defined, though interest in them has largely
come from social scientists with applications to network data in mind. It is to this literature that we
now turn.
Applications of random graph and directed graph distributions to social network data
Before describing the application of random graphs to social networks, it is important to say
something about typical sources of social network data (eg Wasserman & Faust, 1993). A common
method of measuring social networks is to survey all members of a circumscribed population about
their ties. In this case, ties are typically directional and there may or may not be a limit imposed on
the maximum number of ties reported by each respondent. Occasionally in this case, it is fruitful to
consider the graph constructed from mutual ties only. For example, the graph of Figure 1 is the set
of mutual ties observed among the girls in a Grade 8/9 high school class, obtained in response to the
question: “Who are your best friends in the class?” Networks are also commonly inferred from
archival data, such as communication logs or membership or attendance lists. Less common
strategies include direct observation, and more elaborate survey techniques.
8
Consider, for example, the random graph distribution DG(n,p) with a fixed set of n nodes and
uniform but unknown arc probability p; this is the directed graph analogue of G(n,p). The expected
number of mutual ties for directed graphs in this distribution is n(n-1)p2/2, and the expected number
of nodes with indegree k is npk(1-p)n-1-k. If a social network with n nodes and x++ = ∑i,jxij arcs has
been observed, then the arc probability p can be estimated from the network data as p* = x++/[n(n-
1)]. This estimate of the arc probability can be utilized to compute the expected number of mutual
ties and the expected number of nodes with each possible indegree k, on the assumption that the
observed network was generated from DG(n,p*). These expected values can be compared with the
observed number of mutual ties and the observed indegree distribution in the network x. If the
observed values are markedly different from the expected ones, then, it can be argued, there is
reason to question the suitability of the assumption of independent random arcs with uniform
probability that underpinned the computation of expected values.
The computations by Moreno and colleagues revealed what would later become a very common
finding: that the observed number of mutual ties in an observed human social network is much
greater than the number expected on the basis of arc probability alone, and the indegree distribution
is more heterogeneous than expected, that is, there are more nodes with very low and very high
indegree than expected. These and similar findings suggest that tendencies towards mutuality and
heterogeneity in partner “attractiveness” are systematic features of observed social networks
comprising ties of affiliation.
Most importantly for our purposes here, Moreno introduced the idea of using random directed graph
distributions as null distributions. The features of this null distribution could be compared with
features of an observed network, a comparison enabling researchers to identify ways in which the
observed network appeared to be systematically different. In this early example and in many to
follow, it was important that the expected features of the null distribution could be derived
mathematically. Much later, when fast computers and more versatile simulation algorithms were
introduced, this restriction could be relaxed, but it was an important reason for the focus of early
applications on this “null distribution” approach. Moreover, although this early application
assumed very simple null distributions (such as, independent arcs with uniform tie probability),
more complex distributions were soon developed. Indeed, the strategy continues to be utilized in
new ways (e.g., Bearman et al, 2004; Pattison, Wasserman, Robins & Kanfer, 2000) and to be
rediscovered in new fields (e.g., Milo, Shen-Orr, Itzkovitz, Kashtan, Chlovskii & Alon, 2002). It
remains an important means by which some of the systematic structural properties of human social
networks can be and have been uncovered.
9
Holland, Leinhardt and colleagues were responsible for developing a number of important
elaborations of this basic strategy. For example, Holland and Leinhardt (1975) computed the
expected mean vector and variance-covariance matrix for the triad census in the uniform random
directed graph distribution U│mut,asym,null 2 conditional on fixed numbers mut, asym and null of
mutual, asymmetric and null ties, respectively (mut = ∑i,jxijxji/2, asym = ∑i,j[xij(1-xji)+ xji(1-xij)], null
= ∑i,j(1-xij)(1-xji)). In other words, they computed the expected distribution of the triad census
while conditioning on the dyad census. This allowed them to construct a test statistic for any linear
combination of triad counts, and hence assess whether the observed combination of triad counts is
in the upper or lower tail of the expected distribution. For example, they could test for the presence
of transitivity (that is, the property that arcs from nodes i to j and from j to k are accompanied by an
arc from i to k).
Similar calculations can be made for other distributions of possible interest, including the uniform
distribution U│{xi+},mut conditional on the outdegrees of each node in the directed graph as well as
the number of mutual ties. Of course, for some desirable combinations, such as U│{xi+},{x+i},mut,
the calculations are very difficult and have prompted alternative parametric approaches. In some of
these difficult cases, clever simulation strategies have been devised to circumvent the difficult
mathematics. For example, Snijders (1991) used an importance sampling approach to simulate
U│{xi+},{x+i} and McDonald, Smith and Forster (in press) have described a Markov Chain Monte
Carlo algorithm to simulate the distribution U│{xi+},{x+i}, mut.
Biased nets
One other early probabilistic approach deserves mention. In a series of papers, Rapoport and
colleagues developed the theory of biased nets, that is, random networks with biases towards
symmetry, transitivity and other features characteristic of observed social networks (Rapoport,
1957). Although a full and satisfactory mathematical treatment proved elusive, Rapoport and
colleagues employed their conceptualization of biased nets to conduct some illuminating studies of
the connectivity structure of a large friendship network (e.g., Rapoport & Horvath, 1961). In part,
their work can be seen as the intellectual precursor to the more general probabilistic developments
described below, but a different framing of the “biases” has proved more useful.
2
In fact, Holland and Leinhardt (1975) termed this the U|MAN distribution, but we have attempted to keep notation
consistent in the chapter.
10
The p1 model
Comparison of observed social networks with random directed graph distributions consistently
revealed a greater than expected number of mutual ties and greater than expected degree
heterogeneity. As a consequence, it was felt desirable to compare observed networks with graph
distributions that resembled observed networks in these fundamental respects. The problem of
satisfactorily simulating the random graph distribution conditional on the number of mutual ties and
the in- and out-degree sequences is arguably still not resolved. In the meantime, Holland and
Leinhardt (1981) developed an alternative approach: a probability model that parametrised these
tendencies. This model was an important step towards the development of a more general
framework.
The p1 model developed by Holland and Leinhardt (1981) assumes independent dyads Dij = (Xij,Xji).
The distribution of the entire network X = [Xij] can then be determined by specifying the probability
of each possible dyadic form for Dij since the probability of the entire network X is the product of
the dyad probabilities. The individual dyad probabilities can be expressed in terms of the
probability of occurrence of a mutual dyad, an asymmetric dyad, and a null dyad. Thus, we define:
Pr(Dij=(1,1)) = mij= mji, Pr(Dij=(1,0)) = aij, and Pr(Dij=(0,0)) = nij = nji
where mij + aij + aji + nij = 1 for all i ≠ j.
Holland and Leinhardt added two useful restrictions to this general dyad-independent model. The
first was that the reciprocity parameter ρij is a constant for all dyads, that is, ρij = ρ for all i ≠ j. The
second was that the parameter θij depended additively on the propensity of arcs to emanate from
node i and the propensity for arcs to have node j as a target; in other words:
11
θij = θ + αi + ßj for i ≠ j.
The resulting model is termed the p1 model:
p1(x) = Pr(X = x) = K exp[ρΣi,jXijXji + θX++ + ΣiαiXi+ + ΣißiX+i]
The parameters ρ and θ can be interpreted as uniform reciprocity and density parameters, and the
node-dependent parameters αi and ßi reflect the expansiveness and attractivesness, respectively, of
each node i.
The development of the p1 model was an important step in probabilistic network theory, not least
because much of the machinery of statistical modeling could be brought to bear on the problem of
assessing model adequacy. The model could be estimated from data, and its goodness of fit could
be subjected to careful scrutiny, as Breiger (1981) demonstrated. Such scrutiny led to the
recognition that observed networks often exhibited structural properties not captured by the
parameters of the p1 model, and spawned the development of two important lines of further model
development.
One such model was inspired by the concept of structural equivalence in a graph (Lorrain & White,
1971). Two nodes are structurally equivalent if they have identical patterns of relationships to other
nodes. The concept of structural equivalence has been very influential in the social networks
literature because it can be used to represent the idea that two actors have the same social position
in a network, that is, that they are indistinguishable from a relational point of view.
Formally, two nodes i and j are structurally equivalent in a directed graph G with adjacency matrix
x if xik = xjk and xki = xkj for all nodes k ≠ i, j in N. Structurally equivalent nodes can be partitioned
into blocks. Nowicki and Snijders (2001) assumed that the blocks to which nodes belong are
unobserved. They defined a set of independent and identically distributed latent random variables
Z = [Zi] where Zi denotes the block of node i, and Pr(Zi = k) = θk. They assumed the dyads Dij =
(Xij,Xji) to be conditionally independent given the blocks and the probability that a dyad has a
particular relational form to depend only on the (unobserved) blocks of the nodes. In other words:
Pr(Dij = a|Z = z) = ηa(zi,zj)
where:
12
• a is a vector of possible values for the dyad, with a ∈ {(1,1), (1,0), (0,1), (0,0)} for a
directed graph and a ∈ {(1,1), (0,0)} for a graph; and
• ηa(zi,zj) is the block-dependent probability of observing the vector a.
In this model, two nodes i and j are stochastically equivalent if they belong to the same block and
hence the same dyad probabilities (Pr(Dik = a|Z = z) = Pr(Djk = a|Z = z)) for all nodes k. Since the
dyads are assumed to be conditionally independent given the blocks Z, the joint distribution of the
Dij given Z is the product of the conditional dyad probabilities. Nowicki and Snijders (2001)
developed a Bayesian approach to estimation of θ and η, and hence computation of the posterior
probabilities that any pair of nodes are in the same block and that a dyad has any particular
relational form.
Several other important latent variable models have been developed for particular types of social
networks that are likely to reflect some form of proximity among actors, such as friendship or
collaboration. In cases such as these, it may be reasonable to assume that tie probabilities are
monotonically related to proximity in a latent space. For example, Hoff, Raftery and Handcock
(2002) proposed a model which assumes that nodes have latent locations in some low-dimensional
Euclidean space and that, given these latent locations, tie variables are conditionally independent.
Schweinberger and Snijders (2003) developed a similar model based on an ultrametric rather than
Euclidean space; in their model, every pair of nodes is associated with an unobserved distance in an
ultrametric space corresponding to a discrete hierarchy of “settings”, and tie probabilities are
conditionally independent, given these latent ultrametric distances. (Distances in an ultrametric
space satisfy the ultrametric inequality, that is d(i,j) ≤ max{d(i,k), d(j,k)} for any triple of nodes i, j,
k.) They developed approaches for estimating the unobserved ultrametric distances. Handcock,
Raftery and Tantrum (2005) have recently extended Hoff et al’s (2002) model by assuming that the
latent locations are drawn from a finite mixture of multivariate normal distributions, each of which
represents a different group of nodes.
The second recent line of development has been to build probabilistic network models in which
conditional dependencies among tie variables are permitted. This work began with the recognition
by Frank and Strauss (1986) that a general approach for modeling interactive systems of variables
(Besag, 1974) could be usefully applied to the problem of modeling systems of interdependent
network tie variables on a fixed set of nodes. This was an important step because it permitted
13
models to go beyond the limiting assumption of dyad-independence in a quite general way. Frank
and Strauss (1986) introduced a Markov dependence assumption for network tie variables: two
network tie variables were assumed to be conditionally independent, given the values of all other
network tie variables, unless they had a node in common. Thus, whereas a tie between nodes i and j
was assumed to be conditionally independent of ties involving all other distinct pairs of nodes k and
l, it could be conditionally dependent on any other ties involving i and/or j.
Assumptions about which pairs of tie variables are conditionally dependent, given the values of all
other tie variables, can be represented as a dependence graph. The node set of the dependence
graph D is the set of tie variables {Xij} and two tie variables are joined by an edge in D if they are
assumed to be conditionally dependent, given the values of all other tie variables. In the case of
G(n,p), D is an empty graph since all pairs of variables are assumed to be mutually independent. In
the Markov case, the variable Xij is connected to Xik and Xjk for all k ≠ i or j and the dependence
graph is connected. Figure 5 shows the Markov dependence graph for a random graph of order 4.
--------------------------------
Insert Figure 5 about here
--------------------------------
As Frank and Strauss originally outlined, the consequences of any proposed assumptions about
potential conditional dependencies among network tie variables can be inferred from the
Hammersley-Clifford theorem (Besag, 1974). The theorem establishes a model for the interacting
system of tie variables in terms of parameters that pertain to the presence or absence of certain
configural forms in the network. The model, known as an exponential random graph model, takes
the general form:
Pr(X = x) = exp(ΣAγAzA(x))/κ
where:
• A is a subset of tie variables (defining a potential network configuration);
• γA is a model parameter associated with the configuration A (to be estimated) and is
nonzero only if the subset A is a clique in the dependence graph D;
• zA(x) = ΠXij∈Axij is the sufficient statistic corresponding to the parameter γA, and
indicates whether or not all tie variables in the configuration A have values of 1 in the
network x; and
• κ is a normalizing quantity.
14
In order to reduce the number of model parameters, Frank and Strauss (1986) introduced a
homogeneity constraint that parameters for isomorphic configurations are equal. With this
constraint, there is a single parameter γ [A] for each class [A] of isomorphic configurations that
correspond to cliques in the dependence graph. The sufficient statistic in the model corresponding
to the class [A] is then
z[A](x) = ΣA∈[A] ΠXij∈Axij,
that is, a count of all observed configurations in the graph x that are isomorphic to the configuration
corresponding to A. For example, in the case of a homogeneous Markov random graph, it is readily
seen that cliques A in the dependence graph D correspond to graph configurations that are edges,
stars and triangles (see Figure 6) and the model therefore takes the form:
Pr(X=x) = exp(θL(x) + ΣkσkSk(x) + τT(x))/κ
where L(x), Sk(x) and T(x) are the number of edges, k-stars (2 ≤ k ≤ n-1) and triangles in the
network x, and θ, σk (2 ≤ k ≤ n-1), and τ are corresponding parameters.
In many circumstances, the parameters may be interpreted by observing that if a configuration class
[A] has a large positive (or negative) parameter in the model, then the presence of many
configurations in the class enhances (or reduces) the likelihood of the overall network, net of the
effect of all other configurations. It should be noted, though, that the function relating the value of
one of the model’s parameters, say λ[A] to the expected value of the corresponding sufficient
statistic z[A](x) may be markedly nonlinear and exhibit a sharp and rapid transition from lower
average counts to higher average counts with a relatively small change in the parameter (holding
constant the values of all other model parameters). For example, the expected number of triangles
in a graph as a function of the triangle parameter τ is shown for a graph on 17 nodes in Figure 7.
The values of the parameters θ, σ2 and σ3 are fixed at -1.2558, -0.0451, and -0.1084, respectively,
and τ takes values in the range [0.05,1.70]. Figure 7 shows the distribution of the triangle statistic
in the form of a box plot for each value of τ. It can be seen that for low values of τ small increases
in τ are associated with small and steady increases in the triangle statistic. As τ approaches 1.40,
though, the impact of small changes in τ increases rapidly in magnitude, and there is a sharp
transition to a higher value of the triangle statistic. Near the point of transition, the triangle statistic
may take values typical of the graphs on either side of this apparent threshold. This form of
nonlinear relationship is common, and the location of this threshold and the sharpness of the rise in
the region of greatest sensitivity are likely to depend on other parameter values.
15
-----------------------------------------
Insert Figures 6 and 7 about here
-----------------------------------------
It is important to emphasise that even though this model is well-understood in the case where only
the parameter θ is nonzero (since this is just the model G(n,p) with p = exp(θ)/[1+exp(θ)]), more
complex instantiations can be seen as models for self-organizing network processes (Robins,
Pattison & Woolcock, 2005). Robins et al have demonstrated that specific sets of parameter values
for the homogeneous Markov model can characterize very diverse network structures, including
small worlds, caveman worlds, long-path worlds, and so on.
For some parameter values, the model may accord very high probability to a small set of graphs and
very low probability to the rest, as Handcock (2004) and Snijders (2002) have demonstrated.
Handcock (2004) termed these models near-degenerate. For detailed investigation of the behaviour
of specific models, see Handcock as well as Park & Newman (2004) and Burda, Jurkiewicz and
Krzywicki (2004).
Model simulation
In order to understand properties such as near-degeneracy of the exponential random graph model
Pr(X = x) = exp(ΣAγAzA(x))/κ, it is helpful to be able to simulate it efficiently (that is, to draw
graphs x with probability Pr(X = x)), and this generally means circumventing the need to compute
the normalizing quantity κ, since κ is a function of all graphs in the distribution. As Strauss (1986)
and others have observed, the Metropolis algorithm can be used for this purpose. The algorithm
sets up a Markov chain on the space of all possible graphs of order n in such a way that the Markov
chain has the model as its stationary distribution. It may be described as follows:
1. begin with some graph x;
2. at each step, select an edge at random, say the (i,j) edge, and let x′ be the graph that is
identical to x except that the edge from i to j is switched to absent if it is present in x, or to
present if it is absent in x;
3. replace x by x′ with probability min[1,exp{ΣAγA(zA(x′) - zA(x)}];
4. return to step 2, unless some specified target number of steps has been taken.
To sample from Pr(X = x), it is usual to begin sampling graphs from the chain after some initial
number of steps has been completed (the burn-in period); graphs are then sampled at a rate that may
depend on n. Many variations of this approach may also be used; a valuable discussion may be
found in Snijders (2002).
16
Estimation of model parameters
In many settings, primary interest lies in estimating exponential random graph model parameters
from observed network data. For example, given the graph of Figure 1, there may be interest in
estimating the parameters of a Markov model from which it might have been generated. In the
early applications of these models to observed data, an approximate form of estimation known as
pseudo-likelihood estimation was often used (Strauss & Ikeda, 1990; Wasserman & Pattison, 1996)
even though the properties of the estimates were not well understood. Initial attempts to apply the
very promising approach of Markov Chain Monte Carlo maximum likelihood estimation
(MCMCMLE) were not always successful because the properties of models under consideration
were not always fully appreciated, as Snijders (2002) and Handcock (2004) demonstrated.
However, with a growing understanding of model properties, and more careful attention to model
adequacy, substantial progress has now been made in implementing MCMCMLE approaches (see
Snijders, 2002; Handcock et al, 2004).
3
The estimation was conducted using PNet (Wang, Robins & Pattison, 2006), an implementation of the estimation
approach in Snijders (2002) available at: http://www.sna.unimelb.edu.au/pnet/download.html.
17
Goodness of fit
A good statistical model should not be unnecessarily complex but it should be adequate: that is, the
data should resemble realizations from the model in many important respects. We can assess model
adequacy by comparing the observed network to graphs generated by the model in features that are
not necessarily parameterised within the model. What is important in such comparisons is very
much a function of the modeling context, but there are often good reasons to require that the model
captures the degree of clustering in a network, the distribution of degrees, and the connectivity
structure that is represented by the geodesic distribution (e.g., Goodreau, in press; Robins, Snijders,
Wang, Handcock & Pattson, in press). It is important to note that only some of these characteristics
need be associated with model parameters; others might be seen as consequences of these
parameterized tendencies.
Table 3 summarises these comparisons for the graph of Figure 1 and the parameter estimates of
Table 2. It can be seen that the t statistics are all less than 1 for:
• the local clustering coefficient (the average across all nodes i of the proportion of pairs
of nodes j and k incident with i (xij = 1 = xik) that are themselves connected (xjk =1);
• the global clustering coefficient (the proportion of the triples of nodes {i,j,k} with xij = 1
= xik for which xjk =1);
• the standard deviation of the degree distribution; and
• the skewness coefficient of the degree distribution.
The observed graph, in other words, exhibits levels of clustering and degree heterogeneity that fall
within the envelope of values expected for the model. The first, second and third quartiles of the
observed geodesic distribution are 1, 3 and 4, respectively; the median values for the distribution of
these quartiles across simulations were 2, 2 and 4, suggesting that the model is associated with
somewhat more homogeneous inter-node distances than the data. In Figure 8, the distributions of
the number of edges, 2-stars, 3-stars and triangles for the Markov random graph model with these
parameter values are shown. While the mean of each distribution is close to the observed value for
18
the Figure 1 graph, as expected, it can be seen that the distributions are positively skewed. Indeed,
Figure 7 shows the impact on one of these statistics, the number of triangles, of changing its
corresponding parameter value τ while holding all other parameters constant. It can be seen from
Figure 7 that the estimated value of 1.4438 is very close to the point of transition between low and
high values of the triangle statistic, and the positively skewed distribution of the triangle statistic is
consistent with the estimated value of τ being just below this point.
-----------------------------------------------
Insert Figure 8 and Table 3 about here
-----------------------------------------------
4
Hunter and Handcock (in press) proposed an alternative statistic based on geometrically weighted degree statistics; the
resulting model is equivalent provided that the edge parameter is included.
19
reproducing the standard deviation and the skewness coefficient for the degree distribution, not
surprisingly, it does a poor job in recovering network clustering. We present a better fitting model
below.
In some applications to date (eg Goodreau, in press; Snijders et al, 2006), a fixed value of λ (such as
2) has been assumed; Hunter and Handcock (2006) have shown that λ can be treated as a variable
within a curved exponential family model, and have developed an associated estimation method.
Realisation-dependent models
A critique of the Markov dependence assumption led Pattison and Robins (2002) to construct a
more general class of “realisation-dependent” network models. They argued that conditional
dependencies among tie variables may emerge from the network processes themselves, with new
dependencies created as network ties are generated. For instance, Xij and Xkl might become
conditionally dependent if there is an observed tie between, say, j and k. Baddeley and Möller
(1989) termed such models realisation-dependent.
Snijders et al showed that this assumption led to additional nonzero parameters in an exponential
random graph model, including those referring to collections of 2-paths with common starting
and ending nodes, and collections of triangles with a common base (see Figure 9). We define a
k-2-path to be a subgraph comprising two nodes, i and j, and a set of k paths of length 2 from i to
j through distinct intermediate nodes m1, m2, …, mk. A k-triangle is a subgraph comprising two
connected nodes, i and j, and a set of k paths of length 2 from i to j through distinct intermediate
nodes m1, m2, …, mk. If we let νk be the model parameter associated with a k-2-path and τk the
parameter associated with a k-triangle, we can entertain assumptions about the relationships
among related parameters (as in the case of k-stars earlier), namely:
νk+1 = -νk /λ, and
20
τk+1 = -τk /λ.
As for the star parameters, this is just an hypothesis, and its adequacy needs to be assessed.
Under this assumption, the statistics:
U[λ](x) = ∑k(-1)kUk(x)/λk-2
and
T[λ](x) = ∑k(-1)kTk(x)/λk-2
become single statistics associated with the parameters ν1 and τ1, respectively, where Uk(x) and
Tk(x) are the number of k-2-paths and k-triangles in the network x. It should be noted that the
value of λ need not be the same for each statistic; as before, Hunter and Handcock have shown
how to estimate these parameters.
The parameter estimates presented in Table 4 are for a model fitted to the mutual friendship
network of Figure 1. The positive τ1 estimate suggests that networks with relatively many
triangles are more likely, other statistics being equal, with the cumulative impact of multiple
triangles with a common base pair of nodes diminishing as the number of such triangles
increases. Likewise, the negative ν1 estimate suggests that networks with relatively few 2-paths
among a pair of non-connected nodes are more likely, other statistics being equal. Both of these
effects are consistent with a pressure towards closure for mutual friendship ties.
The goodness of fit for this model is summarised in Table 5. The median values of the quartiles
of the geodesic distribution for the random graph distribution simulated from Table 4 parameter
estimates are 2,3 and 5, respectively, suggesting better recovery of short distances than the
Markov model, though not of longer ones. Overall, the model of Table 4 appears to do a
reasonably good job of characterising the features of the mutual friendship network.
-------------------------------------------------------
Insert Figure 9 and Tables 4 and 5 about here
-------------------------------------------------------
21
Exogenous covariates
The general modeling framework can readily accommodate covariates at the node or dyad level
and, of course, if such covariates are regarded as important influences on network tie formation,
then they should be included in models for the network. For example, a general and systematic
approach to the inclusion of node-level covariates has been outlined by Robins, Elliott and
Pattison (2001), who extended the dependence graph formulation described earlier to include
directed dependence relationships from exogenous node-level variables to endogenous tie
variables. These developments offer an important means for exploring a wide range of
interesting interactions among actor-level and network tie variables, including homophily effects
(e.g., McPherson, Smith-Lovin & Cook, 2001) and the effects of spatial locations (e.g., Butts,
2003; Wong, Pattison & Robins, 2006).
Extensions
There are a variety of ways in which the models just described may be extended to incorporate
richer data forms, including multiple networks, longitudinal data, and changing node and tie sets.
In addition, some initial progress has been made on the problem of dealing with missing data. We
do not have space for a full account of these interesting and important developments, but point to
some key developments in each case.
The development of probabilistic models for graphs and directed graphs has been loosely shadowed
by the construction of a parallel, albeit generally later, set of models for multiple networks
measured on a common node set. Multivariate exponential random graph models are described by
Pattison and Wasserman (1999; see also Koehly & Pattison, 2005).
Although networks are often measured at a single point in time they are, in reality, dynamic entities
and there is considerable interest in the processes that underpin their evolution. An important line
of work has developed continuous-time Markov process models for network evolution (eg see
Snijders, 2001). More recently, this framework has been extended to accommodate the possibility
of co-evolutionary mechanisms by which network tie change depends on and contributes to change
in node attributes (Snijders, Steglich & Schweinberger, in press).
22
Concluding comments
The field of probabilistic network theory has progressed rapidly in the last ten years and, as
Goodreau (in press) and Robins, Snijders et al (in press) have cogently demonstrated, it is now
possible to build plausible models for many small and large social networks. Undoubtedly,
experience with the current generation of realisation-dependent network models will lead to further
improvements in model specification, and a clearer understanding of how the content of network
ties and the contexts in which they are observed might inform model building. Perhaps most
importantly, though, the field has now advanced to the point where the promise of a step-change in
our understanding of social processes on networks and their consequences might be realised.
23
References
Albert, R., & Barabási, A. (2002). Statistical mechanics of complex networks. Reviews of Modern
Physics, 74, 47-97.
Baddeley, A., & Möller, J. (1989). Nearest-neighbour Markov point processes and random sets.
International Statistical Review, 57, 89-121.
Bearman, P., Moody, J., & Stovel, K. (2004). Chains of affection: the structure of adolescent
romantic and sexual networks. American Journal of Sociology, 110, 44-91.
Besag, J. (1974). Spatial interaction and the statistical analysis of lattice systems (with discussion).
Journal of the Royal Statistical Society, Series B, 36, 96-127.
Bollobás, B. (1998). Modern graph theory. Springer
Bollobás, B. (1985). Random graphs. London: Academic
Brieger, R. (1981). Comment on “An exponential family of probability distributions for directed
graphs”. Journal of the American Statistical Association, 76, 51-53.
Burda, Z., Jurkiewicz, J., and Krzywicki, A. (2004). Network transitivity and matrix models,
Physical Review E, 69, 026106.
Butts, C. (2003). Predictability of large-scale spatially embedded networks. Pp. 313-323 in Breiger,
R. L., Carley, K. M., and Pattison, P. E. (Eds.), Social Network Models: Workshop Summary
and Papers. Washington, DC: National Academies Press.
Coleman, J. (1988). Social capital in the creation of human capital. American Journal of
Sociology, 1988, 94 (Supplement), S95-S120.
Erdös, P., & Rényi, A. (1959). On Random Graphs I. Publ. Math. Debrecen 6, 290–297.
Frank, O., & Strauss, D. (1986). Markov graphs. Journal of the American Statistical Association,
81, 832-842.
Freeman, L. (2004). The Development of Social Network Analysis: A Study in the Sociology of
Science. Vancouver, Canada: Booksurge Publishing
Goodreau, S. (in press). Advances in exponential random graph (p*) models. Social Networks.
Handcock, M. (2004). Assessing degeneracy in statistical models for social networks. Center for
Statistics in the Social Sciences Working Paper No. 39, University of Washington.
Handcock, M., Hunter, D., Butts, C., Goodreau, S., & Morris, M. (2004). Statnet manual for R.
Center for Statistics in the Social Sciences, University of Washington.
Handcock, M., Raftery, A. & Tantrum, J. (2005). Model-based clustering for social networks.
Center for Statistics in the Social Sciences Working Paper No. 46, University of
Washington.
24
Hoff, P., Raftery, A., & Handcock, M. (2002). Latent space approaches to social network analysis.
Journal of the American Statistical Association, 97, 1090-1098.
Holland, P., & Leinhardt, S. (1975). Local structure in social networks. In D. Heise (Ed.)
Sociological Methodology 1976 (pp. 1-45). San Francisco: Jossey-Bass.
Holland, P., & Leinhardt, S. (1981). An exponential family of probability distributions for directed
graphs. Journal of the American Statistical Association, 76, 33-50.
Hunter, D., & Handcock, M. (2006). Inference in curved exponential families for networks.
Journal of Computational and Graphical Statistics, 15, 565-583.
Koehly, L., & Pattison, P. (2005). Random graph models for social networks: multiple relations or
multiple raters. In P. Carrington, J. Scott, & S. Wasserman (Eds.), Models and methods in
social network analysis (Pp. 162-191). New York: Cambridge University Press.
Kretzschmar, M. & Morris, M. (1996). Measures of concurrency in networks and the spread of
infectious disease: the AIDS example. Social Science and Medicine, 21, 1203-1216.
Lauritzen, S. (1996). Graphical models. Oxford: Oxford University Press.
Lorrain, F., & White, H. (1971). Structural equivalence of individuals in social networks. Journal
of Mathematical Sociology, 1, 49-80.
McDonald, J. Smith, P., & Forster, J. (in press). Markov Chain Monte Carlo exact inference for
social networks. Social Networks.
McPherson M., Smith-Lovin, L., & Cook, J. (2001). Birds of a feather: homophily in social
networks. Annual Review of Sociology, 27, 415-444.
Milo, R., Shen-Orr, S., Itzkovitz, S., Kashtan, N., Chkovskii, D., & Alon, U (2002). Network
motifs: simple building blocks of complex networks. Science, 298, 824-827.
Nowicki, K., & Snijders, T. (2001). Estimation and prediction for stochastic blockstructures.
Journal of the American Statistical Association, 96, 1077-1087.
Park, J., & Newman, M. (2004). Solution of the 2-star model of a network. Physical Review E, 70,
066146.
Pattison, P., & Robins, G. (2002). Neighbourhood-based models for social networks. Sociological
Methodology, 32, 301-337.
Pattison, P., & Wasserman, S. (1999). Logit models and logistic regressions for social networks: II.
Multivariate relations. British Journal of Mathematical and Statistical Psychology, 52, 169-
193.
Pattison, P., Wasserman, S., Robins, G., & Kanfer, A. (2000). Statistical evaluation of algebraic
constraints for social networks. Journal of Mathematical Psychology, 44, 536-568.
Rapoport, A. (1957). Contributions to the theory of random and biased nets. Bulletin of
Mathematical Biophysics, 19, 257-277.
25
Rapoport, A., & Horvath, W. (1961). A study of a large sociogram. Behavioral Science, 6, 279-
291.
Robins, G., Elliott, P., & Pattison, P. (2001). Network models for social selection processes. Social
Networks, 23, 1-30.
Robins, G., Pattison, P., & Wang, P. (2006). Closure, connectivity and degrees: new specifications
for exponential random graph models for directed social networks. Unpublished manuscript.
University of Melbourne.
Robins, G., Pattison, P., & Woolcock, J. (2005). Small and other worlds: Global network structures
from local processes. American Journal of Sociology, 110, 894-936.
Robins, G., Snijders T., Wang, P., Handcock, M., & Pattison, P. (2006). Recent developments in
exponential random graph (p*) models. Social Networks. Forthcoming.
Schweinberger, M., & Snijders, T. (2003). Settings in social networks: a measurement model.
Sociological Methodology, 33, 307-341.
Snijders, T. (1991).Enumeration and simulation models for 0-1 matrices with given marginals.
Psychometrika, 56, 397-417.
Snijders, T. (2001). The Statistical Evaluation of Social Network Dynamics. In Sobel. M., &
Becker, M. (Eds), Sociological Methodology 2001 (pp. 361-395). Boston and London: Basil
Blackwell.
Snijders, T. (2002). Markov Chain Monte Carlo estimation of exponential random graph models.
Journal of Social Structure, 3(2).
Snijders, T., Pattison, P., Robins, G., & Handock, M. (2006). New specifications for exponential
random graph models. Sociological Methodology, 36, 99-153.
Snijders, T., Steglich, C., & Schweinberger, M. (2006). Modeling the co-evolution of networks and
behavior. In van Montfort, K., Oud, H. & Satorra, A. (Eds.), Longitudinal models in the
behavioral and related sciences. Hillsdale, NJ: Lawrence Erlbaum.
Strauss, D. (1986). On a general class of models for interaction. SIAM Review, 28, 513-527.
Strauss, D., & Ikeda, M. (1990). Pseudolikelihood estimation for social networks. Journal of the
American Statistical Association, 85, 204-212.
Wang, P., Robins, G., & Pattison, P. (2006). PNet: Program for the estimation and simulation of
p*exponential random graph models, User Manual. Department of Psychology, University
of Melbourne.
Wasserman, S., & Faust, K. (1994). Social network analysis: methods and applications. New York:
Cambridge University Press.
Wasserman, S., & Pattison, P. (1996). Logit models and logistic regressions for social networks, I. An
introduction to Markov graphs and p*. Psychometrika, 61, 401-425.
26
Wong, L., Pattison, P. & Robins, G. (2006). A spatial model for social networks. Physica A: Statistical
Mechanics and its Applications, 360, 99-120.
27
Table 1. Adjacency matrix for graph of Figure 1
-------------------------------------------
1 1 1 1 1 1 1 1
1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7
-------------------------------------------
1 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 1
2 0 0 0 0 0 0 1 1 1 1 1 0 0 0 0 0 0
3 0 0 0 1 0 0 0 1 0 1 1 1 0 0 0 0 0
4 0 0 1 0 0 0 0 1 1 0 1 0 0 0 0 0 0
5 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0
6 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
7 0 1 0 0 0 0 0 1 1 1 1 1 0 0 0 0 0
8 0 1 1 1 0 0 1 0 1 1 1 0 0 0 0 0 0
9 0 1 0 1 0 0 1 1 0 1 1 0 0 0 0 0 0
10 0 1 1 0 0 0 1 1 1 0 1 0 0 1 0 0 0
11 0 1 1 1 0 0 1 1 1 1 0 0 0 0 0 0 0
12 0 0 1 0 1 0 1 0 0 0 0 0 0 0 0 1 1
13 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0
14 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0
15 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
16 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1
17 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0
-------------------------------------------
28
Table 2. MCMCMLEs for Markov model of the graph of Figure 1
-------------------------------------------------
Parameter estimate s.e. t
-------------------------------------------------
edge -1.2558 1.3561 0.047
2-star -0.0451 0.3551 0.060
3-star -0.1084 0.0974 0.073
triangle 1.4438 0.2073 0.058
-------------------------------------------------
Table 3. Goodness of fit for Markov model for the mutual friendship network (Figure 1)
-------------------------------------------------------------------------------
Simulated
Observed Mean std dev t
-------------------------------------------------------------------------------
edges 34 34.07 8.91 -0.0077
2-stars 139 138.35 77.45 0.0084
3-stars 181 178.05 156.49 0.0189
triangles 30 29.34 28.39 0.0231
std dev degrees 2.09 1.71 0.46 0.8296
skew degrees 0.08 -0.26 0.62 0.5480
global clustering: 0.65 0.54 0.21 0.5246
mean local clustering 0.64 0.47 0.19 0.4917
variance local clustering 0.14 0.10 0.04 0.9899
--------------------------------------------------------------------------------
Table 4. MCMCMLEs for realization-dependent exponential random graph model for the mutual
friendship network (Figure 1)
------------------------------------------------
Parameter estimate s.e. t
------------------------------------------------
edge -0.0354 1.7851 -0.037
2-star -0.0520 0.1094 -0.038
k-star 0.0674 0.8689 -0.040
k-triangles 0.7250 0.3159 -0.043
k-2-paths -0.5583 0.1727 -0.025
------------------------------------------------
Table 5. Goodness of fit of realisation-dependent model for the mutual friendship network
Simulated
Statistic Observed Mean std dev t
-------------------------------------------------------------------------
Edges 34 34.44 9.33 -0.047
2-stars 139 145.13 94.54 -0.065
k-stars 77.8 79.73 34.68 -0.054
k-triangles 46.0 47.01 28.39 0.023
k-2-paths 83.3 85.30 30.68 -0.064
Std Dev degrees 2.09 1.77 0.60 0.522
Skew degrees 0.08 -0.17 0.46 -0.548
Global Clustering: 0.65 0.55 0.14 0.696
Mean Local Clustering 0.64 0.50 0.15 0.439
Variance Local Clustering 0.14 0.09 0.04 1.207
-------------------------------------------------------------------------
29
Figure 1. A graph on 17 nodes (mutual friendship network)
30
Figure 2. A directed graph on 37 nodes (reported collaboration network)
31
Degree distribution for graph of Figure 1
3
Frequency
Mean =4.00
Std. Dev. =2.15058
0 N =17
0.00 2.00 4.00 6.00 8.00
degree
32
Figure 5. Dependence graph for a Markov random graph on 4 nodes
k
nodes
33
150
125
100
triangles
75
50
25
. . . . . . . . . . . . . . . . . . . 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.
05 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 00 05 10 15 20 25 30 35 40 45 50 55 60 65 70
tau
__
Figure 7. Boxplots for the triangle statistic as a function of the triangle parameter for a Markov
model on a graph of 17 nodes (θ = -1.2558, σ2 = -0.0451, σ3 = -0.1084, τ between 0.00 and 1.75)
34
Figure 8. Distribution of edge, 2-star, 3-star and triangle statistics in the Markov random graph
distribution with parameters: θ = -1.2558, σ2 = -0.0451, σ3 = -0.1084, τ = 1.4438.
35
k nodes
k nodes
k nodes
36