Science:, 1373 (2014) Justin Ruths and Derek Ruths
Science:, 1373 (2014) Justin Ruths and Derek Ruths
If you wish to distribute this article to others, you can order high-quality copies for your
colleagues, clients, or customers by clicking here.
Permission to republish or repurpose articles or portions of articles can be obtained by
following the guidelines here.
Science (print ISSN 0036-8075; online ISSN 1095-9203) is published weekly, except the last week in December, by the
American Association for the Advancement of Science, 1200 New York Avenue NW, Washington, DC 20005. Copyright
2014 by the American Association for the Advancement of Science; all rights reserved. The title Science is a
registered trademark of AAAS.
REPORTS
text). Some of these dilations may arise due to a
Control Profiles of Complex Networks surplus of sink nodes (Nt > Ns). In this case
we call the dilations due to surplus sink nodes,
Justin Ruths1* and Derek Ruths2 Ne = max(0, Nt – Ns), external dilation points (see
the branching at node E to F and G in Fig. 1). The
Studying the control properties of complex networks provides insight into how designers and remaining stem dilations give rise to a class of
engineers can influence these systems to achieve a desired behavior. Topology of a network has controls due to more nuanced network structures,
been shown to strongly correlate with certain control properties; here we uncover the fundamental called internal dilation points, Ni (see the branch-
structures that explain the basis of this correlation. We develop the control profile, a statistic ing at node A to nodes B and C in Fig. 1).
that quantifies the different proportions of control-inducing structures present in a network. We Taken together, the minimum number of in-
find that standard random network models do not reproduce the kinds of control profiles that dependent controls (Nc) required for full control
are observed in real-world networks. The profiles of real networks form three well-defined clusters of a complex network is, then, the sum of the
that provide insight into the high-level organization and function of complex systems. number of source nodes, external dilation points,
and internal dilation points,
any natural complex systems are im- paths, called stems, each driven by an indepen-
with Ns, Ne, and Ni not all zero (4, 9). One way to
(1)
epidemics, economic collapse, and social unrest, dictate the need for controls: There must be one calculate Nc directly is by using a maximum match-
has generated considerable interest in being able control for each stem in the system (7). ing algorithm, which, while accurate, is expensive
to control complex systems modeled as networks. An immediate question, then, is how many for large networks (see supplementary text).
Previous work has already established the con- stems exist in the system. The location of stems is With this understanding, we can examine the
nection between control theory and complex largely dictated by two topological network fea- breakdown of the origin of controls in terms of
networks (1–3). Controlling a complex directed tures: sources, which are nodes with edges only amounts of source nodes, external dilation points,
network reduces to (i) the selection of specific pointing away from them (e.g., node A in Fig. 1), and internal dilation points for different classes
nodes that, by virtue of being directly controlled, and sinks, which are nodes with edges only point- of real-world networks, as well as for widely used
can indirectly drive the rest of the network to any ing toward them (e.g., nodes F and G in Fig. 1). A generative network models (see supplementary text
arbitrary state and (ii) the prescription of time- source, because it does not have edges entering it, for the complete listing). To make direct compar-
varying control inputs that, when applied to these can only appear at the origin of a stem and there- isons across networks of different size, in the fol-
nodes, will drive the system to a desired state. fore must be directly controlled. Similarly, a sink, lowing discussion we normalize by the number
Work to date has focused on the number of lacking edges to influence other nodes, must lie of nodes, N, i.e., (ns, ne, ni) = (Ns /N, Ne /N, Ni /N)
controls and their attachment points required by a at the end of a stem. The number of stems in a and nc = Nc /N.
complex network (4). Existing studies have shown complex system, then, is at least max(Ns, Nt), where For any network, ncse = ns + ne can be com-
how the mathematics of structural controllability Ns and Nt are the number of sources and sinks, re- puted directly from the network’s degree dis-
can be used to identify a minimum number of spectively. Without lack of generality, we consider tribution, namely, the sum of the fraction of nodes
controls that can drive the system as a whole to here a graph with no disconnected nodes (8). with in-degree zero and out-degree zero. This ex-
any arbitrary state. A main outcome of past work In situations where a path must branch into plains why Liu et al. found that the degree dis-
revealed that the degree distribution of a network two or more paths in order to reach all nodes, a tribution of a network correlates so strongly with
alone (i.e., the probability distribution over the dilation occurs: Additional stems must be intro- the number of controls (1). Our findings show
number of links that nodes have) is correlated duced at these branching points, each one re- that knowing the full degree distribution is often
with the minimum number of controls (1). quiring a separate control (see supplementary unnecessary since, on average, the number of con-
Understanding the control properties of a com-
plex system requires not just knowing how many
controls are needed, but also characterizing the
functional origin of each control, an explanatory
detail not provided by the degree distribution cor-
relation. For example, a biochemical system and a
financial system might require the same number
of controls, but the structures within the networks
that give rise to these controls could be quite differ-
ent. Ultimately, planning and implementing a control
scheme on a network will depend greatly on how
these control points are situated in the network.
The formulation of control that we employ,
structural controllability, formalizes the idea that,
in the absence of a feedback loop (or, more gen-
erally, a cycle), a given node can control at most
one of its immediate neighbors (5). Thus, the struc- Fig. 1. An example of (left) a simple directed network and (right) one of the possible minimum
ture of the propagation of control influence through control structures, constructed of stems (blue) and cycles (purple and green). There is one source
the network consists of a backbone of directed (node A) and two sinks (nodes F and G). Three controls are required arising from one source (node A), one
external dilation (due to two sink nodes F and G), and one internal dilation (due to the A-B-C-D topology).
1
Engineering Systems and Design, Singapore University of The dashed lines indicate how control signals enter the network. For control 1, the same signal is sent to
Technology and Design, Singapore. 2Department of Computer nodes A and K. Controls 2 and 3 send different signals to nodes B and F, respectively. Gray arrows
Science, McGill University, Montreal, Quebec, Canada. indicate edges that are not involved in the control structure of the network. See the supplementary text for
*Corresponding author. E-mail: justinruths@sutd.edu.sg additional details regarding the determination of the control structure and its various components.
maximum matching algorithm (see supplementary Real networks have control profiles that are
text), and Ns and Ne are obtained from analysis vastly different from the synthetic networks con-
of node degrees, we can trivially solve for Ni. Our sidered (see Fig. 3). This is related to the obser-
goal is no longer to estimate Nc; hence, defining Ni vation above—that existing generation methods
as a function of Ns, Ne, and Nc is not circular be- are biased toward source-based controls. How-
0
ER BA LA DD real
Fig. 2. Source and sink nodes dominate the
correlation between degree distribution and the
number of required controls. The error in the ap-
real
proximation of the fraction of required controls is only
slightly improved by incorporating the nonzero terms neural (4) social (8) copurchase (4) corp. ownership (1) airport (2) autonomous (1)
of the degree distribution (ncdeg = Ncdeg/N is the
fraction of required controls after a degree-preserving
shuffle; ncse = ns + ne is the fraction of source and
sink nodes in the network). The mean improvement ER BA messaging (2) p2p (9) circuit (3) food web (22)
in approximation error for synthetic networks—Erdos-
Renyi (ER), Barabasi-Albert (BA), local attachment (LA),
and duplication-divergence (DD)—and real networks
is indicated by the difference in bar heights. Synthetic
synthetic LA DD social influence (6) transcription (2) www+blog (4)
networks were generated from a comprehensive set of
parameters with network sizes, N, ranging from 100 to Fig. 3. Real and synthetic networks have substantially different control profiles. The coloring of
50,000; average degree, L/N, ranging from 2 to 58; and breakout subfigures indicates the clustering observed in various types of real networks. Deeper shades of
for DD networks, a probability to retain copied edges the heatmap indicate a greater density of networks with control profiles located in that region. Networks
from 0.1 to 0.9 (see supplementary text). The error bars with Ns = Nt = Ni = 0 are omitted from this figure (9). The aggregated heatmaps are normalized to avoid
represent one standard deviation from the mean ap- biases induced by over- and underrepresented types of networks (see supplementary text for details).
proximation error. Numbers in parentheses indicate numbers of networks present in each plot.
Fossilized Nuclei and Chromosomes of geologic time (8). To date, evidence for evo-
lutionary conservatism in fern genomes has been
exclusively based on studies of extant plants
Reveal 180 Million Years of (9, 10). Here, we present direct paleontological
evidence for long-term genomic stasis in this
Genomic Stasis in Royal Ferns family in the form of a calcified osmundaceous
rhizome from the Lower Jurassic of Sweden with
pristinely preserved cellular contents, including
Benjamin Bomfleur,1* Stephen McLoughlin,1* Vivi Vajda2 nuclei and chromosomes.
The specimen was collected from mafic vol-
Rapidly permineralized fossils can provide exceptional insights into the evolution of life over geological caniclastic rocks [informally named the “Djupadal
time. Here, we present an exquisitely preserved, calcified stem of a royal fern (Osmundaceae) formation” (11)] at Korsaröd near Höör, Scania,
from Early Jurassic lahar deposits of Sweden in which authigenic mineral precipitation from Sweden [fig. S1 of (12)]. Palynological analysis in-
hydrothermal brines occurred so rapidly that it preserved cytoplasm, cytosol granules, nuclei, and even dicates an Early Jurassic (Pliensbachian) age for
chromosomes in various stages of cell division. Morphometric parameters of interphase nuclei match these deposits (11) (fig. S2), which agrees with
those of extant Osmundaceae, indicating that the genome size of these reputed “living fossils” has radiometric dates obtained from nearby volcanic
remained unchanged over at least 180 million years—a paramount example of evolutionary stasis. necks (13) from which the basaltic debris originated.
The fern rhizome was permineralized in vivo by
oyal ferns (Osmundaceae) are a basal mundaceous rhizomes from the Mesozoic are calcite from hydrothermal brines (11, 14) that per-