Professional Documents
Culture Documents
The FASEB Journal - 2001 - VOHRADSKY - Neural Network Model of Gene Expression
The FASEB Journal - 2001 - VOHRADSKY - Neural Network Model of Gene Expression
JIŘÍ VOHRADSKÝ
Institute of Microbiology, CAS,142 20 Prague, Czech Republic
ABSTRACT Many natural processes consist of net- genes regulate the transcription of cell genes leading to
works of interacting elements that, over time, affect a new state. The new state is defined by the concentra-
each other’s state. Their dynamics depend on the tion of currently expressed gene products, and so on.
pattern of connections and the updating rules for each This means that the state of transcription and transla-
element. Genomic regulatory networks are networks of tion in the cell is continuously updated from one state
this sort. In this paper we use artificial neural networks to another in a feed-forward-like circuit, where the
as a model of the dynamics of gene expression. The current state depends on the previous one, which
significance of the regulatory effect of one gene prod- depends on the preceding one, etc.
uct on the expression of other genes of the system is This approach can be formalized as a concept of
defined by a weight matrix. The model considers ‘genetic network’ where all genes and their products
multigenic regulation including positive and/or nega- participating in one regulatory event are members of a
tive feedback. The process of gene expression is de- network. The transcriptional state of a gene is defined,
scribed by a single network and by two linked networks in principle, by the states of all other genes in the
where transcription and translation are modeled inde- network (Fig. 1). This model can be simplified by
pendently. Each of these processes is described by defining the expression of genes as either on or off {0,1}
different network controlled by different weight matri- (1–3). The response of a gene in the model is then
ces. Methods for computing the parameters of the defined by Boolean rules that combine the states of
model from experimental data are discussed. Results expression of genes controlling the particular gene
computed by means of the model are compared with (e.g., gene A is expressed if the formula C AND (B OR
experimental observations. Generalization to a ‘black D) for genes B, C, D is true). When the Boolean rules
box’ concept, where the molecular processes occurring for all members of the model are known, the terminal
in the cell are considered as signal processing units state of the network can be computed. The control of
forming a global regulatory network, is discussed.— gene expression can thus be simplified to wiring be-
Vohradský, J. Neural network model of gene expres- tween genes and a set of Boolean rules associated with
sion. FASEB J. 15, 846 – 854 (2001) the synapses (using neural network terminology) con-
necting the nodes. The synapses and the rules define
Key Words: genetic network 䡠 regulation of gene expression the next state of the network. From the theory of
䡠 mathematical modeling random Boolean networks, it follows that the final state
of the network (the state after a finite number of state
updates) is defined by either a point attractor or a basin
The information for constructing and maintaining attraction (1, 4). A point attractor defines a stable state
the molecular components of a living cell lies within its that does not change with the update of the network.
genes. Genes directly encode proteins that make up the The basin of attraction represents a set comprising a
cell and directly or indirectly participate in the signal- limited number of regularly changing states (limited
ing and control of processes necessary for life. The cycle). Several methods to identify the network from
genes are transcribed to RNAs, which are translated to experimental data have been suggested (3, 5).
proteins. This Boolean simplification ignores the genes that
Transcriptional regulation is conferred through the
have different regulatory effects depending on their
action of gene products that bind to each gene’s
level of expression. Furthermore, these networks can-
transcriptional start site. These proteins bind directly to
not address those genes that influence the transcrip-
DNA and influence gene expression by altering the
tion of various genes to different degrees. The model
activity of transcriptional machinery. The activity of
has therefore been generalized to incorporate different
transcription factors is controlled by other gene prod-
levels of expression of regulatory genes and different
ucts. Thus, the transcription of a particular gene can be
degrees of the regulatory effect on a given gene.
viewed as a combinatorial action of products of other
D’Haesleer (6) suggested a linear model of gene con-
genes or even of its own gene product. Once the
trol where the expression level of a gene is defined by
expressed gene is translated into a functional gene
product, it affects the state of the cell and may directly
or indirectly influence the expression of other genes. 1
Correspondence: Institute of Microbiology, CAS,
In this way the expression state of the cell is controlled Videňská 1083, CZ-14220 Prague, Czech Republic. E-mail:
by a set of genes expressed at a given moment. These vohr@biomed.cas.cz
冘 w y ⫹ b 兲兴其
Figure 1. Formal description of transcriptional regulation by
⫺1
a neural network. Each node of the network represents one g i ⫽ 兵1 ⫹ exp关⫺共 ij j i (2)
gene; the connection between nodes represents the regula- j
tory actions of one gene on another. In principle, all genes
can control all others (fully connected network); in reality, where bi represents an external input that can be
only a few genes control the expression of one particular interpreted as a reaction delay parameter. High nega-
gene. The state of the network is updated in a stepwise
manner; the state of gene expression at time ti is determined tive and/or positive values of b result in a low influence
by the state at time ti-1. of factors given in the weight matrix. The actual rate of
expression is then modulated by a multiplicative con-
stant k1i, which represents the maximal expression rate
of a given gene.
the linear combination of expression levels of other Let the rate of expression of gene i be given by the
genes weighted by a value reflecting the level of their regulatory effects of other genes i and the effect of
regulatory effect. The weight matrix then summarizes degradation
the interactions among all members of the set. The
methods for computing the weight matrix from exper- dz i /dt ⫽ i ⫺ x i (3)
imentally measured expression time series are pre- The degradation effect xi is modeled by the kinetic
sented. The authors also discuss limits of the approach equation of a first order chemical reaction xi ⫽ k2i 䡠 zi ,
based on a linear model that oversimplifies the expres- and i represents the regulatory effect reflected in a
sion kinetics. Weaver et al. (7) extended this approach variable gi (i ⫽ k1i gi ). The constant k2 represents the
by introducing a nonlinear ‘squashing function’ modi- rate constant of degradation of the gene product i and
fying the output from the linear combination of k1 is its maximal rate of expression. The whole model
weights and expression levels of the regulatory genes. has the form:
The additive principle for updating the state of the
dz i 1
冘
network was kept. The method of reconstruction of the
⫽ k 1i ⫺ k 2i z i (4)
weight matrix from experimental data was also ana- dt 1 ⫹ exp关⫺共 w ij y j ⫹ b i 兲兴
lyzed. Genetic networks have also been modeled by a j
set of differential equations (8).
Equation 4 represents a special case of a class of
Each such model should be able not only to fit the
recurrent neural networks that can be described by the
experimentally measured expression time series ‘pro-
general formula (9)
files’, but also to correctly describe the underlying
process. As the weight matrix has to be derived from
the experimental data according to a certain model, an i
dy i
dt
⫽ fi 共 冘w y ⫺兲⫺y
j
ij j i i (5)
invalid model will produce an invalid matrix and hence
an invalid reconstruction of the regulatory connec- where fi is a nonlinear transfer function, w is the weight
tions. Unfortunately, little (if any) experimental data matrix, and is an external input to the node i. These
exist that would allow the model to be tested. Never- types of networks have been used as associative memo-
theless, rapid developments in DNA chip technology ries and as models of brain activity, and their dynamics
and quantitative proteomics that can provide such data have been thoroughly studied. The output of the
promise to fill this gap. In the current situation, the network at time t depends on the weight matrix w. If
model can be tested only on simplified artificial exam- the matrix is symmetric (wij ⫽ wji) the network reaches
ples where the output can be estimated. a stationary state in finite time. If the connection weight
matrix is far from being symmetric, the state transition to the reaction kinetics of the transcriptional/transla-
of the network can lead to a point attractor (stationary tional machinery of gene B. This is also valid for the
state) or can become oscillatory (limit cycle) or even B/C pair. Therefore, an increased concentration of the
chaotic, depending on the complexity of the network. A product will be followed by a delayed increase of B
and, with another delay, by an increase of C. This
scheme will be maintained until the concentration of C
RESULTS product is high enough to block the synthesis of A,
reflected in a decrease of the concentration of A, since
Gene regulation with positive and negative feedback all products are continuously being degraded. In turn,
a decreasing A directly causes a decrease of B and
Let us consider a simple model situation published in consequently of C. If the concentration of A or B is
ref 1 (Fig. 2). Here the product of gene A controls the close to 0, the whole process is terminated. If not, after
expression of gene B, which initiates the expression of the decrease of C, A is synthesized again; the whole
gene C. Gene B induces the expression of gene A cycle starts up again and is repeated ad infinitum.
forming positive feedback. Gene C in turn negatively This is a qualitative analysis of the behavior of the
controls the expression of gene A, forming negative model of expression shown in Fig. 2. We have recon-
feedback. This is a simple feedback circuit that can be structed the response of the network from Fig. 2. using
characterized by a network and a weight matrix (Fig. 2). the following weight matrix
冋 册
Considering only one isolated circuit, the state of the 0 10 ⫺10
network can reach two attractors, depending on the
w ij ⫽ 10 0 0 (6)
initial conditions: a point attractor, where the action of
0 10 0
the network is stopped; or a two-point attractor perpet-
ually oscillating between two states. qualitatively equivalent to the Boolean weight matrix
In the case of multiple copies of the circuit, the from Fig. 2, and solved the system of equations numer-
reaction of the system depends not only on the state of ically. The results are shown in Fig. 3.
the genes (on/off), but also on the concentration of The parameter b (Eq. 2) represents the delay with
each compound controlling the process. It is evident which the transcription/translation process of one
that with increasing concentration of the product of A, gene reacts to the regulatory impulse from other gene
expression of B will increase with a delay proportional products. High negative values mean a long delay. If all
Figure 3. Expression patterns of genes A (solid line), B (dashed), C (dotted) for different values of the reaction delay parameter
b. a, c) The reaction delay parameter b is the same for all members of the system; expression stops after some time at constant
levels. b) If expression of gene A is faster, the system terminates in an oscillatory pattern. Example shown in Fig. 2 was used to
calculate the kinetic profiles.
the expression of other genes of the system (Fig. 5) The time series of the amounts of protein will then
forming a feedback circuit. follow that of the mRNA (Fig. 6).
Equations 7 and 8 form a ‘two-compartment’ model We can consider that the rate-limiting step is not only
of gene expression. The two networks are connected by constrained by the mRNA concentration, but by one or
one synapse formed by mRNAi. Both networks can be more of the processes of the translational machinery.
evaluated from experimentally measured mRNA and In this case, the elements of the weight matrix wp gain
protein concentrations. Equations 7 and 8 can be non-zero values. In principle, the time series of mRNA
written for all n members of the regulatory process and protein can differ. This has been observed previ-
forming a system of 2n differential equations. ously and recently shown experimentally (13, 14). The
The translation of an mRNA to the corresponding simulation of such situation is shown in Fig. 7.
protein is a several-step process. It has been considered
that the components of the system are available in Reconstruction of the network architecture from
excess. Therefore, the rate of protein accumulation is experimental data
determined only by the amount of mRNA. The weight
matrix thus becomes zero and equation 8 simplifies to:
The network architecture is uniquely determined by
dp i the number of nodes and the wiring between the
⫽ k 1pi 䡠 r i ⫺ k 2pi 䡠 p i (9)
dt nodes. The wiring is defined by the weight matrix. The
Figure 7. Two-compartment model with translational control. a) mRNA levels; b) protein levels. Synthesis of protein A (solid
line) is enhanced by assigning a positive value to the weight matrix member wp11. The degradation of protein B was increased
by increasing the value of the decay parameter of protein B. All other parameters were kept unchanged. As a result, protein and
mRNA levels differ. A periodic pattern was reached under current combination of parameters. The model situation of Fig. 2 was
used to compute the profiles (solid line, A; dashed, B; dotted, C).