You are on page 1of 18

IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO.

4, JULY 2007 1049

Neural Network Adaptive Output Feedback Control


for Intensive Care Unit Sedation
and Intraoperative Anesthesia
Wassim M. Haddad, Senior Member, IEEE, James M. Bailey, Tomohisa Hayakawa, Member, IEEE, and
Naira Hovakimyan, Senior Member, IEEE

Abstract—The potential applications of neural adaptive control and surgery. However, there remain barriers to the application
for pharmacology, in general, and anesthesia and critical care of modern control theory and technology to medicine. These
unit medicine, in particular, are clearly apparent. Specifically, barriers include system uncertainties, inherent to biology, that
monitoring and controlling the depth of anesthesia in surgery
is of particular importance. Nonnegative and compartmental limit mathematical modeling and the ability to apply many of
models provide a broad framework for biological and physio- the tools of modern control technology. An additional imped-
logical systems, including clinical pharmacology, and are well iment is the relatively limited communication between control
suited for developing models for closed-loop control of drug engineers and the medical community. One area of medicine
administration. In this paper, we develop a neural adaptive output most suited for applications of control technology is clinical
feedback control framework for nonlinear uncertain nonnegative
and compartmental systems with nonnegative control inputs. The pharmacology in which mathematical modeling has had a
proposed framework is Lyapunov-based and guarantees ultimate predominant role [1], [2]. This discipline includes pharmacoki-
boundedness of the error signals. In addition, the neural adaptive netics, the relationship between drug dose and the resultant
controller guarantees that the physical system states remain in tissue concentrations as a function of time, and pharmacody-
the nonnegative orthant of the state space. Finally, the proposed namics, the relationship between tissue concentration and drug
approach is used to control the infusion of the anesthetic drug
propofol for maintaining a desired constant level of depth of effect. The mathematical foundation for pharmacokinetic and
anesthesia for noncardiac surgery. pharmacodynamic modeling is nonnegative and compartmental
dynamical systems theory [2]–[7] and so it is easy to see why
Index Terms—Adaptive control, automated anesthesia, bispec-
tral index (BIS), dynamic output feedback, electroencephalog- control theory and technology may have much to offer clinical
raphy, neural networks, nonlinear nonnegative systems, nonnega- pharmacology.
tive control, set-point regulation. Nonnegative and compartmental models provide a broad
framework for biological and physiological systems, including
clinical pharmacology, and are well suited for the problem
I. INTRODUCTION of closed-loop control of drug administration. Specifically,
nonnegative and compartmental dynamical systems [2]–[7]

C ONTROL engineering has impacted almost every aspect


of modern life, with applications ranging from relatively
simple systems, such as thermostats and automotive cruise
are composed of homogeneous interconnected subsystems (or
compartments) which exchange variable nonnegative quantities
of material with conservation laws describing transfer, accu-
control, to highly complex systems, such as advanced tactical mulation, and elimination between the compartments and the
fighter aircraft and variable-cycle gas turbine aeroengines. environment. It thus follows from physical considerations that
Control technology has also had an impact on modern medicine the state trajectory of such systems remains in the nonnegative
in areas such as robotic surgery, electrophysiological systems orthant of the state space for nonnegative initial conditions [7].
(pacemakers and automatic implantable defibrillators), life sup- Thus, it is not surprising that nonnegative and compartmental
port (ventilators, artificial hearts), and image-guided therapy models have wide applicability in biology and medicine, and
their use in the specific field of clinical pharmacology is partic-
Manuscript received December 1, 2005; revised September 20, 2006; ularly noteworthy [8].
accepted February 5, 2007. This work was supported in part by the National
Science Foundation under Grant ECS-0601311 and by the U.S. Air Force Office It has long been appreciated that if one can identify a physio-
of Scientific Research under Grants FA9550-06-1-0240 and F49620-03-1-0443. logical variable that quantifies therapeutic drug effect and mea-
W. M. Haddad is with the School of Aerospace Engineering, Georgia Institute
sures it continuously in real time, it should be possible to utilize
of Technology, Atlanta, GA 30332-0150 USA (e-mail: wm.haddad@aerospace.
gatech.edu). feedback (closed-loop) control, implemented with a computer,
J. M. Bailey is with the Department of Anesthesiology, Northeast Georgia to maintain this variable, and hence the drug effect, at the desired
Medical Center, Gainsville, GA 30503 USA (e-mail: james.bailey@nghs.com).
T. Hayakawa is with the Department of Mechanical and Environmental
value. By their very nature, cardiovascular and central nervous
Informatics, Tokyo Institute of Technology, Tokyo 332-0012, Japan (e-mail: functions are critical in the acute care environment, such as the
hayakawa@mei.titech.ac.jp). operating room or the intensive care unit (ICU), and hence, ma-
N. Hovakimyan is with the Department of Aerospace and Ocean Engineering, ture technologies have evolved for their quantitative assessment.
Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
(e-mail: nhovakim@vt.edu). For this reason, the primary application of closed-loop control
Digital Object Identifier 10.1109/TNN.2007.899164 of drug administration has been to hemodynamic management
1045-9227/$25.00 © 2007 IEEE
1050 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

and, more recently, to control of levels of consciousness. As an variability among patients in the drug concentration at the
example of the former application, the technology for the con- locus of effect (the effect site) that results from a given dose
tinuous measurement of blood pressure has been available for (pharmacokinetic variability) and there is also substantial vari-
some time and systems have been developed to maintain blood ability in the effect of a given concentration (pharmacodynamic
pressure at the desired value in the operating room and ICU [9], variability). Furthermore, there may be pharmacokinetic and
[10]. pharmacodynamic variability in the same patient at different
The primary interest of our research has been in closed-loop points in time. PID controllers may be overly simplistic in the
control of the level of consciousness. While there is direct face of this variability. More recently, Struys and his colleagues
application of this research to the clinical task of operating have described an adaptive controller of the delivery of propofol
room anesthesia, an application with potentially more impact anesthesia [16]. Clinical testing demonstrated acceptable per-
is closed-loop control of level of consciousness of patients formance, although as pointed out by Glass and Rampil [17],
in ICUs. Mechanical ventilation is uncomfortable and causes this may have been because the system was not highly stressed.
anxiety in the patient. Appropriate sedation of mechanically In [16], a relatively high dose of the opioid remifentanil, a neu-
ventilated ICU patients is very challenging. The common rotransmitter inhibitor resulting in significant analgesic effect,
clinical scenario is a patient who is either undersedated and was administered in conjunction to propofol. Consequently,
“fighting the ventilator” or is oversedated, with concomitant central nervous system excitation due to surgical stimulus
sequelae such as hypotension or prolonged emergence from was blunted and, thus, the need to adjust the propofol dose as
sedation when it is deemed appropriate to begin weaning the surgical stimulus was diminished. It is unknown whether the
patient from the ventilator. We can envision a future in which control system would have been effective in the absence of
infusion pumps contain a small computer chip that allow the deep narcotization.
ICU nurse to connect the patient to the monitor of level of The algorithms of Schwilden et al. [12] and Struys et al. [16]
consciousness, feed the signal into the chip, which then drives were based on specific pharmacokinetic and pharmacodynamic
the pump to titrate intravenous hypnotic drugs to the appro- models with fixed pharmacokinetic or pharmacodynamic
priate level of sedation. Given the shortage of experienced ICU parameters, respectively. However, there is a great deal of
nurses, we believe this would be a significant advance, allowing uncertainty about model selection and pharmacokinetic and
clinicians to focus on tasks other than titrating sedation. pharmacodynamic parameter selection; models which are
In contrast to the control of hemodynamic variables, progress appropriate for one patient may not be appropriate for another
in the closed-loop control of anesthesia or sedation has been patient. Model-independent control algorithms may be more
more difficult to attain. This has been primarily due to the lack useful. Given the complexity, uncertainties, and nonlinearities
of a method for measuring the depth of anesthesia. Ever since inherent in pharmacokinetic and pharmacodynamic models
the pioneering work of Bickford [11], it has been known that needed to capture the wide effects of pharmacological agents
the electroencephalograph (EEG) changes with the induction and anesthetics in the human body, neural networks can provide
of anesthesia. However, the EEG is a complex of multiple time an ideal framework for developing adaptive controllers for
series and it is difficult to identify one component of the EEG clinical pharmacology. Neural networks consist of a weighted
which correlates with the clinical signs of anesthesia. Schwilden interconnection of fundamental elements called neurons, which
and his colleagues developed and clinically tested a closed-loop are functions consisting of a summing junction and a nonlinear
controller for the delivery of intravenous anesthesia using the operation involving an activation function [18]. One of the
median frequency of the EEG power spectrum as a performance primary reasons for the large interest in neural networks is
variable [12]. Their system did not appear to offer any advantage their ability to approximate a large class of continuous non-
over standard manual dosing control. It was unclear whether this linear maps from the collective action of simple, autonomous
was due to the algorithm or the deficiencies of the median EEG processing units interconnected in simple ways. In addition,
frequency as a measure of a depth of anesthesia. Subsequent to neural networks have attracted attention due to their inherently
this early research, there has been substantial progress in the de- parallel and highly redundant processing architecture that
velopment of processed EEG monitors that analyze the raw data makes it possible to develop parallel weight update laws. This
to extract a single measure of the depth of anesthesia. The best parallelism makes it possible to effectively update a neural
known of these monitors is the bispectral index (BIS) monitor. network online. These properties make neural networks a
The BIS is a single composite EEG measure that is well viable paradigm for adaptive system identification and control
correlated with the depth of anesthesia [13], [14]. The BIS in clinical pharmacology.
signal ranges from 0 (no cerebral electrical activity) to 100 In this paper, we extend the results of [19] and [20] to nonneg-
(the normal awake state). All available evidence indicates ative and compartmental dynamical systems with applications
that a BIS signal less than 60 is associated with lack of con- to the specific problem of automated anesthesia. Specifically, we
sciousness. With the commercial availability of a more reliable develop an output feedback neural network adaptive controller
measure of consciousness, Absalom et al. developed a pro- that operates over a tapped delay line (TDL) of available input
portional–integral–derivative (PID) controller for the delivery and output measurements. The neuroadaptive laws for the neural
of the intravenous agent propofol using the BIS signal as a network weights are constructed using a linear observer for the
performance variable [15]. This system exhibited undesirable nominal normal form system error dynamics. The approach is
oscillations inappropriate for clinical use. The most likely applicable to a general class of nonlinear nonnegative dynam-
cause was the need for adaptive control. There is substantial ical systems without imposing a strict positive real requirement
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1051

on the transfer function of the linear error normal form dy- Hermitian matrix, for a vector norm, for the Frobe-
namics [21]. Furthermore, since in pharmacological applica- nius matrix norm, and for the Fréchet derivative of at
tions involving active drug administration control inputs as well . Finally, denotes the Kronecker product of matrices
as the system states need to be nonnegative, the proposed neu- and . The following definition introduces the notion of a
roadaptive output feedback controller also guarantees that the nonnegative (respectively, positive) function.
control signal as well as the physiological system states remain Definition 2.1: Let . A real function
nonnegative. We emphasize that the proposed framework ad- is a nonnegative (respectively, positive) function if
dresses adaptive output feedback controllers for nonlinear non- (respectively, ) on the interval .
negative and compartmental systems with unmodeled dynamics Definition 2.2 introduces the notion of essentially nonnega-
of unknown dimension while guaranteeing ultimate bounded- tive vector fields [7], [25].
ness of the error signals corresponding to the physical system Definition 2.2: Let , where
states as well as the neural network weighting gains. Output is an open subset of that contains . Then, is essentially
feedback controllers are crucial in clinical pharmacology since nonnegative if , for all , and such
key physiological (state) variables cannot be measured in prac- that , where denotes the th element of .
tice or in real time. Note that if , where , then is essen-
The contents of the paper are as follows. In Section II, we tially nonnegative if and only if is essentially nonnegative,
provide mathematical preliminaries on nonlinear nonnegative that is, , , where denotes
dynamical systems that are necessary for developing the main the th entry of .
results of this paper. In Section III, we develop Lyapunov-like In this paper, we consider controlled nonlinear dynamical
theorems for partial boundedness and partial ultimate bounded- systems of the form
ness for nonlinear dynamical systems necessary for obtaining
less conservative ultimate bounds for neuroadaptive con- (1)
trollers as compared to ultimate bounds derived using classical
where , , , ,
boundedness and ultimate boundedness notions. In Section IV,
is locally Lipschitz continuous and satisfies ,
we present our main neuroadaptive output feedback control
is continuous, and is piecewise
framework for adaptive set-point regulation of nonlinear un-
continuous.
certain nonnegative and compartmental dynamical systems.
Definition 2.3 and Proposition 2.1 are needed for the main
In Section V, we develop a new and novel nonlinear pharma-
results of this paper.
cokinetic and pharmacodynamic model for the intravenous
Definition 2.3: The nonlinear dynamical system given by (1)
anesthetic drug propofol. To demonstrate the efficacy of the
is nonnegative if for every and , ,
proposed neuroadaptive control framework, we apply our
the solution , , to (1) is nonnegative.
framework to control the infusion of propofol for maintaining
Proposition 2.1 ([7]): The nonlinear dynamical system given
a desired constant level of depth of anesthesia for noncardiac
by (1) is nonnegative if is essentially nonnegative
surgery. In Section VI, we validate our neuroadaptive feedback
and , .
control algorithm for clinical pharmacology via a series of clin-
It follows from Proposition 2.1 that a nonnegative input signal
ical evaluation trials. The clinical trials demonstrate excellent
, , is sufficient to guarantee the nonnegativity
regulation of unconsciousness allowing for a safe and effective
of the state of (1).
administration of the anesthetic agent propofol. Finally, in
Section VII, we draw some conclusions. III. PARTIAL BOUNDEDNESS AND PARTIAL
ULTIMATE BOUNDEDNESS
II. MATHEMATICAL PRELIMINARIES
In this section, we present Lyapunov-like theorems for par-
In this section, we introduce notation, several definitions, and tial boundedness and partial ultimate boundedness of nonlinear
some key results concerning nonlinear nonnegative dynamical dynamical systems. These notions allow us to develop less con-
systems [7], [22]–[24] that are necessary for developing the servative ultimate bounds for neuroadaptive controllers as com-
main results of this paper. Specifically, for , we write pared to ultimate bounds derived using classical boundedness
(respectively, ) to indicate that every compo- and ultimate boundedness notions [26]. Specifically, consider
nent of is nonnegative (respectively, positive). In this case, the nonlinear autonomous interconnected dynamical system
we say that is nonnegative or positive, respectively. Likewise,
is nonnegative1 or positive if every entry of is non- (2)
negative or positive, respectively, which is written as (3)
or , respectively. Let and denote the nonnegative
and positive orthants of , that is, if , then and where , is an open set such that ,
are equivalent, respectively, to and . , is such that, for every ,
Furthermore, we write to denote transpose, for the and is locally Lipschitz continuous in
trace operator, to denote the minimum eigenvalue of a , is such that, for every ,
1In this paper, it is important to distinguish between a square nonnegative
is locally Lipschitz continuous in , and
(respectively, positive) matrix and a nonnegative–definite (respectively, posi- , , is the maximal interval of
tive–definite) matrix. existence for the solution , , to (2)
1052 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

and (3). Note that under the aforementioned assumptions the Proof: First, let and assume . If
solution to (2) and (3) exists and is unique over , , then it follows from (4) that
. For Definition 3.1, we assume that . , . Alternatively, if there exists
Definition 3.1: such that , then it follows from the continuity
1) The nonlinear dynamical system (2) and (3) is bounded of that there exists such that and
with respect to uniformly in if there exists , . Hence, it follows from (4) and (5) that
such that, for every , there exists
such that implies , .
The nonlinear dynamical system (2) and (3) is globally
bounded with respect to uniformly in if, for every (7)
, there exists such that
implies , . which implies that . Next, let and
2) The nonlinear dynamical system (2) and (3) is ultimately assume and . Now, for every such
bounded with respect to uniformly in with ultimate that , , it follows from (4) and (5) that
bound if there exists such that, for every ,
there exists such that implies
, . The nonlinear dynamical system (2)
and (3) is globally ultimately bounded with respect to which implies that , . Next, if
uniformly in with ultimate bound if, for every there exists such that , then it follows as
, there exists such that in the proof of the first case given previously that
implies , . , . Hence, if , then
Note that if a nonlinear dynamical system is (globally) , . Finally, if and is a class
bounded with respect to uniformly in , then there exists function it follows that is a class function, and hence,
such that it is (globally) ultimately bounded with respect . Hence, the nonlinear dynamical system (2) and (3) is
to uniformly in with an ultimate bound . Conversely, if globally bounded with respect to uniformly in .
a nonlinear dynamical system is (globally) ultimately bounded Theorem 3.2: Consider the nonlinear dynamical system
with respect to uniformly in with an ultimate bound (2) and (3). Assume there exist a continuously differentiable
, then it is (globally) bounded with respect to uniformly function and class functions and
in . The following results present Lyapunov-like theorems such that (4) holds. Furthermore, assume that there exists
for partial boundedness and partial ultimate boundedness. For a continuous, positive–definite function such that
these results, define , where , , and
and
is a given continuously differentiable function. Furthermore, let
, , , denote the open ball centered at with (8)
radius , let denote the closure of , and recall the
definitions of class , class , and class functions [26]. where is such that for some
Theorem 3.1: Consider the nonlinear dynamical system (2) . Then, the nonlinear dynamical system (2) and (3) is ul-
and (3). Assume there exist a continuously differentiable func- timately bounded with respect to uniformly in with ulti-
tion and class functions and such mate bound . Furthermore,
that . If, in addition, and is a class func-
tion, then the nonlinear dynamical system (2) and (3) is globally
ultimately bounded with respect to uniformly in with ul-
(4) timate bound .
(5) Proof: First, let and assume .
As in the proof of Theorem 3.1, it follows that
where is such that for some . , . Next, let , where
Then, the nonlinear dynamical system (2) and (3) is bounded and assume
with respect to uniformly in . Furthermore, for every and . In this case, it follows from Theorem
, implies that , where 3.1 that , . Suppose, ad ab-
surdum, , , or, equivalently,
(6)
, . Since
is compact, is continuous, and ,
and . If, in addition, , it follows from Weierstrass’ the-
and is a class function, then the nonlinear orem [27, p. 154] that exists. Hence,
dynamical system (2) and (3) is globally bounded with respect it follows from (8) that:
to uniformly in , and for every , ,
, where is given by (6) with . (9)
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1053

which implies that tive output function and satisfies . We assume that ,
, and are smooth (at least mappings) and the control
(10) input in (12) is restricted to the class of admissible controls
consisting of measurable functions such that , .
Now, letting it follows that , which As discussed in Section I, control (source) inputs of drug de-
is a contradiction. Hence, there exists such livery systems for physiological and pharmacological processes
that . Thus, it follows from Theorem 3.1 are usually constrained to be nonnegative as are the system
that , , which states. Hence, in this paper, we develop neuroadaptive dynamic
proves that the nonlinear dynamical system (2) and (3) is ulti- output feedback control laws for nonnegative systems with
mately bounded with respect to uniformly in with ulti- nonnegative control inputs. Specifically, for a given desired set
mate bound . Furthermore, point and for a given , our aim is to design a
. Finally, if and is a class function nonnegative control input , , predicated on the system
it follows that is a class function, and hence, . measurement , , such that for all
Hence, the nonlinear dynamical system (2) and (3) is globally , where , and , , for all
ultimately bounded with respect to uniformly in with ul- .
timate bound . In this paper, we assume that for the nonlinear dynamical sys-
The following result on ultimate boundedness of intercon- tems (12) and (13), the conditions for the existence of a glob-
nected systems is needed for the main theorem in this paper. For ally defined diffeomorphism transforming (12) and (13) into a
this result, recall the definition of input-to-state stability given normal form [28], [29] are satisfied. Specifically, we assume that
in [26]. there exist a global diffeomorphism and func-
Proposition 3.1: Consider the nonlinear interconnected dy- tions and such
namical system (2) and (3). If (3) is input-to-state stable with that, in the coordinates
viewed as the input and (2) and (3) is ultimately bounded with
respect to uniformly in , then the solution , (14)
, of the interconnected dynamical system (2) and (3) is
ultimately bounded.
Proof: Since (2) and (3) is ultimately bounded with respect where ;
to (uniformly in ), there exist positive constants and , denotes the th derivative of
such that , . Furthermore, since , denotes the relative degree of with respect to the output
(3) is input-to-state stable with viewed as the input, it follows , , and is the (vector) relative
that is finite, and hence, there exist a class function degree of , the nonlinear dynamical system given by (12)
and a class function such that and (13) is equivalent to

(15)
(11) (16)
(17)
which proves that the solution , , to (2) and
(3) is ultimately bounded. where , , ,

IV. NEURAL ADAPTIVE OUTPUT FEEDBACK CONTROL FOR (18)


NONLINEAR NONNEGATIVE UNCERTAIN SYSTEMS
In this section, we consider the problem of characterizing (19)
neural adaptive dynamic output feedback control laws for non-
linear nonnegative and compartmental uncertain dynamical sys-
, is a known matrix of 0s and
tems to achieve set-point regulation in the nonnegative orthant.
1s capturing the multivariable controllable canonical form rep-
Specifically, consider the controlled square nonlinear uncertain
resentation [30], is such that is asymptotically
dynamical system given by
stable, is an unknown function and satisfies
(12) , is a known matrix of 0s and 1s capturing
the system output, and is an unknown ma-
(13)
trix function such that , . Furthermore, we
where , , is the state vector, , , assume that for a given there exist and
is the control input, , , is the system output, such that and
is essentially nonnegative but otherwise unknown
and satisfies , is an unknown non- (20)
negative input matrix function, and is a nonnega- (21)
1054 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

where and is given with , present values of the system outputs and system inputs, and their
, and , . delayed values given by
To ensure that for a bounded state , , the dynamics
given by (16) are bounded, we assume that (16) is input-to-state
stable at with viewed as the input; that is,
there exist a class function and a class function
such that

(25)

(22) where .
For the statement of our main result, define the projection
operator given by
where denotes the Euclidean vector norm. Unless otherwise
stated, henceforth we use to denote the Euclidean vector
norm. Note that is an equilibrium point of
(15) and (16) if and only if there exists such that (20) if
and (21) hold. if
and (26)
Finally, we assume that the functions
and , where otherwise
, can be approximated over a compact set
by a linear in the parameters neural network up to a where , ,
desired accuracy. In this case, there exist and , is the norm bound imposed on ,
such that and ,
and . Note that for a given matrix and
, where and , and
, it follows:

(23)

(24)
(27)
for all , where and are
optimal unknown (constant) weights that minimize the approx- where denotes the th column of the matrix .
imation errors over , and Assumption 4.1: For a given , assume there exist
are sets of basis functions such that each component of nonnegative vectors and such that
and takes values between 0 and 1, and and are
the modeling errors. Note that denotes the total number (28)
of basis functions or, equivalently, the number of nodes of the (29)
neural network.
Since and are continuous, we can choose and Furthermore, assume that the equilibrium point of (12) is
from a linear space of continuous functions that forms globally asymptotically stable with . Finally, assume
an algebra and separates points in . In this case, it follows that there exists a global diffeomorphism such
from the Stone–Weierstrass theorem [27, p. 212] that is a that can be transformed into the normal form given by (15)
dense subset of the set of continuous functions on . Now, as and (16), and (16) is input-to-state stable at with
is the case in the standard neuroadaptive control literature [18], viewed as the input.
we can construct the signal , Consider the neural adaptive output feedback control law
where , involving given by
the estimates of the optimal weights and basis functions as our
adaptive control signal. if
(30)
In order to develop an output feedback neural network, we otherwise
use the recent approach developed in [31] for reconstructing
the system states via the system delayed inputs and outputs. where
Specifically, we use a memory unit as a particular form of a
tapped delay line that takes a scalar time-series input and pro-
vides an -dimensional vector output consisting of the (31)
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1055

is nonsingular, , , is given by (25), , , , are norm bounds im-


and are sets of basis functions such posed on , and and are the posi-
that each component of and takes values between 0 tive–definite solutions of the Lyapunov equations (34) and
and 1, , , and , .
Here, the update laws satisfy (41)

Furthermore, , , and , , for all


.
(32) Proof: The proof is given in the Appendix.
A block diagram showing the neuroadaptive control architec-
ture given in Theorem 4.1 is shown in Fig. 1. It is important to
(33) note that the existence of a global neural network approximator
for an uncertain nonlinear map using the system outputs and in-
where and are positive–definite ma- puts, and its delayed values cannot, in general, be established. In
trices, is a positive–definite solution of the Lyapunov the proof of Theorem 4.1, as is common in the neural network
equation literature, we assume that for a given arbitrarily large compact
set , there exists an approximator for the unknown non-
(34) linear map up to a desired accuracy. This assumption ensures
that in the error space defined in the Appendix, there exists at
and , , is the solution to the estimator dynamics least one Lyapunov level set such that the set inclusions invoked
in the proof of Theorem 4.1 are satisfied. In the case where
and are continuous on , it follows from the Stone–Weier-
strass theorem that and can be approximated over an
(35) arbitrarily large compact set in the sense of (23) and (24).
(36) Furthermore, as shown in [31], if the system dynamics evolve
on a bounded set, then there exist functions and such
where , , is given by (19), that , , are of order and tend to zero as
is such that is asymptotically stable, and . In this case, one recovers the actual bounds and
. in the neural network approximation associated with (23) and
Theorem 4.1: Consider the nonlinear uncertain dynamical (24). Finally, we assume that is sufficiently close to the
system given by (12) and (13) with , , given by (30). optimal weight so that is
Assume Assumption 4.1 holds. Then, there exists a compact nonsingular and bounded for all so that there exists
positively invariant set such that , .
such that , where and Implementing the neuroadaptive controller (31) requires a
, and the solution , fixed-point iteration at each integration step, that is, the con-
, of the closed-loop system given by (12), (30), troller contains an algebraic constraint on . For each choice of
(32), (33), (35), and (36) is ultimately bounded for all and , this equation must be examined for solvability
with ultimate bound in terms of . It is more practical to avoid this iteration by using
, , where one-step delayed values of in calculating . Implementations
using both approaches result in imperceptible differences in our
numerical studies.
If the dynamical system has an unknown dimension but
with known relative degree, then Theorem 4.1 applies with a
slight modification to the input vector of the neural network,
that is, in (25) should be replaced by a sufficiently large
value that is greater than the largest possible system dimension.
Furthermore, if the system is exponentially minimum phase
(37) at with in (17) and constant control input ,
then the neural adaptive output feedback controller can be
significantly simplified. Specifically, the control law (30) with
(38)
and update law

(42)
(39)
guarantee ultimate boundedness of the closed-loop system. In
this case, the update law (33) and the estimator dynamics (35)
(40) and (36) are superfluous. For further details, see [21].
1056 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

Fig. 1. Block diagram of the closed-loop system.

Fig. 2. Pharmacokinetic model for drug distribution during anesthesia.

In Theorem 4.1 we assumed that the equilibrium point of can result in patients psychologically traumatized by pain and
(12) is globally asymptotically stable with . In gen- awareness during surgery. Thus, control of drug effect is clini-
eral, however, unlike linear nonnegative systems with asymp- cally important since overdosing or underdosing incur risk for
totically stable plant dynamics, a given set point the patient.
for the nonlinear nonnegative dynamical system (12) may not To illustrate the application of the neuroadaptive control
be asymptotically stabilizable with a constant control framework presented in Section IV for general anesthesia we
. However, if is homogeneous, cooperative, that develop a model for the intravenous anesthetic propofol. The
is, the Jacobian matrix is essentially nonnegative for pharmacokinetics of propofol are described by the three-com-
all , the Jacobian matrix is irreducible for all partment model [7], [33] shown in Fig. 2, where denotes the
[23], and the zero solution of the undisturbed mass of drug in the central compartment, which is the site for
system (12) is globally asymptotically stable, then drug administration and is generally thought to be comprised
the set point satisfying (20) and (21) is a unique equi- of the intravascular blood volume (blood within arteries and
librium point with and is also asymptotically stable veins) as well as highly perfused organs (organs with high
for all [32]. This implies that the solution ratios of blood flow to weight) such as the heart, brain, kidney,
to (12) with is asymptotically stable for all . and liver. These organs receive a large fraction of the cardiac
output. The remainder of the drug in the body is assumed to
V. NEUROADAPTIVE OUTPUT FEEDBACK CONTROL reside in two peripheral compartments, one identified with
FOR GENERAL ANESTHESIA muscle and one with fat; the masses in these compartments
Almost all anesthetics are myocardial depressants, meaning are denoted by and , respectively. These compartments
that they decrease the strength of the contraction of the heart receive less than 20% of the cardiac output.
and lower cardiac output, that is, the volume of blood pumped A mass balance of the three-state compartmental model
by the heart per unit time. As a consequence, decreased cardiac yields
output slows down redistribution kinetics, that is, the transfer of
blood from the central compartments (heart, brain, kidney, and
liver) to the peripheral compartments (muscle and fat). In ad-
dition, decreased cardiac output could increase drug concentra- (43)
tions in the central compartments, causing even more myocar-
dial depression and further decrease in cardiac output. This in-
(44)
stability can lead to overdosing that, at the very least, can delay
recovery from anesthesia and, in the worst case, can result in res-
piratory and cardiovascular collapse. Alternatively, underdosing (45)
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1057

where , is the volume of the central compart- levels of consciousness, and hence, is amenable for feedback
ment (about 15 l for a 70-kg patient), , , is the rate of control in general anesthesia.
transfer of drug from the th compartment to the th compart- The BIS, an EEG indicator, has been proposed as a measure
ment, is the rate of drug metabolism and elimination (me- of anesthetic effect. This index quantifies the nonlinear rela-
tabolism typically occurs in the liver), and , , is the in- tionships between the component frequencies in the electroen-
fusion rate of the anesthetic drug propofol into the central com- cephalogram, as well as analyzing their phase and amplitude.
partment. The transfer coefficients are assumed to be functions The BIS signal is related to drug concentration by the empirical
of the drug concentration since it is well known that the phar- relationship
macokinetics of propofol are influenced by cardiac output [34]
and, in turn, cardiac output is influenced by propofol plasma BIS BIS (47)
concentrations, both due to venodilation (pooling of blood in
dilated vains) [35] and myocardial depression [36]. where BIS denotes the baseline (awake state) value and, by
Experimental data indicate that the transfer coefficients convention, is typically assigned a value of 100, is the
are nonincreasing functions of the propofol concentration [35], propofol concentration in g/ml in the effect-site compartment
[36]. The most widely used empirical models for pharmaco- (brain), is the concentration at half maximal effect and
dynamic concentration-effect relationships are modifications of represents the patient’s sensitivity to the drug, and determines
the Hill equation [37]. Applying this almost ubiquitous empir- the degree of nonlinearity in (47). Here, the effect-site com-
ical model to the relationship between transfer coefficients im- partment is introduced to account for finite equilibration time
plies that between the central compartment concentration and the central
nervous system concentration [44].
The effect-site compartment concentration is related to the
concentration in the central compartment by the first-order
(46) model [44]
where, for , , is the drug concentra-
tion associated with a 50% decrease in the transfer coefficient,
is a parameter that determines the steepness of the concen- (48)
tration-effect relationship, and are positive constants. Note where in min is a time constant. In reality, the effect-
that both pharmacokinetic parameters are functions of and , site compartment equilibrates with the central compartment in
that is, there are distinct Hill equations for each transfer coeffi- a matter of a few minutes. The parameters , , and are
cient. Furthermore, since for many drugs the rate of metabolism determined by data fitting and vary from patient to patient. BIS
is proportional to the rate of transport of drug to the liver index values of 0 and 100 correspond, respectively, to an iso-
we assume that is also proportional to the cardiac output electric EEG signal (no cerebral electrical activity) and an EEG
so that . signal of a fully conscious patient; the range between 40 and 60
To illustrate the neuroadaptive control of propofol, we as- indicates a moderate hypnotic state [42]. Fig. 3 shows the com-
sume that and are independent of and . Also, since bined pharmacokinetic/pharmacodynamic model for the distri-
decreases in cardiac output are observed at clinically utilized bution of propofol.
propofol concentrations we arbitrarily assign a value of 4 For set-point regulation, define , where
g/ml since this value is in the midrange of clinically utilized is the set point satisfying the equilibrium condition for
values. We also assign a value of 3 [38]. This value is within (43)–(45) and (48) with , , ,
the typical range of those observed for ligand-receptor binding , and , so that
(see the discussion in [39]). The nonnegative transfer and loss is given by
coefficients , , , , and , and the parameters
, , and are uncertain due to patient
gender, weight, preexisting disease, age, and concomitant med-
ication. Hence, the need for adaptive control to regulate intra-
venous anesthetics during surgery is essential.
Although propofol concentration in the blood is correlated
with lack of responsiveness, and presumably consciousness (49)
[40], the concentration cannot be measured in real time during
surgery. Since we are more interested in drug effect (depth of
hypnosis) rather than drug concentration, we consider a model (50)
involving pharmacokinetics (drug concentration as a function
of time) and pharmacodynamics (drug effect as a function of
concentration) for controlling consciousness. Specifically, we (51)
use an electroencephalogram (EEG) signal as a measure of (52)
drug effect of anesthetic compounds on the brain [41]–[43].
Since electroencephalography provides real-time monitoring of where . The existence of this equilibrium point
the central nervous system activity, it can be used to quantify follows from the fact that the Jacobian of (43)–(45) and (48) is
1058 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

Fig. 3. Combined pharmacokinetic/pharmacodynamic model.

essentially nonnegative and every solution of (43)–(45) and (48)


is bounded [6]. Next, linearizing about 0 and computing
the eigenvalues of the resulting Jacobian matrix, it can be shown
that is asymptotically stable.
In the following, simulation involving the infusion of the
anesthetic drug propofol, we set g/ml, ,
and BIS , so that the BIS signal is shown in Fig. 4. The
target (desired) BIS value, BIS , is set at 50. Now, using
the neuroadaptive output feedback controller

(53)

where

(54)
Fig. 4. BIS index versus effect site concentration.
BIS BIS (55)

, , with update laws where and , it follows from Theorem 4.1


that there exist positive constants and such that BIS
, , for all nonnegative values of the phar-
macokinetic transfer and loss coefficients , , , ,
(56) and , as well as all nonnegative coefficients , , and .
A flowchart for the neuroadaptive control algorithm is shown in
Fig. 5.
For our simulation, we assume (0.228 l/kg)( kg),
(57) where 70 kg is the mass of the patient,
min , min , min ,
where and are positive constants and , min , min ,
, is the solution to the estimator dynamics min , , and g/ml [33],
[38]. Note that the parameter values for and probably
exaggerate the effect of propofol on cardiac output. They
BIS BIS have been selected to accentuate nonlinearity but they are
not biologically unrealistic. Furthermore, to illustrate the
(58)
proposed neuroadaptive controller, we switch the pharmaco-
(59) dynamic parameters and , respectively, from 5.6 g/ml
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1059

Fig. 6. Compartmental masses versus time.

Fig. 7. Concentrations in the central and effect site compartments versus time.

Fig. 5. Flowchart for the neuroadaptive control algorithm.


For our simulation, we used

and 2.39 to 7.2 g/ml and 3.39 at 15 min and back to


5.6 g/ml and 2.39 at 30 min. Here, we consider noncar-
diac surgery since cardiac surgery often utilizes hypothermia
which itself changes the BIS signal. With ,
, , g/min ,
g/min , , and initial con-
ditions 0 g, 0 g/ml,
, , and
, Fig. 6 shows the masses of propofol in the
three compartments versus time. Fig. 7 shows the concentra-
tions in the central and effect-site compartments versus time.
Note that the effect-site compartment equilibrates with the
central compartment in a matter of a few minutes. Fig. 8 shows
the compensator states versus time. Finally, Fig. 9 shows the
BIS index and the control signal (propofol infusion rate) versus
(60)
time.
1060 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

loop control of anesthesia [12], [16]. However, the neuroadap-


tive controller [see (53)–(59)] does not account for time delays
due to the proprietary signal-averaging algorithm within the BIS
monitor. The algorithm also ignores signal noise and the fact
that when there is excessive noise the BIS monitor does not gen-
erate a signal, although this failure is usually brief. Only exten-
sive clinical testing will indicate the significance of these as-
sumptions and approximations and the direction of future algo-
rithm development. Given the clinical observation that there is
often a substantial delay between observed changes in patient
status and a change in the BIS signal, other measures of depth
of anesthesia may be needed [45].

VI. CLINICAL EVALUATION TRIALS


We have begun clinical trials with the neuroadaptive con-
troller shown in (53)–(59) at the Northeast Georgia Medical
Center, Gainesville, GA. In initial clinical testing, we imple-
Fig. 8. Compensator states versus time. mented (53)–(59) using a Dell Latitude C610 laptop computer
with a Pentium (R) III processor running under Windows XP,
an Aspect A 2000 BIS monitor (rev 3.23), and a Harvard PHD
2000 programmable research pump. The BIS monitor sends a
data stream, which is updated every 5 s. This data stream con-
tains the BIS signal as well as other parameters such as date,
time, signal quality indicator, raw EEG information, and elec-
tromyographic data. The data are sent to the serial port of the
laptop computer.
The infusion rate for the controller is calculated by em-
ploying a forward Euler method to update the neural network
weights and every 0.5 s, using the BIS signal. The
infusion rate is communicated to the infusion pump using a 9600
b/min, eight data bits, two stop bits, and zero parity protocol
with the aid of a USB-serial port adaptor. An updated infusion
rate is sent to the pump every 1 s. Pharmacokinetic simulations
predict that a pump-update every 5 s or less is adequate in the
context of the algorithm under evaluation. An update interval
of 1 s was selected in anticipation that future algorithms might
benefit from the faster update rate.
Fig. 9. BIS versus time and control signal (infusion rate) versus time. The neuroadaptive control algorithm was programmed in
Java, an object-oriented programming language chosen for
its multiplatform portability tools for rapid prototyping. The
where , , and 15 mg/min. Even program is organized into five modules, namely, bisloader,
though we did not calculate the analytical bounds given by (37) bislogger, controller, pumplogger, and pumploader. Bisloader
due to the fact that one has to solve an optimization problem and bislogger handle communication between the BIS monitor
with respect to (23) and (24) to obtain and , , and the computer, while pumploader and pumplogger manage
the closed-loop BIS signal response shown in Fig. 9 is clearly the Harvard pump apparatus. The module bisloader finds the
acceptable. Furthermore, the basis functions for , , serial port that receives the BIS signal by using the Java class
are chosen to cover the domain of interest of our pharmacoki- CommPort identifier, and then invokes bislogger. Bislogger
netic/pharmacodynamic problem since we know that the BIS uses the Java class SerialPort EventListener to read the signal,
index varies from 0 to 100. Hence, the basis functions are and uses the class StringTokenizer to parse the BIS signal
distributed over that domain. The number of basis functions, from the input stream. The infusion rate is calculated by the
however, is based on trial and error. This goes back to the controller module. Finally, pumploader opens the serial port
Stone–Weierstrass theorem which only provides an existence communicating to the pump and establishes the communication
result without any constructive guidelines. Finally, we note that protocol, while pumplogger delivers the infusion rate to the
simulations using a larger number of neurons resulted in im- pump.
perceptible differences in the closed-loop system performance. The protocol for clinical evaluation of the system was ap-
The neuroadaptive control algorithm [see (53)–(59)] does not proved by the Institutional Review Board of Northeast Georgia
require knowledge of the pharmacokinetic and pharmacody- Medical Center. Patients are enrolled after giving informed
namic parameters, in contrast to previous algorithms for closed- consent. Our protocol excludes patients requiring emergency
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1061

Fig. 10. Controlled BIS index versus time for the neuroadaptive controller.

Fig. 11. Infusion rate versus time for the neuroadaptive controller.
surgery, pediatric patients, hemodynamically unstable patients,
and patients for whom we anticipate difficult airway manage-
for adaptive set-point regulation of nonlinear uncertain nonneg-
ment. Otherwise, all elective surgical patients who can provide
ative and compartmental systems. Using Lyapunov methods,
informed consent are candidates. Preoperative management,
the proposed framework was shown to guarantee ultimate
including administration of antianxiolytic drugs, is left to the
boundedness of the error signals corresponding to the physical
discretion of the attending anesthesiologist. Propofol is deliv-
system states and the neural network weighting gains, while
ered using the BIS-computer-pump system with a target value
additionally guaranteeing the nonnegativity of the closed-loop
of 50. In addition to propofol, all patients receive infusions
system states associated with the plant dynamics.
of either sufentanil or fentanyl with loading doses of 2 or
A nonlinear four-compartment pharmacokinetic and phar-
0.25 g/ml and continuous infusions of 2 or 0.25 ( g/ml)/h,
macodynamic patient model for the disposition of anesthetic
respectively, to provide analgesia. To ensure patient safety, an
drug propofol was developed, and the proposed neuroadaptive
independent anesthesia provider observes the progress of the
control framework was used to monitor and control a desired
study and can terminate the study if it appears that the patient’s
constant level of consciousness for noncardiac surgery. In
safety is being jeopardized by either overdosing or underdosing
addition, we validated our neuroadaptive feedback control
of propofol.
algorithm via a series of clinical evaluation trials. The clinical
To date, we have performed seven clinical trials with the neu-
trials demonstrated excellent regulation of unconsciousness
roadaptive contoller. For the last clinical trial, Figs. 10 and 11
allowing for a safe and effective administration of the anes-
show the controlled BIS index and the control signal versus
thetic agent propofol. Even though measurement noise was
time of the adaptive and neuroadaptive controller, respectively.
not addressed in our framework, it should be noted that EEG
These results are typical of all seven clinical trials. In all our
signals may have as much as 10% variation due to noise. In par-
design studies, no initial propofol dose is administered by the
ticular, the BIS signal may be corrupted by electromyographic
anesthesiologist. Induction of anesthesia is delivered by the neu-
noise, that is, signals emanating from muscle rather than the
roadaptive controller. In some of the clinical trials, we have ob-
central nervous system. Although electromyographic noise can
served a slight overshoot at the induction of anesthesia with the
be minimized by muscle paralysis, there are other sources of
BIS signal dropping below the target. We suspect that the ini-
measurement noise, such as electrocautery and x-ray, that need
tial target overshoot stems from the signal averaging time delays
to be accounted for within the control design process. This is
within the BIS monitor. In addition, the BIS signal and propofol
evident in Figs. 10 and 11. Future work will involve extensions
infusion rate oscillations are due to measurement noise in the
to adaptive output feedback controllers with measurement noise
BIS signal.
and disturbance rejection guarantees.
While our objective in this paper was to develop a con-
VII. CONCLUSION trol design methodology for ICU sedation and intraoperative
anesthesia, the results of this paper can have implications for
Nonnegative and compartmental dynamical systems are other uses of closed-loop control of drug delivery. There are
widely used to capture system dynamics involving the inter- numerous potential applications such as control of glucose,
change of mass and energy between homogenous subsystems heart rate, blood pressure, etc. Finally, we note that the range
or compartments. Thus, it is not surprising that nonnegative of applications of the proposed controller design framework
and compartmental models are remarkably effective in de- is not limited to biological and medical systems. Specifically,
scribing the dynamical behavior of biological, physiological, epidemiology, ecology, economic systems, telecommunications
and pharmacological systems. In this paper, we developed a systems, transportation systems, and power systems, to cite but
neural adaptive dynamic output feedback control framework a few examples, all lead to nonnegative dynamical systems.
1062 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

APPENDIX To show ultimate boundedness of the closed-loop system


To prove Theorem 4.1, first, define (32), (33), and (62)–(64), consider the Lyapunov-like function

if (61)
otherwise (65)

Next, defining , , and where and satisfy (34) and (41), respectively.
, and using (15)–(21), (23), (24), and (30), it follows Note that (65) satisfies (4) with
from (15), (16), and (35): , , and
, where
and denotes the column stacking op-
erator. Furthermore, is a class function. Now,
letting and , , denote the solution to (62) and
(35), respectively, and using (23), (24), (27), (32), and (33), it
follows that the time derivative of along
the closed-loop system trajectories is given by

(62)
(63)

and

(64)

where , ,
, , and and
are such that each component of and takes values
between 0 and 1.
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1063

Hence, it follows from (66) that in either case:

(67)

where , , and are given by (38) and (40), respectively.


Now, for

(68)

or

(69)

it follows that for all


, that is, for all
and , where
(see Fig. 12)

(66)
(70)
For the two cases given in (61), the last term on the right-hand
side of (66) gives the following.
1) If , then , and hence (71)

Next, define

2) Otherwise, , and hence


(72)

where is the maximum value such that , and define

(73)
1064 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

Fig. 12. Visualization of sets used in the proof of Theorem 4.1.

where is also positively invariant. In addition, since (32), (33),


and (62)–(64) are ultimately bounded with respect to
and (63) is input-to-state stable with viewed
as the input, it follows from Proposition 3.1 that the solution
, , of the closed-loop
(74)
system (32), (33) and (62)–(64) is ultimately bounded for all
To show ultimate boundedness of the closed-loop system (32), .
(33), and (62)–(64) assume2 that (see Fig. 12). Now, Finally, , , is a restatement of (30). Now,
since for all since , , and , , it follows
and , it follows that is positively from Proposition 2.1 that , , for all .
invariant. Hence, if
, then it follows from Theorem 3.2 that the solution
REFERENCES
, , to (32), (33), and (62)–(64) are
[1] P. G. Welling, Pharmacokinetics: Processes, Mathematics, and Appli-
ultimately bounded with respect to uniformly cations, 2nd ed. Washington, DC: Amer. Chemical Soc., 1997.
in with ultimate bound given by [2] J. A. Jacquez, Compartmental Analysis in Biology and Medicine.
which yields (37). In addition, since (63) is input-to-state stable Ann Arbor, MI: Univ. Michigan Press, 1985.
[3] W. Sandberg, “On the mathematical foundations of compartmental
with viewed as the input, it follows from Proposition 3.1 that analysis in biology, medicine and ecology,” IEEE Trans. Circuits Syst.,
the solution , , to (63) is also ultimately bounded. vol. CS-25, no. 5, pp. 273–279, May 1978.
Next, it follows from [46, Th. 1] that there exist a continu- [4] D. H. Anderson, Compartmental Modeling and Tracer Ki-
netics. Berlin, Germany: Springer-Verlag, 1983.
ously differentiable, radially unbounded, positive–definite func- [5] K. Godfrey, Compartmental Models and Their Applications. New
tion and class functions and such York: Academic, 1983.
that [6] J. A. Jacquez and C. P. Simon, “Qualitative theory of compartmental
systems,” SIAM Rev., vol. 35, pp. 43–79, 1993.
[7] W. M. Haddad and V. Chellaboina, “Stability and dissipativity theory
for nonnegative dynamical systems: A unified analysis framework for
(75) biological and physiological systems,” Nonlinear Anal.: Real World
Since the upper bound for is given by , it follows that Appl., vol. 6, pp. 35–65, 2005.
[8] J. M. Bailey and W. M. Haddad, “Drug-dosing control in clinical phar-
the set given by: macology: Paradigms, benefits, and challenges,” Control Syst. Mag.,
vol. 25, pp. 35–51, 2005.
[9] W. G. He, H. Kaufman, and R. J. Roy, “Multiple model adaptive con-
trol procedures for blood pressure control,” IEEE Trans. Biomed. Eng.,
(76) vol. BME-33, no. 1, pp. 10–19, Jan. 1986.
[10] R. R. Rao, B. Aufderheide, and B. W. Bequette, “Experimental studies
is also positively invariant. Now, since and are positively on multiple-model predictive control for automated regulation of
invariant, it follows: hemodynamic variables,” IEEE Trans. Biomed. Eng., vol. 50, no. 3,
pp. 277–288, Mar. 2003.
[11] R. G. Bickford, “Automatic electroencephalographic control of
anesthesia (servoanesthesia),” Electroencephalographic Clin. Neuro-
physiol., vol. 3, pp. 83–86, 1951.
(77) [12] H. Schwilden, J. Schutler, and H. Stoeckel, “Closed-loop feedback con-
trol of methohexital anesthesia by quantitative EEG analysis in hu-
2This assumption is standard in the neural network literature and ensures that, mans,” Anesthesiology, vol. 67, pp. 341–347, 1987.
D
in the error space ~ , there exists at least one Lyapunov level set ~D D ~ . [13] P. S. Glass, M. Bloom, L. Kearse, C. Rosow, P. Sebel, and P. Man-
In the case where the neural network approximation holds in with delayed berg, “Bispectral analysis measures sedation and memory effects of
values, this assumption is automatically satisfied. See the discussion at the end propofol, midazolam, isoflurane, and alfentanil in normal volunteers,”
of Theorem 4.1 for further details. Anesthesiology, vol. 86, pp. 836–847, 1997.
HADDAD et al.: NEURAL NETWORK ADAPTIVE OUTPUT FEEDBACK CONTROL FOR ICU SEDATION AND ANESTHESIA 1065

[14] P. S. Sebel, E. Lang, I. J. Rampil, P. F. White, R. Cork, M. Jopling, N. [40] T. Kazama, K. Ikeda, and K. Morita, “The pharmacodynamic interac-
T. Smith, P. S. Glass, and P. Manberg, “A multicenter study of bispec- tion between propofol and fentanyl with respect to the suppression of
tral electroencephalogram analysis for monitoring anesthetic effect,” somatic or hemodynamic responses to skin incision, peritoneum in-
Anesthesia Analgesia, vol. 84, pp. 891–899, 1997. cision, and abdominal wall retraction,” Anesthesiology, vol. 89, pp.
[15] A. R. Absalom, N. Sutcliffe, and G. N. Kenny, “Closed-loop control of 894–906, 1998.
anesthesia using bispectral index. Performance assessment in patients [41] P. S. Glass, M. Bloom, L. Kearse, C. Rosow, P. Sebel, and P. Man-
undergoing major orthopedic surgery under combined general and re- berg, “Bispectral analysis measures sedation and memory effects of
gional anesthesia,” Anesthesiology, vol. 96, pp. 67–73, 2002. propofol, midazolam, isoflurane, and alfentanil in normal volunteers,”
[16] M. M. R. F. Struys, T. D. Smet, L. F. M. Versichelen, R. V. den Anesthesiology, vol. 86, pp. 836–847, 1997.
Broecke, and E. P. Mortier, “Comparison of closed-loop controlled [42] J. C. Sigl and N. G. Chamoun, “An introduction to bispectral analysis
administration of propofol using BIS as the controlled variable versus for the electroencephalogram,” J. Clin. Monit., vol. 10, pp. 392–404,
“standard practice” controlled administration,” Anesthesiology, vol. 1994.
95, pp. 6–17, 2001. [43] E. Mortier, M. Struys, T. De Smet, L. Versichelen, and G. Rolly,
[17] P. S. A. Glass and I. J. Rampil, “Automated anesthesia: Fact or fantasy,” “Closed-loop controlled administration of propofol using bispectral
Anesthesiology, vol. 95, pp. 1–2, 2001. analysis,” Anaesthesia, vol. 53, pp. 749–754, 1998.
[18] F. L. Lewis, S. Jagannathan, and A. Yesildirak, Neural Network Control [44] T. W. Schnider, C. F. Minto, and D. R. Stanski, “The effect com-
of Robot Manipulators and Nonlinear Systems. London, U.K.: Taylor partment concept in pharmacodynamic modelling,” Anaes. Pharmacol.
& Francis, 1999. Rev., vol. 2, pp. 204–213, 1994.
[19] N. Hovakimyan, F. Nardi, A. Calise, and N. Kim, “Adaptive output [45] X. S. Zhang, R. J. Roy, and E. W. Jensen, “EEG complexity as a mea-
feedback control of uncertain nonlinear systems using single-hidden- sure of depth of anesthesia for patients,” IEEE Trans. Biomed. Eng.,
layer neural networks,” IEEE Trans. Neural Netw., vol. 13, no. 6, pp. vol. 48, no. 12, pp. 1424–1433, Dec. 2001.
1420–1431, Nov. 2002. [46] E. D. Sontag and Y. Wang, “On characterizations of the input-to-state
[20] N. Hovakimyan, B.-J. Yang, and A. J. Calise, “Adaptive output stability property,” Syst. Control Lett., vol. 24, pp. 351–359, 1995.
feedback control methodology applicable for non-minimum phase
systems,” Automatica, vol. 42, pp. 513–522, 2006.
[21] T. Hayakawa, W. M. Haddad, J. M. Bailey, and N. Hovakimyan, “Pas- Wassim M. Haddad (S’87–M’87–SM’01) received
sivity-based neural network adaptive output feedback control for non- the B.S., M.S., and Ph.D. degrees in mechanical en-
linear nonnegative dynamical systems,” IEEE Trans. Neural Netw., vol. gineering from the Florida Institute of Technology,
16, no. 2, pp. 387–398, Mar. 2005. Melbourne, in 1983, 1984, and 1987, respectively,
[22] A. Berman, M. Neumann, and R. J. Stern, Nonnegative Matrices in with specialization in dynamical systems and control.
Dynamic Systems. New York: Wiley, 1989. From 1987 to 1994, he served as a consultant for
[23] A. Berman and R. J. Plemmons, Nonnegative Matrices in the Mathe- the Structural Controls Group of the Government
matical Sciences. New York: Academic, 1979. Aerospace Systems Division, Harris Corporation,
[24] D. S. Bernstein and D. C. Hyland, “Compartmental modeling and Melbourne, FL. In 1988, he joined the faculty of
second-moment analysis of state space systems,” SIAM J. Matrix Anal. the Mechanical and Aerospace Engineering Depart-
Appl., vol. 14, pp. 880–901, 1993. ment at Florida Institute of Technology where he
[25] D. S. Bernstein and S. P. Bhat, “Nonnegativity, reducibility, and founded and developed the Systems and Control Option within the graduate
semistability of mass action kinetics,” in Proc. IEEE Conf. Decision program. Since 1994, he has been a member of the faculty in the School of
Control, Phoenix, AZ, Dec. 1999, pp. 2206–2211. Aerospace Engineering at Georgia Institute of Technology where he holds
[26] H. K. Khalil, Nonlinear Systems. Upper Saddle River, NJ: Prentice- the rank of a Professor. His research contributions in linear and nonlinear
Hall, 1996. dynamical systems and control are documented in over 450 archival journal
[27] H. L. Royden, Real Analysis. New York: Macmillan, 1988. and conference publications. He is the coauthor of the books Hierarchical
[28] C. I. Byrnes and A. Isidori, “Asymptotic stabilization of minimum Nonlinear Switching Control Design with Applications to Propulsion Systems
phase nonlinear systems,” IEEE Trans. Autom. Control, vol. 36, no. (New York: Springer-Verlag, 2000), Thermodynamics: A Dynamical Systems
10, pp. 1122–1137, Oct. 1991. Approach (Princeton, NJ: Princeton Univ. Press, 2005), Impulsive and Hybrid
[29] A. Isidori, Nonlinear Control Systems. New York: Springer-Verlag, Dynamical Systems: Stability, Dissipativity, and Control (Princeton, NJ:
1995. Princeton Univ. Press, 2006), and Nonlinear Dynamical Systems and Control:
[30] C.-T. Chen, Linear System Theory and Design. New York: Holt, A Lyapunaov-Based Approach (Princeton, NJ: Princeton Univ. Press, 2007, to
Rinehart and Winston, 1984. appear). His recent research is concentrated on nonlinear robust and adaptive
[31] E. Lavretsky, N. Hovakimyan, and A. J. Calise, “Upper bounds for ap- control, nonlinear dynamical system theory, large-scale systems, hierarchical
proximation of continuous-time dynamics using delayed outputs and nonlinear switching control, analysis and control of nonlinear impulsive
feedforward neural networks,” IEEE Trans. Autom. Control, vol. 48, and hybrid systems, system thermodynamics, thermodynamic modeling of
no. 9, pp. 1606–1610, Sep. 2003. mechanical and aerospace systems, network systems, nonlinear analysis and
[32] P. De Leenheer and D. Aeyels, “Stabilization of positive linear sys- control for biological and physiological systems, and active control for clinical
tems,” Syst. Control Lett., vol. 44, pp. 259–271, 2001. pharmacology.
[33] B. Marsh, M. White, N. Morton, and G. N. Kenny, “Pharmacokinetic Dr. Haddad is a National Science Foundation Presidential Faculty Fellow and
model driven infusion of propofol in children,” Brit. J. Anaesth., vol. a member of the Academy of Nonlinear Sciences.
67, pp. 41–48, 1991.
[34] R. N. Upton, G. I. Ludrook, C. Grant, and A. Martinez, “Cardiac output
is a determinant of the initial concentration of propofol after short-term
administration,” Anethesia Analgesia, vol. 89, pp. 545–552, 1999. James M. Bailey received the B.S. degree in premed-
[35] M. Muzi, R. A. Berens, J. P. Kampine, and T. J. Ebert, “Venodilation ical studies from the Davidson College, Charlotte,
contributes to propofol-mediated hypotension in humans,” Anethesia NC, in 1969, the Ph.D. degree in chemistry (phys-
Analgesia, vol. 74, pp. 877–883, 1992. ical) from the University of North Carolina, Chapel
[36] E. F. Ismail, S. J. Kim, and M. R. Salem, “Direct effects of propofol Hill, in 1973, and the M.D. degree from the Southern
on myocardial contractility in situ canine hearts,” Anesthesiology, vol. Illinois University School of Medicine, Springfield,
79, pp. 964–972, 1992. in 1982.
[37] A. V. Hill, “The possible effects of the aggregation of the molecules of He was a Helen Hay Whitney Fellow at the Cali-
haemoglobin on its dissociation curves,” J. Physiol., vol. 40, pp. iv–vii, fornia Institute of Technology, Pasadena, from 1973
1910. to 1975 and an Assistant Professor of chemistry and
[38] T. Kazama, K. Ikeda, K. Morita, M. Kikura, M. Doi, T. Ikeda, and T. biochemistry at the Southern Illinois University from
Kurita, “Comparison of the effect site Ke0s of propofol for blood pres- 1975 to 1979. After receiving the M.D. degree, he completed the residency in
sure and EEG bispectral index in elderly and young patients,” Anesthe- anesthesiology and then a fellowship in cardiac anesthesiology at the Emory
siology, vol. 90, pp. 1517–1527, 1999. University School of Medicine affiliated hospitals. From 1986 to 2002, he was
[39] R. G. Eckenhoff and J. S. Johansson, “On the relevance of “clinically an Assistant Professor of anesthesiology and then Associate Professor of anes-
relevant concentrations” of inhaled anesthetics in in vitro experiments,” thesiology at Emory, where he also served as the Director of the critical care
Anesthesiology, vol. 91, pp. 856–860, 1999. service. In September 2002, he moved his clinical practice to Northeast Georgia
1066 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 18, NO. 4, JULY 2007

Medical Center, Gainesville, as the Director of cardiac anesthesia and Consul- Naira Hovakimyan (M’01–SM’02) received the
tant in critical care medicine. He remains affiliated with the Emory University Ph.D. degree in physics and mathematics from
School of Medicine Department of Anesthesiology as a Clinical Associate Pro- the Institute of Applied Mathematics of Russian
fessor. He is board-certified in anesthesiology, critical care medicine, and trans- Academy of Sciences, Moscow, Russia, in 1992.
esophageal echocardiography. He is the author or coauthor of 98 journal arti- She joined the Institute of Mechanics at the Ar-
cles, conference publications, or book chapters. His research interests have fo- menian Academy of Sciences as a Research Scien-
cused on pharmacokinetic and pharmacodynamic modeling of anesthetic and tist, where she worked until 1997. In 1997, she was
vasoactive drugs and, more recently, applications of dynamical system theory awarded a governmental postdoctoral scholarship to
in medicine. work in INRIA, France. The subject areas in which
she has published include differential pursuit-evasion
games, optimal control of robotic manipulators, ro-
bust control, and adaptive estimation and control. In 1998, she was invited to the
Tomohisa Hayakawa (S’00–M’04) received the School of Aerospace Engineering of Georgia Technology University, Atlanta,
B.Eng. degree in aeronautical engineering from where she worked as a Research Faculty Member until 2003. In 2003, she joined
the Kyoto University, Kyoto, Japan, in 1997, the the Department of Aerospace and Ocean Engineering, Virginia Polytechnic In-
M.S. degree in aerospace engineering from the State stitute and State University, Blacksburg, as an Associate Professor. She is the
University of New York (SUNY), Buffalo, NY, in author of over 100 refereed publications. Her current interests are in the theory
1999, and the M.S. degree in applied mathematics of adaptive control and estimation, neural networks, and stability theory.
and the Ph.D. degree in aerospace engineering both Dr. Hovakimyan is the Associate Fellow of the American Institute of Aero-
from Georgia Institute of Technology, Atlanta, in nautics and Astronautics (AIAA), the member of the American Mathematical
2001 and 2003, respectively. Society (AMS) and the International Society of Dynamic Games (ISDG), and
After serving as a Research Fellow at the Depart- she is serving as an Associate Editor for the IEEE Control Systems Society,
ment of Aeronautics and Astronautics, Kyoto Uni- IEEE TRANSACTIONS ON NEURAL NETWORKS, IEEE TRANSACTIONS ON
versity, and at the Japan Science and Technology Agency (JST), he joined the CONTROL SYSTEMS TECHNOLOGY, Computational Management Science of
Department of Mechanical and Environmental Informatics, Tokyo Institute of Springer, International Journal of Control, and Systems and Automation. She is
Technology, Tokyo, Japan, in 2006, where he is an Associate Professor. His re- the 2004 and 2005 recipient of Pride@Boeing award. She is the recipient of the
search interests include stability of nonlinear systems, nonnegative and compart- Society of Instrument and Control Engineers (SICE) International scholarship
mental systems, hybrid systems, nonlinear adaptive control, neural networks and for the best paper of a Young Investigator at the VII ISDG Symposium (Japan,
intelligent control, stochastic dynamical systems, and applications to aerospace 1996).
vehicles, multiagent systems, robotic systems, financial dynamics, and biolog-
ical/biomedical systems.

You might also like