You are on page 1of 12

Computers and Chemical Engineering 30 (2006) 1464–1475

Bioprocess control: Advances and challenges


Joseph S. Alford ∗
Eli Lilly and Company, Indianapolis, IN 46285, United States
Received 15 February 2006; received in revised form 5 May 2006; accepted 16 May 2006
Available online 31 July 2006

Abstract
Most commercial process control tools and techniques used in the chemical process industries are applicable to bioprocesses (e.g. processes
that make certain medicines). While bioprocesses include several different unit operations, the focus of most of this paper will be the control of
bioreactors (i.e. fermentors) in which a near optimal environment is desired for microorganisms to grow, multiply, and produce a desired product.
Bioreactor control provides special challenges due to significant process variability, the complexity of biological systems, the need, in many cases, to
operate in a sterile environment, and the relatively few real-time direct measurements available that help define the state of the culture. This has led to
some creative solutions as well as the identification of topics where further research and development and vendor software functionality are needed.
© 2006 Elsevier Ltd. All rights reserved.

Keywords: Bioprocess control; Advanced process control; Data management; Bioprocess instrumentation; Alarm management

1. Introduction and observing the color and texture of the foam layer riding on
top of the liquid broth.
In the early 1970s, when the author’s career in bioprocess Much progress has occurred during the past 30 years. Some
automation began, fermentation processes (e.g. making peni- of this has resulted from a few companies, including Eli Lilly
cillin), were perceived to be as much art as science. Making and Company, using a combination of commercial technolo-
antibiotics was not much more sophisticated than making wine. gies, complemented by significant customization of computer
Ingredients were put into a tank which was then closed and man- systems (see Section 9), and the in-house development of novel
ually sterilized, manually inoculated with living cells, and then applications. Commercial solutions have been slowly catching
agitated and aerated for several days. The only automated feed- up to the needs of bioprocesses, although several gaps and oppor-
back control was a pneumatic controller used for temperature. tunities still remain.
Data recording consisted of operators walking the production Regarding the current state of commercial computer control
floor, periodically recording values from gauges onto a clipboard systems used for bioprocesses, a variety of process control and
sheet and then hand copying a few of these values onto the paper data analysis platforms and products currently exist. They gen-
manufacturing ticket. Most of the parameters of real value to the erally represent variations of the generic system shown in Fig. 1.
fermentation culture [e.g. pH, dissolved oxygen (DO2 ), nutrient With some exceptions, the low end of process control prod-
concentrations] were not monitored or controlled on-line. Sci- ucts involves benchtop bioreactors which are controlled via
entists, in trying to evolve less variable and higher productivity personal computers (PCs) with Linux or Windows operating sys-
processes, were frustrated with the lack of appropriate sensors, tems and which incorporate either commercial (e.g. LabView)
tools, and support systems. Symbolizing this “state of affairs,” or bioreactor vendor proprietary application software packages.
sometimes the best indicator as to how well a fermentation was Intermediate level systems may involve Programmable
progressing was the subjective analysis of an experienced opera- Logic Controllers (e.g. Rockwell) + a Human Machine Interface
tor looking through a site glass located on the top of the fermentor (HMI) + a historian.
High end systems (used in many fermentation pilot plants
and production facilities) utilize Distributed Control Systems
∗ Tel.: +1 317 276 7653; fax: +1 317 276 1403. (e.g. Fox IA, Emerson DeltaV, Fisher Provox) which, in turn,
E-mail address: Alford Joseph S@Lilly.com. are usually linked to proprietary or commercial historians (e.g.

0098-1354/$ – see front matter © 2006 Elsevier Ltd. All rights reserved.
doi:10.1016/j.compchemeng.2006.05.039
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1465

Fig. 1. Generic bioprocess control and information system schematic.

Fig. 2. Culture growth stages.


Aspen Tech, OSI-PI). In some cases, DCS/Historian systems are
further interfaced with third party products (e.g. alarm loggers, the cells and eliminating cellular debris from the stream con-
real-time expert systems, advanced data analysis software) to taining desired product. Further, especially for mammalian cell
provide additional functionality. They may also be interfaced bioprocesses, there are a number of unit operations that exist
with other internal company systems such as laboratory assay to provide several log reductions in viruses that may exist in
systems, manufacturing execution systems (MES), or plant site the product stream as a result of using certain mammalian cell
data warehouses (e.g. Discoverant). lines and/or due to any use of animal sourced (especially bovine)
Improvements to bioprocess productivity generally come raw materials. These viral reduction unit operations can include
from two sources: (1) cell lines and (2) process control. detergent-based inactivation, column chromatography, low pH
(e.g. pH < 4) inactivation, and nanofiltration. With each of these
product recovery/purification operations (some of which are
shown in Fig. 3), the operation is either primarily manual, or con-
trolled with a local PC or programmable logic controller (PLC).
Availability of appropriate measurements and control algorithms
are generally satisfactory for post-fermentation unit operations.
Opportunities for improvement are primarily in the areas of data
analysis (described later) and automated batch sequencing.
Therefore, the most challenging unit operation from the per-
In this paper, the focus is on bioprocess control. Biopro-
spective of process control is the bioreactor itself, and this will
cess control is defined as providing a near optimal environment
be the focus for most of the rest of this paper.
for microorganisms to grow, multiply, and produce a desired
The benefits of bioreactor automation have been the subject
product. This includes providing the right concentration of nutri-
of several conferences and articles in the literature, such that
ents to the culture (e.g. carbon, nitrogen, oxygen, phosphorous,
automation is an expectation and no longer needs justification
sulfur, minerals), removing any toxic metabolic products (e.g.
for most manufacturing plants. Benefits have included:
CO2 ), and controlling important internal cellular parameters
(e.g. temperature, pH).
A. process variability reduction;
Note: While subtle differences exist in the definitions of the
B. productivity (yield) improvements;
terms bioreactor and fermentor, the terms are interchangeable for
C. increased on-line monitoring and troubleshooting capability.
many processes and will be used interchangeably in this paper.
The scope of bioprocess control theoretically includes not 1.1. Pre-inoculation operations: creating a consistent
only the sequence of steps leading up to and including the fer- optimum sterile medium
mentor (Fig. 2), but also, in many cases, to several of the product
recovery and purification steps (Fig. 3). For example, follow- Bioprocess media composition has historically been deter-
ing the fermentation, subsequent steps may involve rupturing mined by development scientists and engineers using statistical
1466 J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475

Fig. 3. Product recovery steps following fermentation.

“Design of Experiment” (DOE) methods in small-scale batch unlike GA, PSO has no evolution operations such as crossover
equipment. A relatively small dimensional space is defined by and mutation. Reasons for choosing PSO versus other evolu-
the scientist, based on experience and literature searches. This tionary computation techniques are its ease of implementation,
space is typically explored in detail using three to five nutrients relatively few parameters to adjust, reputation for finding global
of special interest (e.g. those nutrient concentrations expected to optimum and fast convergence.
highly influence culture growth and productivity, process eco- One of several differences between particle swarm and tra-
nomics, etc.). This approach assumes that other factors (e.g. ditional methodologies is that, rather than defining fixed pre-
sterilization operations and the control of earlier phases in grow- determined experiments using different nutrient concentrations;
ing the culture; see Fig. 2) are consistent. DOE methods typically an iterative approach is used in which the results from previous
result in a broth nutrient “local optimum,” but not necessarily a iterations are used to define the next iteration of experiments.
“global optimum.” Note: Broth nutrient optimization is a classi- To explain particle swarm further, using a global mountain
cal non-linear optimization problem. peak search analogy, consider the objective of finding the high-
Recent work with non-traditional techniques indicates greater est point in the Himalayan mountain range. A statistical design
opportunity exists to (1) realize a “global optimum” with respect approach might be to identify a “best guess” localized terrain,
to nutrient make-up and (2) understand and better control activi- encompassing a small number of mountains, which may or
ties associated with pre-inoculation nutrient concentration vari- may not include Mount Everest. “N” locations would be pre-
ability (e.g. during sterilization). Specific to nutrient make- defined in this area, with many located on the boundaries of
up, “evolutionary computation” algorithms, such as particle the defined area. Climbers would journey to these locations and
swarm optimization, have shown that, compared to traditional measure the altitude (i.e. fermentation yield). The results, in
approaches, a similar number of small-scale equipment exper- combination with response surface mapping algorithms, would
iments can explore a higher dimensional space (i.e. number hopefully identify the approximate local point of maximum
of different nutrients) and larger ranges of individual nutri- altitude.
ents (Cockshott & Hartman, 2001; Cockshott & Sullivan, 2001; With particle swarm, a larger area (i.e. terrain) would be
Strobel & Hartman, 1999). Note that particle swarm optimiza- defined, encompassing many mountains in the range. A team
tion, genetic algorithms, etc., are subsets of evolutionary com- of several climbers would be randomly distributed throughout
putation which, in turn, is a subset (along with neural nets and the range at which time they would note their altitude (i.e. ini-
fuzzy logic) of computational intelligence. Particle swarm opti- tial broth composition and resulting fermentation yield—which
mization (PSO) is a population-based stochastic optimization is iteration #1). Each of the climbers would then hike toward a
technique, developed by Dr. Eberhart and Dr. Kennedy in 1995, predefined new location (symbolic of a new chemical broth com-
which was inspired by the social behavior of phenomena such position) and note their new altitude (i.e. fermentation yield),
as bird flocking, i.e. a large number of birds eventually converg- which is iteration #2. The comparison of altitude changes (i.e.
ing on a common roost. PSO shares several similarities with yield improvement) for each of the climbers (from iteration 1 to
evolutionary computation techniques such as genetic algorithms 2) as well as where the highest climber (best yield) is located is
(GA). The system is initialized with a population of random solu- used to determine the direction (i.e. the next broth composition
tions and searches for optima by updating generations. However, to try) for each of the climbers for the following iteration.
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1467

While not a guarantee, the particle swarm approach increases cell mass. This major load change is associated with correspond-
the space that can be searched with a similar number of experi- ing major changes during the time course of the fermentation
ments and increases the chance that a global optimum (i.e. Mount in oxygen and nutrient requirements, carbon dioxide removal,
Everest) will be found. “base” needed to maintain pH, etc. Many bioprocesses are addi-
It has been suggested that the two techniques described above tionally complicated in that an induction or product formation
can be used in complementary ways, with evolutionary algo- de-repression activity occurs part way through the bioprocess in
rithms doing the initial broad search and a statistical design then which the culture is induced to change from a “growth” mode to
used for fine tuning. a “product synthesis” mode. This induction is often triggered by
The “particle swarm” evolutionary technique has been tried a programmed shift in temperature or by a chemical addition.
on two existing fermentation processes at Lilly. In both fermen- Also, while this paper focuses primarily on computer-based
tation processes, broth medium composition providing >50% closed loop aspects of bioprocesses, there is significant work that
yield improvement resulted, compared to the previous existing goes into the design of the bioreactor itself. As one example, an
production medium. agitator normally exists for most aerobic bioprocesses, the mis-
Regarding sterilization, it is known that large differences sion of which is to (1) break up sparging gas bubbles into smaller
can exist in the heat up and cool down times of sterilization bubbles (to increase bubble surface area to enhance oxygen and
operations, depending primarily on the size of fermentors (e.g. carbon dioxide transfer) and (2) provide good mixing to achieve
large production fermentors usually have longer total steriliza- a homogeneous broth. Homogeneity is needed, e.g. to provide a
tion cycles than pilot scale fermentors). Since chemical reactions uniform dissolved oxygen range throughout the fermentor such
of nutrient components are usually occurring during steriliza- that all cells have sufficient oxygen. Homogeneity is also needed
tion, differences in heat cycles can cause significantly different so that slug additions of caustic (for pH control) and antifoam
post-sterilization broth composition of nutrients. For example, are quickly dispersed so as to, e.g. minimize local damage to
glucose, when exposed to sterilizing temperatures, can form cells at the point of entry. Unfortunately, the agitator RPM and
polymers, can react with amino acids to generate Browning types of agitator blades needed to achieve these goals often cre-
reaction products, and can also react with inorganic phosphorus. ates high shear conditions that are harmful to the cells. So, the
For some fermentations, the difference in heat cycles can cause final reactor design plays an important role in the control of a
sufficient differences in post-sterilization nutrient composition bioprocess and is usually a compromise with respect to shear
which can then significantly impact the subsequent extent that effects, mixing, gas mass transfer rates, and process economics
the culture can grow and produce product. Work has been pub- (i.e. utility costs).
lished on characterizing the extent that chemical reactions occur
during sterilization, using a parameter defined as Ro (similar in 3. Process measurements
concept to the use of Fo in characterizing the degree to which
contaminating microorganisms are killed during sterilization). “If you can’t measure it, you can’t manage it.”
Both Fo and Ro are integrating functions of time and tempera-
ture. It is suggested that Fo and Ro can be used collectively as a 3.1. Primary on-line sensors
basis of achieving both a desired kill of contaminating organisms
and a consistent post-sterilization nutrient composition (Boeck, A small number of different on-line sensors are commonly
Wetzel, Burt, Huber, Fowler, & Alford, 1988; Boeck, Alford, used in bioprocesses. They include temperature, bioreactor back
Pieper, & Huber, 1989). pressure, gas flow rates, agitation rate, dissolved oxygen, and pH.
A few additional probes are used in some fermentations [e.g.
2. Post-bioreactor inoculation operations dissolved carbon dioxide, redox (oxidation reduction), and opti-
cal density]. Sometimes redundant sensors (e.g. electrochemical
The remainder of the process control part of this article will probes such as pH and DO2 ) are used.
focus on the control of the post-inoculation culture environment, The challenge in the development of additional (especially
typically by computer control systems. Inoculation normally invasive) on-line sensors has historically been, and continues to
defines the beginning of seed and fermentor phases of the overall be:
bioprocess cycle. Topics considered include the current state of
sensors, control algorithms, and data analysis tools. • The need to operate dependably in a harsh, agitated, sterile,
To provide the reader with a sense of process complexity, and sometimes gritty environment.
the non-linear batch nature of a bioreactor operation is first • The need to maintain calibration over long time periods (e.g.
described. several weeks).
A fermentation will begin with an inoculation step in which • Interpretation of output (e.g. for optical density in complex
a relatively small number of pure culture cells are transferred to media, redox).
the bioreactor. The cells then grow exponentially until such time
that something (e.g. oxygen, carbon source) becomes limiting, or 3.2. Primary at-line sensors
something [e.g. substrate, dissolved carbon dioxide (dCO2 ), or
product] becomes inhibiting. The highest cell mass achieved will Several additional at-line analytical measurements are avail-
normally be several orders of magnitude greater than the starting able that can provide periodic values of important bioprocess
1468 J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475

Fig. 5. Culture oxygen uptake vs. time.

The information from these gas analyzers are typically


combined with other measurements or on-line estimates (e.g.
DO2 , gas flow rates, broth volume) to calculate and make
Fig. 4. Process mass spectrometer (an example of Process Analytical Technol-
ogy). available in near real-time a large number of additional
parameters about a bioprocess. These include: culture oxy-
gen uptake (see Fig. 5), culture carbon dioxide evolution,
parameters. Examples include: respiration quotient, kL a (oxygen mass transfer coefficient),
metabolic heat output, etc. This information has proven so
• broth glucose concentration (e.g. YSI analyzer); valuable to operators, scientists, and technical service per-
• broth glutamine concentration (e.g. Nova blood gas analyzer); sonnel that Lilly has process mass spectrometers interfaced
• cell size distribution (e.g. Lasentech). to most of their development and production fermentors
(Alford, 2006).
While a few auto-sampling systems are known to exist, the 2. Broth composition measurements, including use of near
general limitations of these at-line sensors include (1) their use infrared (NIR), mid infrared (MIR), ion chromatography,
of manual operations (e.g. sampling and sample preparation) and HPLC. These systems can provide analysis of nutrient
and (2) measurement frequency that may not practically allow compositions, ammonia, product concentrations, and other
for closed loop feedback control. metabolites.

3.3. Process analytical technologies The Food and Drug Administration (FDA) is now actively
promoting the consideration of Process Analytical Technol-
A few analytical systems have evolved over the past 20 ogy (PAT) in pharmaceutical manufacturing processes. They
years that provide powerful additional real-time or near real- define PAT as “a system for designing, analyzing, and con-
time information about bioprocesses. They include: trolling manufacturing through timely measurements (i.e. dur-
ing processing) of critical quality and performance attributes
1. Gas analysis, using process mass spectrometry (replacing of raw and in-process materials and processes with the goal
earlier use of infrared carbon dioxide analyzers, paramag- of ensuring final product quality (U.S. Department of Health
netic oxygen analyzers, and gas chromatographs). and Human Services, Food and Drug Administration, 2004).”
These systems typically provide analysis of the concentra- It is expected that the adoption of the concepts and technolo-
tions of oxygen, nitrogen, and carbon dioxide in gas streams. gies behind PAT will help lead the pharmaceutical industry
They sometimes also provide information on other gases that in its evolution toward more advanced process control. This
may be present (e.g. alcohols, hydrogen sulfide). One of the is because several bioprocess measurements, historically per-
most popular types of process mass spectrometers is the mag- formed off-line with results obtained too late to be of pro-
netic sector type (e.g. from Thermo-Electron & Hamilton cess control relevance, will now be performed more frequently
Sustrand) due to their very high accuracy and stability (see and in near real-time and, therefore, available as control loop
Fig. 4). inputs.
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1469

One of the issues in using PAT systems is whether these 3.5. Sensor advances/issues
are smart sensors or computer systems (PAT systems typi-
cally include computers that handle data processing, calibration, In addition to advances in the measurements available online
analysis, etc., and which typically require software configur- and at-line, the technologies involved with sensors are also
ing and/or programming). How these systems are defined will rapidly progressing.
impact the degree to which they must be validated.
• Many sensors are becoming much smaller, leading to micro
3.4. Virtual sensors sensor arrays.
• Sensors (and valves) are becoming smarter-providing mainte-
Despite the range of measurements available with the above nance and diagnostic information as well as traditional mea-
listed technologies, some challenges still exist, e.g. surements and/or valve position indication. (Note: While this
information is valuable, it can also contribute to information
1. Estimating cell mass in fermentations using complex media and alarm overload for operators.)
ingredients, i.e. ingredients that include solids (e.g. soy bean • Digital highways are available to communicate this addi-
grits). tional information to computer control systems (e.g. Fieldbus,
2. Estimating broth primary carbon substrate (e.g. glucose) con- Profibus).
centration on-line—for use in continuous control. • Some sensor-to-computer communications are becoming
3. Estimating product concentration on-line. “wireless.”

Accurate estimates/measurements of these parameters are In general, the pharmaceutical industry moves slowly and
important as most published bioprocess models include cell cautiously in adopting such new technologies—due in part to
mass, primary carbon substrate concentration and product con- the regulatory environment in which companies operate, the
centration as primary “state” variables. need to prospectively validate (provide documented verifica-
Some success has been reported in estimating parameters tion) that they work, and the challenge of obtaining quality
such as these via use of virtual sensors. These sensors make use control and management approval in a typically conservative
of available on-line information and a model (via, e.g. statistical and bureaucratic change control environment. Therefore, bio-
regression, neural nets) to predict those parameters which are process engineers in industry are watching these trends, but will
impractical to continuously measure directly. Not surprisingly, typically not be the first ones to try them out.
many of these virtual sensors make use of culture oxygen uptake As an example of an opportunity for further research and
or carbon dioxide evolution (CER) calculated parameters (avail- development, significant success has been achieved in using
able from process mass spectrometry and other measurements) neural nets as virtual sensors for applications such as on-line
as a key input, as these are strongly correlated to, e.g. glucose substrate (e.g. glucose) concentration, cell mass, etc. However,
consumption and cell growth. For example, some bioprocesses neural nets are known as a “black box” type of model meaning
(especially ones using complex media) have used the simple they are not based on first principles (i.e. deterministic). One
equation: of the consequences of this is that they are generally poor at
dX extrapolating. They are also weak in accommodating lags (usu-
CER = k1 × + k2 × X (1) ally requiring trial and error estimates of delay constants) which
dt
are frequently encountered in bioprocesses. There is significant
(where X is cell mass and t is time) to accurately estimate cell appeal in developing “hybrid models” such that all available first
mass on-line, especially for seed vessels and the growth phases principle information is utilized in the model with neural nets
of fermentations (Alford, 1978). This equation assumes that cell then used to estimate parameters (e.g. specific culture growth
respiration activity (represented by CER) comes primarily from rate constant) that is a complex non-linear function of several
two types of activities: (1) cell maintenance (proportional to parameters. Psichogios and Ungar (1992) successfully used this
X) and (2) cell growth (proportional to dX/dt). Fitting Eq. (1) approach in modeling a bioprocess. However, they separately fit
to historical X and CER data results in the determination of the model constants in the first principles model and the node
k1 and k2 . CER, already calculated on-line, then allows for the weights in the neural net. We would challenge vendors and uni-
additional on-line calculation of cell mass (X). As the batch versities to develop algorithms that could simultaneously fit all
proceeds in time, this equation is eventually forgiving of any the constants in both portions of the hybrid model, including
error in the initial estimate of X (i.e. X at t = 0) and can be used process delay parameters, which could then theoretically gener-
as a rational basis for certain process control decisions, such as ate more meaningful values for the constants and more accurate
when to transfer the seed inoculum to the fermentor. models.
While virtual sensors can be quite accurate during growth
stages of cultures, they are often less accurate during product 4. Control techniques
synthesis portions of the process. This is because cellular activ-
ities involved with product synthesis are less well understood, A comprehensive review of fermentor control techniques was
and significant cell death and product inhibition may be occur- published by Rani and Rao (1999). This review summarizes
ring. and references over 100 other published articles, many deal-
1470 J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475

respects. For example, the most common model for a limiting


substrate is the well-known Michaelis–Menten equation:
Max Rate × S
Reaction rate = (2)
Km + S
where Km is the Michaelis–Menten constant and is equal to
the substrate concentration at which the reaction rate is half
the Max Rate.
So, when a substrate (S) such as glucose becomes limiting
to the culture [when S in Eq. (2) becomes small compared to
Km ], then the relationship between metabolic activity (e.g.
glucose consumption) and glucose concentration changes
from zero order to first order. For some fermentations (e.g.
Bakers Yeast), there is an additional non-linear complication
in that glucose concentrations that are too high will cause the
culture to shift from a metabolism of making yeast to mak-
ing ethanol. The control of dissolved oxygen is another good
example where a brief power outage can cause the loss of
agitation causing the DO2 to go to near zero which, in turn,
causes the culture to slow and/or stop some metabolic activ-
Fig. 6. Typical computer controlled fermentor.
ities. The dynamics of recovery will be much different than
if the DO2 had only dropped a small amount. The control
ing with various aspects of advanced control. These include of dissolved CO2 for mammalian cell-based processes is yet
dynamic programming, on-line adaptive control, on-line opti- another example where concentrations too low or too high
mization, non-linear control, and optimal control. Many of the are detrimental to the culture while a range in the middle is
articles involve neural nets. OK.
Bioprocesses employ most of the same types of control as 2. Sub-optimal control
are used in other chemical industries. Over half of most bio- The pharmaceutical industry is not unlike many other
process control loops can be handled by traditional single input industries in which observers have noted that control loops
single output feedback PI (proportional + integral) controllers. are often found in manual rather than automatic mode and
(See Fig. 6 for typical manipulated variables such as sterile air- that loops are not well tuned. Some observers suggest a gap
flow, nutrient feed, pressure, cooling water flow and agitation exists between what is typically taught in university process
rate, and controlled variables such as pH, temperature, and dis- control courses and what is needed in industries such as the
solved oxygen.) batch oriented pharmaceutical industry.
The most frequently found more advanced type of control More specifically, many university process control courses
are DO2 cascade controllers in which dissolved oxygen (i.e. the focus on theoretical aspects of continuous “steady state”
master loop) is cascaded to the control loop for agitation, airflow, types of processes and include such topics as Laplace trans-
substrate feed rate, or some other parameter (the slave loop). forms, Nyquist Stability Criterion, Bode plots, Root Locus
Other examples of advanced control known to exist for indus- diagrams, and Routh Stability Arrays. In contrast to this, a
trial bioprocesses are: large percentage of today’s industrial processes are batch
non-steady-state processes and the techniques mentioned
1. Multivariable control (e.g. to simultaneously control DO2 above are rarely used by practicing automation engineers.
and dCO2 using independently controlled air and pure oxy- There appears to be a need for more focus in educational
gen gas feeds). forums on dealing with process transients, process phase
2. Model reference control (e.g. to control glucose based on transitions, and other non-linear control dynamics of batch
a virtual sensor). Model reference controllers are especially processing and more training on loop tuning techniques and
effective in minimizing the problem of dead time (i.e. trans- control valve characteristics and selection.
portation delay) associated with at-line instruments. 3. Advanced control algorithms (Fuzzy Logic, Model Predic-
3. Adaptive control (e.g. adaptive gain). tive Control, etc.)

While most of these techniques work satisfactorily, there are As with new hardware technologies, the pharmaceutical
reasons for caution and opportunities for improvement: industry tends to move slowly and cautiously with regards to
new approaches in control algorithms.
1. Process non-linearity Practicing engineers are monitoring the evolution of tech-
PID is a controller intended for processes that behave in a niques such as Fuzzy Logic Controllers and Model Predictive
linear fashion (or in a pseudo-linear region of a larger non- Control (MPC)—and considering their potential application to
linear space). Microorganism cultures are non-linear in many bioprocesses. For example, Nyttle and Chidambaram (1993)
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1471

showed how a fuzzy logic controller could use glucose (sub- have only sometimes been successful in evolving an improved
strate) feed to force a fermentation to follow a predetermined process control strategy.
product (Penicillin) time varying concentration profile. Of significant recent interest is the use of neural nets for such
There is some reluctance to aggressively pursue such tech- purposes. This approach is less time consuming, is easier for
nologies for industrial bioprocesses due to: engineers to learn and use, and avoids the need to define deter-
ministic equations of the process. Vlassides, Ferrier, and Block
A. Uncertainty as to how to prospectively validate them. (2001) include a discussion and application of this approach for
B. A desire to keep things simple for the supporting engineers wine fermentations.
who are often instrument technicians or B.S. Chemical Engi- However, as suggested before in Section 3.5, there is promise
neers with only one university course in process control (i.e. in using hybrid models in defining the process and pursuing pro-
do as much as possible with PID feedback and cascade tech- cess optimization in which neural nets would be complimented
niques). with known first principle equations.
C. Lack of compelling published evidence (so far) that such Another kind of process optimization involves minimizing
algorithms significantly increase bioprocess productivity. energy costs. Fermentors are often the highest energy con-
suming unit operation in a plant due to the high volumes of
Specific to MPC, additional concerns include: compressed air and high agitation power required. Shields and
Kao (1994) developed and implemented a model in which
A. The technique was initially developed for continuous pro- airflow and agitation RPM were managed so as to control
cesses. While also applicable to batch processes, the per- DO2 at a desired setpoint while minimizing the total energy
ceived complexity is greater due to significant changes in consumption.
process gain and dynamics during the batch.
B. Reluctance to perturb the process to generate data to build
the model (especially regarding pH and temperature). 6. Other automation system functionality
C. The perception (real or imagined) that outside consultants
are needed to help develop (and sometimes support) the pro- In addition to control algorithms, process control computers
cess model and overall optimization application. are expected to provide additional utilities/functionality related
D. Concerns of sustainability. A recent published survey by to process automation. These can include batch sequencing,
Invensys (Rosenzweig, 2005) noted “The real issue for MPC alarm management, interlocks, diagnostics, links to historians,
is keeping operators from turning it off. Over 50% of all pre- interfacing to PAT and at-line analytical systems, and pro-
dictive controllers are in manual.” viding effective easily configured color-graphic displays for
operators.
On the positive side, good progress has recently been made As a specific example of the opportunities and issues involved
in simplifying the implementation of MPC. For example, Inven- with these other process control functions, alarm management
sys, Emerson, Solutia, and Aspen Technology have developed is discussed.
improved tools that reduce the effort, expertise, and expense Alarm management is a significant problem area for most
previously required with using MPC. companies in the chemical industries, including bioprocesses.
Nuisance alarms often exist which: (1) frustrates operators, (2)
5. Process optimization trigger needless process deviation investigations, and (3) gives a
perception to others of a process NOT in control. Further, smart
Much has been published regarding the creation of models alarms which: (1) can access all pertinent process information
of bioprocesses as a means of better understanding the process within the computer, (2) provide insight to operators as to root
and in helping generate optimum control strategies. Tradition- cause, (3) display the expected operator response, and (4) link
ally, control setpoints for industrial bioprocesses have been to paging systems are often difficult to configure. Opportunities
determined by trial and error experimentation in pilot plants; for improvement exist with both company implementation prac-
a sometimes long and labor intensive process. Note that some tices and with the functionality provided by vendor off-the-shelf
setpoints are typically time varying, such as aeration, agitation, products.
and nutrient feed rates. The situation is gradually becoming worse as smart sensors,
In teaming with process microbiologists, the author has had smart valves, and communication protocols such as fieldbus
experience with techniques in which a matrix of several biopro- and high speed Ethernet are quickly expanding the amount of
cess non-linear state equations is specified (a challenge in itself) information available to the process control system and plant
with parameter estimation (also non-trivial) then pursued using engineers are tempted to add more alarms in linking to some of
experimental data. A similar size matrix of adjoint equations is this information.
then specified and Hamiltonians, Pontryagin’s Maximum Princi- The companies doing a good job with alarms tend to be those
ple, and Calculus of Variations then used to compute an optimal that adhere to the fundamental definition of an alarm—i.e. that an
time varying control variable setpoint (e.g. a nutrient feed rate). alarm represents an abnormal condition that requires a response.
These efforts have always been very time consuming and have Lack of adherence to this is the single greatest source of nuisance
always resulted in better understanding of the bioprocess, but alarms and information overload to operators.
1472 J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475

From a vendor product perspective, many systems provide


little more than the ability to compare a current value with a
setpoint and generate an alarm should this gap exceed a user
specified value. If this alarm is then sent to a historian or alarm
logger, the alarm record contains little more than a calendar
time stamp, a cryptic description of the parameter in alarm, and
perhaps a value for the parameter in alarm. Alarm record sorting
capability is minimal as there is little alarm tag information to
sort on. Alarm operator acknowledge records may or may not
exist to help in understanding the history of and pursuing root
cause analysis of process deviation events.
Process control system historian vendors have had suffi-
ciently limited capability that many industrial companies have
resorted to use of third party alarm loggers. In the case of Eli
Lilly and Company, a real-time expert system was developed
and interfaced to the bioprocess historians (Alford et al., 1999a;
Alford et al., 1999b).
To illustrate with a bioprocess example: a DO2 alarm in a Fig. 7. Current process trend compared to historical average profile ±2S.D.
typical commercial control system might normally be generated
by comparing a DO2 probe output (current value) to a prede-
termined number (e.g. setpoint). If this difference exceeds a all of the above described “intelligent and remote alarming”
predetermined threshold, an output is triggered to control room functionality. One bioprocess manufacturing plant, following
displays and devices and a record is generated to the histo- installation of this system, reported a 4% yield increase and
rian containing the measured value, loop tag, and calendar time double digit reduction in variability, primarily due to ear-
stamp. However much more is possible. A smarter alarm might lier identification and better management of poor perform-
be generated by using coded “if-then-else” rules to look at all ing fermentations. Also, as the name “expert system” implies,
other relevant information within the automation system (e.g. such systems offer the opportunity to practically capture and
on-line oxygen uptake and kL a calculated parameters, redun- put on-line the expertise of experienced operators, engineers,
dant DO2 probes, the relative phase time, etc.), in which case a and technical service personnel in analyzing incoming data.
more accurate conclusion might be that the control DO2 probe is It is of interest that of several manufacturing plants report-
drifting and is no longer valid. Such a “smarter” alarm might then ing on productivity in the months following Lilly’s only early
indicate this more “intelligent conclusion” in the console presen- retirement program (in 1993 in which all plants suffered key
tation to operators, and also note the suggested “predetermined” personnel losses), the one using a real-time expert system
operator response to the alarm. If the facility is unmanned during was the only one reporting no significant plant productivity
portions of the week, the system would “page” the appropriate loss.
personnel. Alarm records in the historian would include fer- However, since real-time expert systems are third party
mentor #, lot #, batch phase/step identifier, alarm class (safety, functionality that must be separately supported, validated, and
product quality, environmental, etc.), and other tag informa- interfaced to other systems, the hope is that commercial pro-
tion to facilitate alarm record sorting and report generation. In cess control and historian vendors will evolve their systems to
addition, the capability would exist to display the alarm record accommodate more of this intelligent alarming and record anal-
information in “relative time” (i.e. time since the beginning of ysis functionality. Alford, Kindervater, and Stankovich (2005)
the batch or batch phase). For batch processes, such as most describe alarm management best practices in greater detail and
pharmaceutical processes, this “relative time” is far more valu- notes some of the things process control and historian commer-
able to analysts than “calendar time, since the importance of cial vendors can do to help.
an alarm (or any trend plot) is usually from the perspective of
when it occurred relative to the beginning of a batch or process 7. Historians and data analysis
phase.
As value adding as the above described smarter alarm might In general, process automation systems today are capturing
seem, the “end in mind” is suggestive of even more alarm large volumes of bioprocess data and storing them in databases.
functionality—specifically the ability to compare a parameter’s Further, computer control/historian utilities are available and
(e.g. DO2 ) value in a currently running batch process to the “time relatively easy to use so that data, in terms of trend plots, is
varying” range of values established by the historical average ±1 easy to visualize. In addition, tools are available to help mine
or 2 standard deviations of recent satisfactory production lots. the information and knowledge content in the stored data. These
The current value’s movement outside this time varying band tools include model building software (e.g. neural nets), statisti-
might then be the logic that triggers an alarm (see Fig. 7). cal analysis (e.g. JMP), principle component analysis, decision
Lilly’s use of real-time expert systems in most of their bio- tree generation, three dimensional graphics, rule-based systems
process development and manufacturing facilities accomplishes to analyze incoming data in real-time, etc.
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1473

Companies providing effective data analysis tools include


SAS, Aegis, Spotfire, Gensym, Curvacious, and several oth-
ers. Unfortunately, most of these tools are not embedded within
commercial historians and therefore require interfacing the his-
torians to third party systems—or at least manually downloading
selected data sets into Excel spreadsheets and then further trans-
ferring the data to the analysis tool.
The problem is that, while several excellent commercial
data analysis and mining products exist, they are not used
very much in the bioprocess industry. This has frequently been
attributed to the significant manual effort required in prepar-
ing data sets for presentation to the applicable tool. A rule of
thumb is that it takes about 90% of the overall data mining
project time and effort to select and prepare data sets for analy-
sis and about 10% of the time to run the data analysis tool. This
90% prep time is a major barrier to greater use of data mining
tools.
To help develop and evaluate bioprocess data mining tech-
niques, Lilly joined a MIT sponsored consortium on fermen-
tation data analysis in the early 1990s, chaired by Prof. Greg Fig. 8. Principle component analysis.
Stephanopoulos. As part of this effort, Lilly examined 2 years
of bioprocess data from one of their plants to find about 115
lots for which the data was perceived as valid. A large num- not any aspect of bioreactor control between inoculation and
ber of lots was necessary given the large number of vari- induction.
ables affecting the process and the high natural variability If this effort sounds daunting, this is just the effort required
of the process. For example, in just considering the inocula- in looking at a single unit operation, i.e. the fermentor. Consider
tion step, a presumed set number of cells from a seed tank the more general case when trying to determine the root causes
(based on measured liquid volume and estimated cell den- of variability in a down-stream operation (see Fig. 3). The cause
sity) must be transferred to the fermentor, during which time could be in any of the preceding unit operations, such as the
the cells could become oxygen starved or come into contact fermentation, or, in fact, in a manually controlled operation early
with insufficiently cooled transfer piping. Therefore, a fermen- in the life of the culture (see Fig. 2). In collecting data to analyze,
tor lot will not always start out with the same number of some of the data may need to be manually added to the database
viable cells—which can then significantly impact the subsequent (i.e. from manually monitored steps of the overall operation) and
growth, oxygen uptake, dissolved oxygen, and other trends ver- portions of the data may come from different historians, stored
sus time. in different formats. Further, there is typically not a one to one
Regarding whether fermentations were acceptable for anal- correlation of fermentor lots to purification lots (e.g. due to the
ysis, lots that became contaminated or had sensors that failed combining of lots for some operations) so lot tracking genealogy
during the run were identified from batch records and thrown is often a challenge.
out. Then, each of the process trends (about 30/lot) were called So, the key to greater progress in data mining is in evolving
up and visually inspected to look for data outliers (e.g. spuri- tools to facilitate data set preparation. Products such as Process
ous points due to momentary power failures during electrical Insights (from Pavillion) and Discoverant (from Aegis) have
storms). Outlier points were then deleted from the data set to be significant data collection and preparation functionality to help
used for analysis. with this.
The next challenge was having to estimate each parameter at
specific time intervals (required for some analysis tools). This 8. Concluding comment
required either averaging multiple data points or decompression
of compressed data in the historian for continuous variables (e.g. The state of bioprocess control is consistent with a recent
pH). For off-line assays, this required taking once/day assays and article (Spear, 2005) in which it is noted that the process control
estimating (via interpolation) values at other times. industry clearly has a problem. There are two speeds in play:
Once this work was done, the data was normalized and (1) the rapid pace of digital communications development and
presented in the format expected by the analysis tool. One (2) the more sedate rate (measured in years, if not decades) of
of the several mining techniques applied to the data set was many of its customers.
principle component analysis, shown in Fig. 8. One of the New biosensors, smarter field devices, and digital commu-
benefits of this analysis was the elimination of most pro- nications are about to give the bioprocess industry significantly
posed hypotheses regarding the difference between good and more data than it had before. The problem is that users are not
bad runs that relate to the time period between inoculation yet very good about efficiently consolidating, analyzing, and
(time = 0) and induction; i.e. the cause of bad runs is probably mining the data that they already have. Plants already have large
1474 J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475

numbers of nuisance alarms and data stays as data rather than up processes and in comparing different plants making the
becoming information and knowledge. same product).
If technologists learn to be smarter in what they do with the 7. The ability to display alarms, operator comments, off-line
data they already have, they will be in much better position to generated discrete assay data and process control continu-
intelligently manage the increase in data as new and smarter ous data on the same plot/screen.
field devices and measurement techniques are developed. 8. Interface with rule-based expert systems for real-time pro-
While pharmaceutical companies and vendors are working cess diagnostics, data analysis, and intelligent remote alarm-
on these improvement opportunities, research institutions can ing applications.
be working on: 9. Use of relative input–output addressing. This enables the
same application recipe to be used for any similarly instru-
1. Better understanding (including models) of the relationship mented bioreactor.
between intracellular metabolic pathways and extracellular 10. Data records which can be sorted, queried, and displayed
parameters. (Note that fermentor control is primarily about by tank #, lot #, process step/phase, and any other tagged
trying to influence the cells internal environment by manip- attributes.
ulating the external environment.) 11. Ability to view data plots and receive alarms from remote
2. Improving hybrid neural net functionality. locations, including employee homes.
3. Fast, efficient, and easy to configure plant scheduling tools. 12. Ability to decompress historized data into interpolated esti-
4. More user friendly versions of advanced control techniques. mates at periodic time intervals (required for some data
analysis tools).
9. Additional information: the Lilly fermentation 13. Ability to calculate bands (i.e. the time varying average
computer system value ±2 standard deviations) based on analysis of selected
historical data, for use as a backdrop in process control data
In the early 1970s, Lilly scientists and engineers defined trends.
a vision for the monitoring and control of bioprocesses that
could not be realized with commercial systems available at that Acknowledgments
time. Therefore, a team was organized that developed Lilly’s
customized fermentation process control and historian system The author wishes to acknowledge the commitment to excel-
(Alford, 1981, 1990). lence and the many opportunities given to him by Eli Lilly &
During the years that followed, Lilly incorporated new com- Co. management during his career. He further acknowledges the
puter science technology as it became commercially available. A contributions of his mentors, peers, and associates for their great
major PAT addition was the use of process mass spectrometers. help in projects and in his understanding of bioprocesses.
This system was replicated into all of Lilly’s major fermenta-
tion research, development, and manufacturing facilities during
the late 1970s and early 1980s. Plant site historians were elec- References
tronically linked together to facilitate sharing and comparing Alford, J. (1978). Modeling Cell Concentration in Complex Media, Biotechnol-
of process data. Interface to real-time expert systems started in ogy and Bioengineering: Vol. XX, John Wiley and Sons, pp. 1873–1881.
1990. Many of these systems are still operating today, over 25 Alford, J. (1981). Development of the fermentation computer system at Eli
years later. Lilly and Company. In Proceedings from the third international conference
Several features of this system remain as advanced function- on computer applications in fermentation technology.
Alford, J. (1990). Implementation of the custom built fermentation computer
ality as compared to today’s commercial systems. Automation system at Eli Lilly and Company. In An invited textbook chapter in computer
vendors should make note that these are practical to do and are control of fermentation processes. CRC Press.
value adding to customers: Alford, J. (2006). Bioprocess mass spectrometry: A PAT application. Journal of
Process Analytical Technology, 3(June (3)).
1. On-line changeability of all process control recipe logic. Alford, J., Cairney, C., Higgs, R., Honsowetz, M., Huynh, V., Jines, A., et al.
(1999a). Real rewards from artificial intelligence. Intech, 46(4), 52–55.
This is especially valuable for research and development Alford, J., Cairney, C., Higgs, R., Honsowetz, M., Huynh, V., Jines, A., et
operations when changes are frequent and cannot all be al. (1999b). On-line expert-system applications use in fermentation plants.
prospectively anticipated prior to a run. Intech, 46(7), 50–55.
2. Automated audit trails for on-line changes. Alford, J., Kindervater, J., & Stankovich, R. (2005). Alarm management for
3. Continuous and batch sequencing logic in a single configu- regulated industries. Chemical Engineering Progress, 101(April (4)).
Boeck, L., Wetzel, R., Burt, S., Huber, F., Fowler, G., & Alford, J. (1988).
ration environment (i.e. both paradigms implemented in the Sterilization of bioreactor media on the basis of computer-calculated
same recipe). thermal input, designated as Fo. Journal of Industrial Microbiology, 3,
4. Process control recipe creation that is simple enough that it 305–310.
can be done by technicians. Boeck, L., Alford, J., Pieper, R., & Huber, F. (1989). Interaction of media com-
5. Trend plots displayed in relative time (i.e. time since the ponents during bioreactor sterilization: Definition and importance of Ro.
Journal of Industrial Microbiology, 4, 247–252.
beginning of the batch—or batch step). Cockshott, A. R., & Hartman, B. E. (2001). Improving the fermentation medium
6. Process data from different fermentors at different plants for Echinocandin B production. Part II: Particle swarm optimization. Process
displayed on the same plot (especially valuable in scaling Biochemistry, 36, 661–669.
J.S. Alford / Computers and Chemical Engineering 30 (2006) 1464–1475 1475

Cockshott, A. R., & Sullivan, G. R. (2001). Improving the fermentation medium Spear, M. (2005). Time to turn up the speed? Process control developments
for Echinocandin B production. Part I: Sequential statistical experimental continue to outrun their uptake by industry. Chemical Processing, (June),
design. Process Biochemistry, 36, 647–660. 50.
Nyttle, V. G., & Chidambaram, M. (1993). Fuzzy logic control of a fed-batch Strobel, R., & Hartman, B. (1999). Particle swarm optimization for improv-
fermentor. Bioprocess Engineering, 9, 115–118. ing glycopeptide fermentation. Recent advances in fermentation technology,
Psichogios, D., & Ungar, L. (1992). A hybrid neural network—First principles RAFT III. Sarasota, FL.
approach to process modeling. AIChE Journal, 38(October (10)). U.S. Department of Health and Human Services, Food and Drug Administration
Rani, Y., & Rao, R. (1999). Control of fermentors—A review. Bioprocess Engi- (2004). Guidance for industry PAT: A framework for innovative pharmaceu-
neering, 21, 77–88. tical development, manufacturing, and quality assurance. Rockville, MD.
Rosenzweig, M. (2005). Plants get smarter. Chemical Processing, (June), Available on www.fda.gov/cder/guidance.
16–21. Vlassides, S., Ferrier, J., & Block, D. (2001). Using historical data for bioprocess
Shields, R. W., & Kao, E. I. (1994). A cost minimizing fermentation dis- optimization: Modeling wine characteristics using artificial neural networks
solved oxygen control algorithm. Bioprocess Engineering, 10(2), 91– and archived process information. Biotechnology and Bioengineering, 73(1),
98. 55–68.

You might also like