Professional Documents
Culture Documents
Data sets involving nonlinear, sparse grouped data are common in the health
Products Used
sciences, especially in drug trials, where they are used to measure drug absorp-
■ MATLAB®
tion, distribution, metabolism, and elimination. In this approach, patients are
■ Simbiology™
grouped using characteristics such as age, sex, weight, and smoking history.
Given the expense of drug trials, however, it is not always possible to obtain
sufficient patient data.
Nonlinear mixed-effects (NLME) mod- ditional error terms, and their distribu- a PK model, fitting the model to data, and
eling provides a good solution for mod- tions and covariances must be specified. analyzing the results.
eling sparse datasets. These models ac- Mixed-effects models provide a reason-
count for both fixed effects (population able compromise between ignoring data Visualizing and Preprocessing
the Data
parameters assumed to be constant each groupings entirely, thereby losing valu-
Data visualization and preprocessing re-
time data is collected) and random effects able information, and fitting each group
veals patterns in and distributions of the
(sample-dependent random variables). with a separate model, which requires
data. It also lets us deal with outliers and
In modeling, random effects act like ad- significantly larger sample sizes.
with bad or missing data points. For ex-
Using population pharmacokinetics (pop- ample, we may want to determine what type
PK) data as an example, this article demon- of elimination route phenobarbital has from
PopPK is the study of pharmacokinet- strates a workflow for implementing a nonlin- this infant population, or look at the ranges
ics (what the body does to a drug) in a
ear mixed-effects model using SimBiology™. of concentrations for each time point.
population of subjects receiving a drug
of interest. In preclinical or clinical stud-
The Phenobarbital Case Study1 Data can be imported from a number
ies, subjects are given a dosing regime.
Samples are then collected from each This well-known case study involved 59 pre- of sources, including text files, Excel files,
subject at designated time points to term infants who were given phenobarbital MAT files, and the MATLAB® workspace.
determine the concentration of the drug If a data file has common headers, such as
to prevent seizures during the first 16 days
in the patient over time. The goals of
popPK are to understand how subject- after birth. We will use the data collected ID or Time, SimBiology automatically rec-
specific characteristics determine PK during this study to estimate model pa- ognizes and stores the headers as the group
parameters and to identify sources of rameters to best fit this data. This involves and independent variables.
variability throughout the population.
visualizing and preprocessing data, creating
1
Grasela TH Jr, Donn SM. “Neonatal population pharmacokinetics of phenobarbital derived from routine clinical data.” Dev Pharmacol Ther 1985:8(6). 374-83. PubMed Abstract
Creating a PK Model
We begin by defining the core of a PK mod-
el, the base model. This often consists of a
number of compartments, a dosing type,
and an elimination route. We may have a
priori knowledge about the base model, or
observe a trend in our data that will sug-
gest where to start with a component. If we
have no trends or prior knowledge, we can
experiment with different base models and
see what works best.
Fitting the Model to Data Analyzing the Results The SimBiology desktop offers several
In this step, we estimate model parameters After fitting our model to the data, we’ll want prepackaged diagnostic plot types, includ-
based on the external data. In our example, to determine how well our fitting performed. ing trellis plots, which plot both the observed
we want to estimate the volume of the com- SimBiology generates two types of outputs: and predicted time courses of drug concen-
partment (Central) and the parameter rep- a data panel summarizing the results, and tration for each patient. The plot in Figure 3
resenting the clearance of the drug from the diagnostic plots specified in the Fit Param- shows that the predicted results accurately
compartment (Cl-Central). We will need to eters task. The data panel includes log-trans- replicate the observed data for four of the
calculate random effects for both. formed fixed estimates for the parameters, a subjects. Other plot types include observed
list of the random effects for each parameter versus predicted concentration values, box
In the Fit Parameters task on the SimBiol-
for each patient, a summary of statistics on plots of the random effects calculated for
ogy desktop (Figure 3), we specify param-
the fit, and the estimated covariance matrix. each parameter, residual errors over time,
eters that we want to estimate, parameters
We could make this an iterative workflow and the distribution of residuals.
for which we want to calculate random ef-
by examining the goodness-of-fit statistics, We can quickly capture our work by
fects, and the dataset to fit to. We also per-
such as Akaike and Bayesian information automatically creating an HTML or XML
form dataset mapping to identify group
criteria (AIC and BIC, respectively), from report in the SimBiology report generator.
and independent variables and specify the
various models and selecting the one that The report will be stored as a node in the
covariance pattern. We click Run to begin
best fits our data set. SimBiology project. A SimBiology project
the parameter fit.
Worldwide CONTACTS
The author would like to thank Priya Moorthy, www.mathworks.com/contact
Sam Roberts, and Sowmini Sampath for their
contribution to the demo on which this article e-mail
info@mathworks.com
is based.
91752v00 07/09
■ Computational Biology
www.mathworks.com/computational-
biology.html