You are on page 1of 10

Environmental Modelling & Software 26 (2011) 434e443

Contents lists available at ScienceDirect

Environmental Modelling & Software


journal homepage: www.elsevier.com/locate/envsoft

Overview of the atmospheric model evaluation tool (AMET) v1.1 for evaluating
meteorological and air quality models
K. Wyat Appel a, *, Robert C. Gilliam a, Neil Davis b, Alexis Zubrow b, Steven C. Howard a
a
Atmospheric Modeling and Analysis Division, National Exposure Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, RTP,
NC 27711, USA
b
University of North Carolina - Institute for the Environment, Chapel Hill, NC 27599, USA

a r t i c l e i n f o a b s t r a c t

Article history: This paper describes the details of the Atmospheric Model Evaluation Tool (AMET) v1.1 created by
Received 26 May 2010 scientists in the Atmospheric Modeling and Analysis Division (AMAD) of the U.S. Environmental
Received in revised form Protection Agency (EPA). AMET was first developed to evaluate the performance of the 5th Generation
7 September 2010
Mesoscale Model (MM5) and the Weather Research and Forecasting (WRF) meteorological model output
Accepted 30 September 2010
Available online 28 October 2010
and was later extended to include capabilities for evaluating output data from the Community Multiscale
Air Quality (CMAQ) model as well. AMET is designed to leverage several open-source software packages
that are used in combination to 1) pair the modeled and observed values in time and space, 2) store these
Keywords:
Air quality model evaluation
paired datasets in an easily accessible and searchable database and 3) access and analyze these data
Meteorological model evaluation using a statistical package. Through this process, AMET is able to provide a convenient method for
CMAQ evaluating meteorological and air quality model predictions. The use of a searchable, relational database
WRF allows the entire dataset to be quickly subset into only those data that are of the most interest for the
MM5 current analysis, a process that is often tedious and time consuming without the use of a database. In
addition to common summary statistics (e.g. RMSE, bias, and correlation), several of the many analysis
features available in AMET include scatter plots, time series plots, box plot and spatial plots as part of
operational model evaluation. Additionally, several unique analysis functions are also available in AMET,
and the system provides a framework within which users may extend the current functionality for their
own custom analyses. While AMET was designed to work specifically with MM5, WRF and CMAQ model
output, it could easily be modified to work with output data from other meteorological and air quality
models.
Published by Elsevier Ltd.

Software Availability Year first available: 2007


Software Required: MySQL Database Software (http://www.mysql.
Software name: Atmospheric Model Evaluation Tool (AMET) com) R statistical software (http://www.r-project.org)
Developers: Robert C. Gilliam and K. Wyat Appel, Neil Davis and Program Language: Perl; Fortran;
Alexis Zubrow Availability: Available for download through the Community Model
Address: 109 T.W. Alexander Drive, Research Triangle Park, NC and Analysis System (CMAS) website (http://www.
27711 cmascenter.org)
Email Addresses: appel.wyat@epa.gov, gilliam.robert@epa.gov
1. Introduction

Evaluation of model simulations, which usually includes


Abbreviations: Atmospheric Model Evaluation Tool, AMET; Weather Research
and Forecasting, WRF; 5th Generation Mesoscale Model, MM5; Community Mul- comparing the model predicted values against the available
tiscale Air Quality, CMAQ; Atmospheric Modeling and Analysis Division, AMAD; observed values, is critical to establishing the model’s credibility
Environmental Protection Agency, EPA; Community Modeling and Analysis System, (Dennis et al., 2010). For the development of the Community
CMAS; Meteorological Assimilation Data Ingest System, MADIS; Meteor- Multiscale Air Quality (CMAQ; Byun and Schere, 2006; Foley et al.,
ologyeChemistry Interface Processor, MCIP.
* Corresponding author. US EPA, 109 TW Alexander Dr., Research Triangle Park,
2010), evaluation of model predicted concentrations for many
NC 27711. Tel.: þ1 91 954 10757; fax: þ1 91 954 11379. different atmospheric pollutants is required since the model is used
E-mail address: appel.wyat@epa.gov (K. Wyat Appel). to make regulatory decisions by the U.S. Federal and State

1364-8152/$ e see front matter Published by Elsevier Ltd.


doi:10.1016/j.envsoft.2010.09.007
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 435

The AMET Flow Chart

Observations Model Output Model


Evaluation R Scripts

• Meteorological • MM5/WRF (MET) Database


Domain Statistics
• Air Quality • CMAQ (AQ)
MySQL relational Diurnal Statistics
database stores all
Observation-Model model-observation Time Series Plots
Synchronization pairs
Spatial Plots
Match observations with model Convienient method
predictions in time and space to store data from a Box Plots
(Fortran) large number of
model simulations Scatter Plots

Generate database records Bar Plots


(Fortran)
Other, User-developed tools “Soccer Goal”
Plots
Open source format allows for
Populate Database user community development “Bugle” Plots
(Perl) of new tools and analysis
techniques Histograms

Fig. 1. Flow chart showing the basic structure of the AMET meteorological and air quality modules.

Table 1
List of R scripts currently available in the AMET meteorological and air quality modules.

Script Name AMET Module Description


run_met_aq Meteorological/Air Quality Compares air quality and meteorological data. Creates a spatial plot of correlation and
a time series plot of bias.
run_spatial_surface Meteorological Creates a series of maps comparing surface observations and model values.
run_summary Meteorological Creates a summary plot of a series of meteorological variables. Includes time series,
scatter and box plots.
run_time series Meteorological Creates a time series plot of temperature, mixing ratio, wind speed and direction for one
or more sites.
run_daily_barplot Meteorological Creates a series of daily bar plots for all sites in the domain.
run_plot_prof Meteorological Creates time-height plots of wind speed and direction profile statistics.
run_plot_raob Meteorological Creates vertical profile plots of rawinsonde sites.
run_wind_prof Meteorological Creates wind arrow profiles for a set of specific monitoring sites.
run_boxplot Air Quality Creates a box plot of observed and modeled quartiles.
run_boxplot_hourly Air Quality Creates a 24-h diurnal box plot of hourly data.
run_bugleplot Air Quality Creates a “bugle plot”, which plot bias and error as a function of concentration and includes
curves representing performance goals.
run_pave_overlay Air Quality Creates a PAVE overlay file of observed values.
run_plot_spatial Air Quality Creates spatial plots of the observed values, model values, and the difference between the two.
run_scatterplot Air Quality Creates a scatter plot of observed versus modeled values and includes select summary statistics
on the plot.
run_scatterplot_MtoM Air Quality Creates a model-to-model scatter plot at the selected network sites.
run_scatterplot_single Air Quality Creates a scatter plot for sites from a single network and includes many statisics on the plot.
run_scatterplot_skill Air Quality Creates a scatter plot that includes forecast skill metrics for ozone.
run_soccerplot Air Quality Creates a “soccer goal” plot, which plots bias verses error with performance lines in the shape of a
soccer goal.
run_stacked_barplot Air Quality Creates a stacked bar plot of total PM2.5 mass from CSN data and color codes the contribution from sulfate,
nitrate, ammonium, total carbon and other PM mass.
run_stats_plots Air Quality Creates plots of NMB, NME, FB, FE and correlation, along with a comma delimited file with various statistics.
run_time series Air Quality Create a time series plot for a site or multiple sites (averaged) for a select time period.
436 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443

Fig. 2. Sample plots from the run_summary script. The plot on the left is for temperature, while the plot on the right is for wind direction.

Fig. 3. Sample plot from the run_plot_prof script from the AMET meteorological module. The color shading and white lines indicates model-predicted wind speed, while the black
dots indicate the model-predicted planetary boundary layer height.
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 437

Fig. 4. Sample bias plots from a) the run_spatial_surface script from the AMET meteorological module and b) the run_stats_plots script from the AMET air quality module.
438 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443

continental United States and the Northern Hemisphere. The result


are terabytes of data available for analysis against observed data
which conventional methods of data analysis (e.g. spreadsheets) are
not well suited. In addition to longer duration simulations and larger
domains, the shorter wall-clock model run times have resulted in
many more model simulations being performed. The increase in the
duration, spatial size and number of model simulations all require
greater organization and efficiency to systematically evaluate the
large amounts of data being produced. While some model evalua-
tion software tools already exist (e.g. the Meteorological Evaluation
Tool available for evaluating WRF simulations), there are limited
free, open-source software tools available to perform comprehen-
sive meteorological and air quality model evaluations.
The Atmospheric Model Evaluation Tool (AMET) was developed by
the United States Environmental Protection Agency’s (EPA) Atmo-
spheric Modeling and Analysis Division (AMAD) to aid in its evalua-
tion of the meteorological and air quality models within the CMAQ
modeling system. Since its initial development, AMET has been
updated and maintained through the Institute of the Environment at
the University of North Carolina at Chapel Hill. AMET organizes,
provides consistency and speeds-up the evaluation process for
operational meteorological and air quality model simulations.
Although it was developed specifically to aid in the evaluation of the
CMAQ modeling system, the AMET software can be easily modified to
work with other modeling systems.
This paper describes the structure and functionality of the AMET
software. A broad description of the AMET software will be
provided, followed by a detailed description of how the AMET
software works, including a description of process by which
observed and modeled values are paired, how those paired data are
organized and stored in the database, and an overview of the
analysis scripts currently available in AMET. Finally, examples of the
analysis products available from the AMET software will be pre-
sented and described.

2. Tool description

AMET is used to pair observations and gridded model output in


space and time and to evaluate the model performance for a set of
predicted or diagnosed atmospheric fields. There are separate
modules in AMET for evaluating meteorological and air quality model
output. This separation is necessary because both the observed and
predicted meteorological and air quality data are quite different,
utilizing different file formats for both the observed and model data.
In addition, the observed meteorological and air quality data are often
obtained from networks that use different sampling protocols, which
can make pairing meteorological and air quality data difficult. One
advantage of separate meteorological and air quality modules in
AMET is that the modules can be installed individually, allowing
a user to reduce installation time and complexity if only meteoro-
Fig. 5. Sample plots from the a) run_scatterplot script and b) run_scatterplot_skill
logical or air quality analysis is required. Although the meteorological
script available in the AMET air quality module. The run_scatterplot script can be used
for any species, while the run_scatterplot_skill script was designed specifically for
and air quality evaluations use separate modules, there is an analysis
assessing model performance for ozone. tool available in the software that allows for cross-comparison of the
meteorological and air quality data.
governments. The evaluation is further complicated by the AMET is built on several open-source software packages and
requirement of evaluating the meteorological predictions that are uses the MySQL (http://www.mysql.com) database software to
used to provide gridded meteorological data to the CMAQ model. store the paired observation and model output along with various
Less than a decade ago, CMAQ model simulations typically metadata. The pairing is performed by several scripts, which also
spanned only several weeks to a month (in some rare cases several insert the paired data into the database using Fortran and Perl
months) and were limited spatially (e.g. eastern half of the United programs. Finally, the analysis of the data utilizes the R statistical
States). The increase in computing power over time has resulted in software package (http://www.r-project.org/). The AMET software
larger model domains, longer simulation time periods and a greater includes cshell scripts which help streamline the installation
number of model simulations (e.g. model sensitivities). Today, process, including setting up the database structure for use with
CMAQ simulations routinely span several months to a year and in AMET. Fig. 1 presents a flow chart of the basic structure of the AMET
some cases multiple years, and have expanded spatially to cover the software.
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 439

Fig. 6. Sample plots from the run_timeseries script available in the AMET air quality module. The top plot shows the observed (black open circles; from the IMPROVE network in
this example) and CMAQ predicted (red open triangles) concentrations. The bottom plot shows the time series of model bias (blue open circles) and error (red open triangles). The
dashed line represents the average error for the period.

2.1. Pairing observations and modeled outputs In most cases, the Site Compare program is able to read the
observation data files that are provided directly from the major air
The first step required as part of the AMET software is to pair the quality networks in the United States. The air quality networks that
observed and modeled values in space and time. In the air quality Site Compare/AMET currently supports are the Chemical Speciation
module of AMET, this is accomplished through the Site Compare Network (CSN; previously STN), the Interagency Monitoring of
software, which is a Fortran program released as a tool along with PROtected Visual Environments (IMPROVE; http://vista.cira.
the CMAQ model and is available for download through the colostate.edu/improve/) network, the Clean Air Status and Trends
Community Modeling and Analysis System (CMAS; http://www. Network (CASTNET; http://www.epa.gov/castnet/), the National
cmascenter.org/) website (packaged as part of AMET). Site Atmospheric Deposition Program (NADP; http://nadp.sws.uiuc.edu/)
Compare performs matching using the nearest neighbor approach, networks and the SouthEastern Aerosol Research and CHaracter-
and applies temporal averaging to the modeled data if required (e.g. ization Study experiment (SEARCH; http://www.atmospheric-
if an hourly CMAQ output file and daily average observation are research.com/studies/SEARCH/index.html). Some data obtained
provided Site Compare will compute a daily averaged CMAQ value through the EPA’s Air Quality System (AQS) may require a pre-
for the correct location to match with the observational data). The processor that is included with the Site Compare software. Users
software provides a means of matching modeled and observed should obtain the observation data directly from the various network
species with different names and combining the modeled or resources and then refer to the AMET user’s guide for instructions on
observed species to create derived species (e.g. total carbon aerosol how use those data with AMET.
from elemental and organic carbon). In addition to the Site Compare For the meteorology module of AMET, all pairing of observations
software, text files containing the site information (site name, with model data is done using Perl. The model output file is probed
location, time-zone, etc.) for each network are required and are for time/date information and grid projection. The Perl script cycles
provided for several U.S. networks. The Site Compare software is run through each model output time, extracts corresponding observa-
separately for each observation network and individual comma tions, determines the location of the observation sites relative to the
separated files are created for each network which contain the model grid, pairs the observations with the model values and inserts
paired observed and model data for any or all (user specified) of the the paired data into the MySQL database. The meteorological
species available from that network. The paired data are then matching is performed using either bilinear interpolation or nearest
inserted into the MySQL database using a Perl script. neighbor. Additionally, AMET attempts to match near-coastal
440 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443

Currently, the meteorological model of AMET is designed to work


with meteorological output from either the 5th Generation Mesocale
Model (MM5; Grell et al., 1994), the Weather Research and Fore-
casting (WRF; Skamarock et al., 2008) model or the Meteor-
ologyeChemistry Interface Processor (MCIP; Otte and Pleim, 2010). A
pre-processor is required for MM5 model output to be used with
AMET, however no pre-processor is required for the WRF or MCIP
output as those data can be read directly by the Perl module. The
AMET meteorological module supports evaluation of both surface
and upper-air meteorological fields (surface only for MCIP data). The
surface fields currently available in the AMET meteorological module
include temperature, water vapor mixing ratio, wind speed, and wind
direction. The upper-air fields available for analysis are wind speed
and direction from vertical wind profilers; wind speed and direction
and potential temperature from aircraft profile (ACARS) data; and
potential temperature, relative humidity, wind speed and direction
from the National Weather Service balloon soundings.

2.2. The AMET database structure

MySQL is an open-source relational database software that can


used to store and quickly access large amounts of data. Aside from
providing a convenient method for storing the observationemodel
pairs, the use of a relational database allows the entire set of data to
be quickly subset based on any criteria available in the database
(e.g. by specific date, location or location metadata). The ability to
subset the data quickly is extremely useful since model perfor-
mance often varies spatially and temporally, and the spatial and
temporal subsets are often different for each pollutant or chemical
species. The MySQL software is free to non-commercial users and is
available for many different computing platforms.
In AMET, the MySQL database consists of project tables that
contain all the paired data for a particular model simulation. Each
project table consists of individual “records”. A record consists of the
paired observed/modeled data for a particular site at a particular
time, along with additional information indicating the date and time
of the record and the location of the site given in lat/lon coordinates.
A single project may consist of hundreds to thousands to millions of
records depending on the length and spatial size of the simulation.
For any given record in the AMET database, several important
data fields are required. The record must include a site name or site
identification number, location information in the form of latitude
and longitude in order to plot the site data in space, and the start
and end date for the observation.
While it should be obvious that the start and end times of obser-
vations vary, it is also possible that the duration of the observation can
vary as well, ranging from very short durations on the order of an
Fig. 7. Sample plots from the a) run_boxplot script and b) run_boxplot_hourly script hour, to longer averaging time periods (e.g. a week). As such, the start
available in the AMET air quality module. The shading represents the inter-quartile and end time data are used to plot the records in time and are also
range of the observations and model predictions, while the lines represent the median used for temporal averaging of the data. Finally, the record should
of the observations and model predictions. contain all the observational and modeled data for that particular site
and time, which is generated through the data pairing programs. The
number of fields depends on the number of different variables
monitors with land based grid-cells as opposed to oceanic grid-cells, measured at a particular site, with some observational networks
and both the model output and observations are considered to be on measuring only a few different variables, while other networks can
identical temporal resolutions (i.e. instantaneous values represent- have a large number of different measurements available.
ing 1 h). The meteorological module is designed to use observations In addition to the individual project tables, the database also
from the Meteorological Assimilation Data Ingest System (MADIS; contains separate project log tables for the meteorological and air
http://madis.noaa.gov). The MADIS is a widely used, comprehensive quality modules. The project log tables store information about each
meteorological observational dataset, and requires a free subscrip- project, including when the project was created, who created it, and
tion to access the observation data. Once a subscription to the MADIS specific information about the model simulation. The database also
has been established, AMET can either automatically download the has site tables for the two modules that store metadata about each
observation data for the period of the model simulation, or the data observation site, data which can then be used in database queries to
can be retrieved prior to the execution of the script. subset the records.
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 441

Fig. 8. Sample plot from the run_barplot script from the AMET air quality module for two different CMAQ model simulations. The height of the bars represent the total PM2.5 mass,
 þ
while the color shading represents the contributions of sulfate (SO2
4 ), nitrate (NO3 ), ammonium (NH4 ), total carbon TC), and other PM2.5 mass (Other) to the total PM2.5 mass.

2.3. R statistical package scripts quality modules, along with a brief description of the script func-
tions. These scripts are used to generate statistics and plot the
The R project for Statistical Computing, “R”, is an open-source results using the various plotting functions available in the R soft-
software for statistical computing and graphics based on the “S” ware. Many of the output products available in AMET are frequently
language, and is available for many different computer platforms. used in the air quality modeling community. In addition to using the
The R software can be used through a command line interface or provided R scripts, the more advanced user could use these scripts as
with the use of scripts, and contains a large number of functions for a template for creating new R scripts tailored to their particular
computing common statistical metrics and generating graphics (e.g. application and analysis.
box plots and scatter plots). Additionally, the R software contains
a library for interfacing with the MySQL database software. All these 3. Examples of tool applications
features make R an attractive computational and plotting software
package for AMET. In 2007, AMETv1.1 was released as a tool by CMAS as part of the
The information stored in the database in each project table is CMAQv4.6 model release. The AMET installation and users guide
queried through individual cshell scripts for each R analysis script (http://www.cmascenter.org/help/documentation.cfm?
(the R analysis scripts are describing in detail in the next section). MODEL¼amet&VERSION¼1.0) provide detailed information on
The R software has available a MySQL library which contains how to set-up and run AMET with the CMAQ modeling system.
commands for interfacing with the MySQL database. Through the Since its release in the fall of 2007, the AMET has been downloaded
various commands included in the R MySQL library, the database over 550 times, indicating the tool has garnered significant interest
can be queried based on any of the information stored in the data- in the meteorological and air quality communities.
base tables. For example, one could limit their data analysis to For the past five years, AMET has been used extensively by
a single day or week, or to a single state or collection of states. The AMAD for research using the CMAQ modeling system (Appel et al.,
AMET scripts include instructions for performing common queries 2007, 2008, 2010; Pleim and Gilliam, 2009; Gilliam and Pleim,
(e.g. queries based on a date range or location). However, anyone 2010; Otte, 2008a,b). Additionally, AMET has been used by EPA’s
familiar with basic database queries could easily formulate their Office of Air Quality Planning and Standards (OAQPS) to aid in the
own query criteria to subset the data in any way they choose. evaluation of their air quality simulations for federal rule making
The AMET software already includes a collection of R scripts that applications and by other groups to evaluate MM5 and WRF model
perform various analyses such as computing standard statistical simulations for efforts not necessarily associated with CMAQ model
metrics such as bias, error and correlation; creating common applications. AMET is currently under active development at the
statistical plots such as scatter plots and time series plots; and Institute for the Environment and at the EPA.
creating unique plots designed specifically for AMET. Table 1 lists the The following (Figs. 2e9) highlight some key analysis capabil-
R scripts currently available for the AMET meteorological and air ities of AMET (refer to Table 1 for additional information). Fig. 2
442 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443

module, while Fig. 4b is a similar plot of Normalized Mean Bias


(NMB) available from the run_stats_plots script in the air quality
module. These plots are intended to provide a spatial assessment of
the model performance. Both scripts have the capability of plotting
other statistics, including error (absolute or normalized), Root
Mean Square Error (RMSE) and correlation. The run_stats_plots
script also includes a comma separated file containing numerous
statistics in addition to those plotted by default. These scripts are
often used early in a model assessment effort to identify regions
where model performance may be deficient and where further
investigation using the other analysis scripts available in AMET
would be most useful.
Fig. 5 shows examples of the run_scatterplot and run_scatter-
plot_skill scripts, two of the scatter plot scripts available in the
AMET air quality module. Along with the run_stats_plots, the
scatter plot is often used early in a model assessment effort to
gauge the overall model performance and identify any large model
biases that may be present. The run_scatterplot script allows
multiple observation networks and model simulations to be
plotted, along with up to five summary statistics (which can be
changed by the user). The run_scatterplot_skill script is designed
specifically for assessing ozone model performance based on the
ambient air quality standard for ozone (currently 75 ppb). Along
with standard summary statistics, the plot also provides several
forecast skill metrics based on the ozone standard, including
accuracy, probability of detection (POD) and false alarm rate (FAR).
A sample plot from the plot_time series script available in the air
quality module is shown in Fig. 6. The script creates two plots by
default, the top plot showing the observed and model predicted
values, and second plot below showing the corresponding model bias
and error. The plot can be used to plot individual sites, or in the case of
multiple sites, the values are averages. A similar time series plot is
available for the meteorological module, which also allows the user to
group multiple sites into an averaged time series plot. The time series
plots are particularly useful for examining model performance over
time and for analyzing the model performance at individual sites.
Fig. 7 presents examples of the run_boxplot script for the CSN
Fig. 9. Sample plots from the a) run_soccerplot script and b) the run_bugleplot script
daily sulfate and the run_boxplot_hourly script for hourly ozone
from the AMET air quality module.
from the AQS available in the AMET air quality module. Plots like
these are useful for assessing how well the air quality model
presents examples of the run_summary script available in the simulates both the average atmospheric concentration (by
AMET meteorological module. The plot on the left is for near- comparison of the median values) and the distribution of the inter-
surface temperature and includes a scatter and box plot, while the quartile range (25%e75%) of concentrations.
plot on the right is for wind direction and includes a directional Fig. 8 illustrates an example of the run_barplot script from the
scatter plot and histogram. Wind speed and moisture analyses are AMET air quality module, showing the total fine particulate mass
also available, but are not shown here. The summary plots available (PM2.5) concentration (indicated by the height of the bar) from the
in AMET are particularly useful for quickly gauging the overall observations and two different CMAQ model simulations. The
performance of a meteorological simulation. The scatter plot and different colors represent the contribution to the total mass from
wind direction error plot allow for quick assessment of where the sulfate, nitrate, ammonium, total carbon and other PM2.5. The
largest bias and error are in a simulation, while the summary stacked bar plot is extremely useful for determining which species
statistics provide an overall assessment of the simulation perfor- contribute the largest amount to the overall concentration of PM2.5
mance in reproducing near-surface observations. mass, while also determining how well each individual species is
Fig. 3 presents an example of one of the time-height plots available simulated by the model.
in the AMET meteorological module. The color shading represents the Fig. 9 presents examples of the run_soccerplot and run_bugle-
model predicted wind speed, while the black dots represent the plot scripts available in the AMET air quality module. The soccer
height of the model predicted boundary layer. The observed wind goal plot (Fig. 9a) is essentially a scatter plot, with a normalized bias
speed, along with statistical metrics such as Root Mean Square Error metric plotted on the x-axis and a normalized error metric plotted
(RMSE), bias and the index of agreement are also provided by this on the y-axis. Lines are plotted at pre-determined bias and error
analysis script. This type of script is useful for determining how well values, which together form a box in a shape similar to a soccer
the meteorological model simulates the boundary layer characteris- goal. Any number of species from any number of networks can be
tics at different times of the day. Correct simulation of the boundary plotted on the soccer plot, allowing for a quick assessment of the
layer is particularly important for accurate predictions of atmospheric overall performance of the air quality simulation.
pollutants from air quality models. The bugle plot is designed to be used with a single species from
Fig. 4a provides an example of a spatial plot of model bias from any number of networks, and plots either a normalized bias metric
the run_spatial_surface script available in the AMET meteorological (Fig. 9b) or normalized error metric (not shown) against the
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 443

average species concentration. Criteria and goal lines, which vary as easily compute common indicator ratios (e.g. O3/NOY); improving
a function of concentration, are also included on the plot and form existing scripts to add flexibility and improve the quality of the plots;
a shape similar to that of a bugle. The idea behind the bugle plot is improving the organization of the database to increase overall speed;
that at lower concentrations the acceptable range for bias and error and improving the AMET interface. One possibility for improving the
is larger than at higher concentrations. AMET interface would be the development of a JAVA-based interface
Beyond the examples described above, the structure of the AMET that will enable AMET to be easily used on multiple computing
software allows for various other types of analyses. While most of platforms. Many of these improvements are currently in develop-
the examples above utilize surface observations, AMET is capable of ment and will be included with the next release of the AMET software.
analyzing just about any type of data, including aloft observations
(e.g. radiosondes, ozonesondes). Additionally, while the existing Acknowledgements
analysis scripts in AMET are based on data paired in time and space,
through the development of new R scripts AMET can easily be The authors would like to thank Sarav Arunachalam with the
extended to include analysis of unpaired data such as cumulative University of North Carolina’s Institute for the Environment for
distributions, spectral density analysis and principle component helping develop the AMET release code and the AMET installation
analysis. In its most basic form, the AMET software provides and users guides. The authors would also like to thank Sharon
a convenient way to store, organize and access observation and Phillips of the EPA’s Office of Air Quality Planning and Standards,
model data for any type of analysis that a user may wish to perform. Air Quality Analysis Division for her help with the testing and
evaluation of the AMET software.
4. Conclusions
Appendix
AMET is a comprehensive software package that is used to pair
observations with gridded model data for meteorological and air Disclaimer e The United States Environmental Protection
quality related applications. AMET performs statistical calculations Agency through its Office of Research and Development funded and
and creates a variety of output products to enable researchers to managed the research described here. It has been subjected to
better understand and evaluate model predictions and improve the Agency review and approved for publication.
science within the models. AMET output includes tabular statistics,
graphical representations of statistics, and spatial plots. AMET is built
References
entirely on open-source software, and while currently set-up to work
with MM5, WRF and CMAQ model data, the software could easily be Appel, K.W., Gilliland, A.B., Sarwar, G., Gilliam, R.C., 2007. Evaluation of the
extended to work with other models. For example, the AMET soft- Community Multiscale air quality (CMAQ) model version 4.5: sensitivities
ware has been used to evaluate Comprehensive Air Quality Model impacting model performance: Part IeOzone. Atmos. Environ. 41, 9603e9615.
Appel, K.W., Bhave, P.V., Gilliland, A.B., Sarwar, G., Roselle, S.J., 2008. Evaluation of
with Extensions (CAMx; ENVIRON, 2010) model output with the use the Community Multiscale Air Quality (CMAQ) model version 4.5: sensitivities
of file conversion software available for that model. impacting model performance; Part IIeparticulate matter. Atmos. Environ. 42,
The AMET software is designed to organize observed and pre- 6057e6066.
Appel, K.W., Roselle, S.J., Gilliam, R.C., Pleim, J.E., 2010. Sensitivity of the Community
dicted data into a relational database that can be easily accessed to Multiscale Air Quality (CMAQ) model v4.7 results for the eastern United States
help improve the efficiency in which a model’s performance can be to MM5 and WRF meteorological drivers. Geosci. Model. Dev. 3, 169e188.
assessed. AMET has been used by various groups in the meteoro- Byun, D.W., Schere, K.L., 2006. Review of the governing equations, computational
algorithms, and other components of the Models-3 Community Multiscale Air
logical and air quality communities to synthesize model results and Quality (CMAQ) modeling system. Appl. Mech. Rev. 55, 51e77.
enhance model development. The use of a common model evaluation Dennis, R., Fox, T., Fuentes, M., Gilliland, A., Hanna, S., Hogrefe, C., Irwin, J., Rao, S.T.,
tool allows for better understanding of the model evaluation process Scheffe, R., Schere, K., Steyn, D., Venkatram, A., 2010. A framework for evalu-
ating regional-scale numerical photochemical modeling systems. Environ. Fluid
and easier interpretation of the evaluation results.
Mech.. doi:10.1007/s10652-009-9163-2.
AMET will continue to grow and improve through internal ENVIRON, 2010. User’s Guide to the Comprehensive Air Quality Model with
development within AMAD, external development through Extensions (CAMx) Version 5.20. Available at. ENVIRON International Corpo-
ration, 773 San Marin Drive, Suite 2115Novato, California 94998. http://www.
collaborating partners, and through contributions and suggestions
camx.com/files/CAMxUsersGuide_v5.20.pdf.
from the user community. Several signinficant improvements to Foley, K.M., Roselle, S.J., Appel, K.W., Bhave, P.V., Pleim, J.E., Otte, T.L., Mathur, R.,
the AMET software are planned for the next release of the tool. One Sarwar, G., Young, J.O., Gilliam, R.C., Nolte, C.G., Kelly, J.T., Gilliland, A.B.,
improvement will be to combine the meteorological and air quality Bash, J.O., 2010. Incremental testing of the Community Multiscale Air Quality
(CMAQ) modeling system version 4.7. Geosci. Model. Dev. 3, 205e226.
modules into a single streamlined module that will allow for Gilliam, R.C., Pleim, J.E., 2010. Performance assessment of new land-surface and
greater cross-comparability of the meteorological and air quality planetary boundary layer physics in the WRF-ARW. J. Appl. Meteor. Climatol.
data. A second improvement focuses on greatly expanding the doi:10.1175/2009JAMC2126.1.
Grell, G.A., Dudhia, A.J., Stauffer, D.R., 1994. A description of the Fifth-Generation
types of datasets that can be used for model evaluation. These PennState/NCAR Mesoscale model (MM5). NCAR Technical Note NCAR/TN-
changes will allow the user to include non-standard observational 398þSTR. Available at. http://www.mmm.ucar.edu/mm5/doc1.html.
datasets, such as field campaign data or observations from outside Otte, T.L., 2008a. The impact of nudging in the meteorological model for retro-
spective air quality simulations. Part I: evaluation against national observation
of the U.S. with no changes to the underlying AMET code. Addi- networks. J. Appl. Meteor. Climatol. 47, 1853e1867.
tionally, the use of satellite data has been streamlined with the Otte, T.L., 2008b. The impact of nudging in the meteorological model for retro-
development of tools to regrid satellite data to regular grids, (e.g. spective air quality simulations. Part II: evaluating collocated meteorological
and air quality observations. J. Appl. Meteor. Climatol. 47, 1868e1887.
regrid swath data to the CMAQ lambert conformal grid), as well as Otte, T.L., Pleim, J.E., 2010. The meteorology-Chemistry interface processor (MCIP)
to calculate comparable variables (e.g. column aggregated aerosol for the CMAQ modeling system. Geosci. Model. Dev. 3, 243e256.
optical depth values from model output). Pleim, J.E., Gilliam, R.C., 2009. An Indirect data Assimilation Scheme for Deep Soil
temperature in the PleimeXiuLand surface model. J. Appl. Meteor. Climatol. 48,
Additional future enhancements to the AMET software will
1362e1376.
include implementing new analysis scripts (e.g. Taylor plots); adding Skamarock, W.C., Klemp, J.B., Dudhia, J., Gill, D.O., Barker, D.M., Duda, M.G., Huang,
non-parametric analysis techniques such as cumulative distribution X-Y, Wang, W., and Powers, J.G., 2008. A description of the advanced research
frequency plots, QeQ plots and histograms; adding temporal WRF version 3. NCAR Tech Note NCAR/TN 475 STR, 125 pp, Available from UCAR
Communications, P.O. Box 3000, Boulder, CO 80307.
decomposition capabilities such as Kolmogorov-Zurbenko filtering Zurbenko, I.G., 1986. The spectral analysis of time series. North-Holland Series in
(Zurbenko, 1986) and spectral density plots; adding the ability to Statistics and Probability.

You might also like