Professional Documents
Culture Documents
Overview of the atmospheric model evaluation tool (AMET) v1.1 for evaluating
meteorological and air quality models
K. Wyat Appel a, *, Robert C. Gilliam a, Neil Davis b, Alexis Zubrow b, Steven C. Howard a
a
Atmospheric Modeling and Analysis Division, National Exposure Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, RTP,
NC 27711, USA
b
University of North Carolina - Institute for the Environment, Chapel Hill, NC 27599, USA
a r t i c l e i n f o a b s t r a c t
Article history: This paper describes the details of the Atmospheric Model Evaluation Tool (AMET) v1.1 created by
Received 26 May 2010 scientists in the Atmospheric Modeling and Analysis Division (AMAD) of the U.S. Environmental
Received in revised form Protection Agency (EPA). AMET was first developed to evaluate the performance of the 5th Generation
7 September 2010
Mesoscale Model (MM5) and the Weather Research and Forecasting (WRF) meteorological model output
Accepted 30 September 2010
Available online 28 October 2010
and was later extended to include capabilities for evaluating output data from the Community Multiscale
Air Quality (CMAQ) model as well. AMET is designed to leverage several open-source software packages
that are used in combination to 1) pair the modeled and observed values in time and space, 2) store these
Keywords:
Air quality model evaluation
paired datasets in an easily accessible and searchable database and 3) access and analyze these data
Meteorological model evaluation using a statistical package. Through this process, AMET is able to provide a convenient method for
CMAQ evaluating meteorological and air quality model predictions. The use of a searchable, relational database
WRF allows the entire dataset to be quickly subset into only those data that are of the most interest for the
MM5 current analysis, a process that is often tedious and time consuming without the use of a database. In
addition to common summary statistics (e.g. RMSE, bias, and correlation), several of the many analysis
features available in AMET include scatter plots, time series plots, box plot and spatial plots as part of
operational model evaluation. Additionally, several unique analysis functions are also available in AMET,
and the system provides a framework within which users may extend the current functionality for their
own custom analyses. While AMET was designed to work specifically with MM5, WRF and CMAQ model
output, it could easily be modified to work with output data from other meteorological and air quality
models.
Published by Elsevier Ltd.
Fig. 1. Flow chart showing the basic structure of the AMET meteorological and air quality modules.
Table 1
List of R scripts currently available in the AMET meteorological and air quality modules.
Fig. 2. Sample plots from the run_summary script. The plot on the left is for temperature, while the plot on the right is for wind direction.
Fig. 3. Sample plot from the run_plot_prof script from the AMET meteorological module. The color shading and white lines indicates model-predicted wind speed, while the black
dots indicate the model-predicted planetary boundary layer height.
K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443 437
Fig. 4. Sample bias plots from a) the run_spatial_surface script from the AMET meteorological module and b) the run_stats_plots script from the AMET air quality module.
438 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443
2. Tool description
Fig. 6. Sample plots from the run_timeseries script available in the AMET air quality module. The top plot shows the observed (black open circles; from the IMPROVE network in
this example) and CMAQ predicted (red open triangles) concentrations. The bottom plot shows the time series of model bias (blue open circles) and error (red open triangles). The
dashed line represents the average error for the period.
2.1. Pairing observations and modeled outputs In most cases, the Site Compare program is able to read the
observation data files that are provided directly from the major air
The first step required as part of the AMET software is to pair the quality networks in the United States. The air quality networks that
observed and modeled values in space and time. In the air quality Site Compare/AMET currently supports are the Chemical Speciation
module of AMET, this is accomplished through the Site Compare Network (CSN; previously STN), the Interagency Monitoring of
software, which is a Fortran program released as a tool along with PROtected Visual Environments (IMPROVE; http://vista.cira.
the CMAQ model and is available for download through the colostate.edu/improve/) network, the Clean Air Status and Trends
Community Modeling and Analysis System (CMAS; http://www. Network (CASTNET; http://www.epa.gov/castnet/), the National
cmascenter.org/) website (packaged as part of AMET). Site Atmospheric Deposition Program (NADP; http://nadp.sws.uiuc.edu/)
Compare performs matching using the nearest neighbor approach, networks and the SouthEastern Aerosol Research and CHaracter-
and applies temporal averaging to the modeled data if required (e.g. ization Study experiment (SEARCH; http://www.atmospheric-
if an hourly CMAQ output file and daily average observation are research.com/studies/SEARCH/index.html). Some data obtained
provided Site Compare will compute a daily averaged CMAQ value through the EPA’s Air Quality System (AQS) may require a pre-
for the correct location to match with the observational data). The processor that is included with the Site Compare software. Users
software provides a means of matching modeled and observed should obtain the observation data directly from the various network
species with different names and combining the modeled or resources and then refer to the AMET user’s guide for instructions on
observed species to create derived species (e.g. total carbon aerosol how use those data with AMET.
from elemental and organic carbon). In addition to the Site Compare For the meteorology module of AMET, all pairing of observations
software, text files containing the site information (site name, with model data is done using Perl. The model output file is probed
location, time-zone, etc.) for each network are required and are for time/date information and grid projection. The Perl script cycles
provided for several U.S. networks. The Site Compare software is run through each model output time, extracts corresponding observa-
separately for each observation network and individual comma tions, determines the location of the observation sites relative to the
separated files are created for each network which contain the model grid, pairs the observations with the model values and inserts
paired observed and model data for any or all (user specified) of the the paired data into the MySQL database. The meteorological
species available from that network. The paired data are then matching is performed using either bilinear interpolation or nearest
inserted into the MySQL database using a Perl script. neighbor. Additionally, AMET attempts to match near-coastal
440 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443
Fig. 8. Sample plot from the run_barplot script from the AMET air quality module for two different CMAQ model simulations. The height of the bars represent the total PM2.5 mass,
þ
while the color shading represents the contributions of sulfate (SO2
4 ), nitrate (NO3 ), ammonium (NH4 ), total carbon TC), and other PM2.5 mass (Other) to the total PM2.5 mass.
2.3. R statistical package scripts quality modules, along with a brief description of the script func-
tions. These scripts are used to generate statistics and plot the
The R project for Statistical Computing, “R”, is an open-source results using the various plotting functions available in the R soft-
software for statistical computing and graphics based on the “S” ware. Many of the output products available in AMET are frequently
language, and is available for many different computer platforms. used in the air quality modeling community. In addition to using the
The R software can be used through a command line interface or provided R scripts, the more advanced user could use these scripts as
with the use of scripts, and contains a large number of functions for a template for creating new R scripts tailored to their particular
computing common statistical metrics and generating graphics (e.g. application and analysis.
box plots and scatter plots). Additionally, the R software contains
a library for interfacing with the MySQL database software. All these 3. Examples of tool applications
features make R an attractive computational and plotting software
package for AMET. In 2007, AMETv1.1 was released as a tool by CMAS as part of the
The information stored in the database in each project table is CMAQv4.6 model release. The AMET installation and users guide
queried through individual cshell scripts for each R analysis script (http://www.cmascenter.org/help/documentation.cfm?
(the R analysis scripts are describing in detail in the next section). MODEL¼amet&VERSION¼1.0) provide detailed information on
The R software has available a MySQL library which contains how to set-up and run AMET with the CMAQ modeling system.
commands for interfacing with the MySQL database. Through the Since its release in the fall of 2007, the AMET has been downloaded
various commands included in the R MySQL library, the database over 550 times, indicating the tool has garnered significant interest
can be queried based on any of the information stored in the data- in the meteorological and air quality communities.
base tables. For example, one could limit their data analysis to For the past five years, AMET has been used extensively by
a single day or week, or to a single state or collection of states. The AMAD for research using the CMAQ modeling system (Appel et al.,
AMET scripts include instructions for performing common queries 2007, 2008, 2010; Pleim and Gilliam, 2009; Gilliam and Pleim,
(e.g. queries based on a date range or location). However, anyone 2010; Otte, 2008a,b). Additionally, AMET has been used by EPA’s
familiar with basic database queries could easily formulate their Office of Air Quality Planning and Standards (OAQPS) to aid in the
own query criteria to subset the data in any way they choose. evaluation of their air quality simulations for federal rule making
The AMET software already includes a collection of R scripts that applications and by other groups to evaluate MM5 and WRF model
perform various analyses such as computing standard statistical simulations for efforts not necessarily associated with CMAQ model
metrics such as bias, error and correlation; creating common applications. AMET is currently under active development at the
statistical plots such as scatter plots and time series plots; and Institute for the Environment and at the EPA.
creating unique plots designed specifically for AMET. Table 1 lists the The following (Figs. 2e9) highlight some key analysis capabil-
R scripts currently available for the AMET meteorological and air ities of AMET (refer to Table 1 for additional information). Fig. 2
442 K.W. Appel et al. / Environmental Modelling & Software 26 (2011) 434e443
average species concentration. Criteria and goal lines, which vary as easily compute common indicator ratios (e.g. O3/NOY); improving
a function of concentration, are also included on the plot and form existing scripts to add flexibility and improve the quality of the plots;
a shape similar to that of a bugle. The idea behind the bugle plot is improving the organization of the database to increase overall speed;
that at lower concentrations the acceptable range for bias and error and improving the AMET interface. One possibility for improving the
is larger than at higher concentrations. AMET interface would be the development of a JAVA-based interface
Beyond the examples described above, the structure of the AMET that will enable AMET to be easily used on multiple computing
software allows for various other types of analyses. While most of platforms. Many of these improvements are currently in develop-
the examples above utilize surface observations, AMET is capable of ment and will be included with the next release of the AMET software.
analyzing just about any type of data, including aloft observations
(e.g. radiosondes, ozonesondes). Additionally, while the existing Acknowledgements
analysis scripts in AMET are based on data paired in time and space,
through the development of new R scripts AMET can easily be The authors would like to thank Sarav Arunachalam with the
extended to include analysis of unpaired data such as cumulative University of North Carolina’s Institute for the Environment for
distributions, spectral density analysis and principle component helping develop the AMET release code and the AMET installation
analysis. In its most basic form, the AMET software provides and users guides. The authors would also like to thank Sharon
a convenient way to store, organize and access observation and Phillips of the EPA’s Office of Air Quality Planning and Standards,
model data for any type of analysis that a user may wish to perform. Air Quality Analysis Division for her help with the testing and
evaluation of the AMET software.
4. Conclusions
Appendix
AMET is a comprehensive software package that is used to pair
observations with gridded model data for meteorological and air Disclaimer e The United States Environmental Protection
quality related applications. AMET performs statistical calculations Agency through its Office of Research and Development funded and
and creates a variety of output products to enable researchers to managed the research described here. It has been subjected to
better understand and evaluate model predictions and improve the Agency review and approved for publication.
science within the models. AMET output includes tabular statistics,
graphical representations of statistics, and spatial plots. AMET is built
References
entirely on open-source software, and while currently set-up to work
with MM5, WRF and CMAQ model data, the software could easily be Appel, K.W., Gilliland, A.B., Sarwar, G., Gilliam, R.C., 2007. Evaluation of the
extended to work with other models. For example, the AMET soft- Community Multiscale air quality (CMAQ) model version 4.5: sensitivities
ware has been used to evaluate Comprehensive Air Quality Model impacting model performance: Part IeOzone. Atmos. Environ. 41, 9603e9615.
Appel, K.W., Bhave, P.V., Gilliland, A.B., Sarwar, G., Roselle, S.J., 2008. Evaluation of
with Extensions (CAMx; ENVIRON, 2010) model output with the use the Community Multiscale Air Quality (CMAQ) model version 4.5: sensitivities
of file conversion software available for that model. impacting model performance; Part IIeparticulate matter. Atmos. Environ. 42,
The AMET software is designed to organize observed and pre- 6057e6066.
Appel, K.W., Roselle, S.J., Gilliam, R.C., Pleim, J.E., 2010. Sensitivity of the Community
dicted data into a relational database that can be easily accessed to Multiscale Air Quality (CMAQ) model v4.7 results for the eastern United States
help improve the efficiency in which a model’s performance can be to MM5 and WRF meteorological drivers. Geosci. Model. Dev. 3, 169e188.
assessed. AMET has been used by various groups in the meteoro- Byun, D.W., Schere, K.L., 2006. Review of the governing equations, computational
algorithms, and other components of the Models-3 Community Multiscale Air
logical and air quality communities to synthesize model results and Quality (CMAQ) modeling system. Appl. Mech. Rev. 55, 51e77.
enhance model development. The use of a common model evaluation Dennis, R., Fox, T., Fuentes, M., Gilliland, A., Hanna, S., Hogrefe, C., Irwin, J., Rao, S.T.,
tool allows for better understanding of the model evaluation process Scheffe, R., Schere, K., Steyn, D., Venkatram, A., 2010. A framework for evalu-
ating regional-scale numerical photochemical modeling systems. Environ. Fluid
and easier interpretation of the evaluation results.
Mech.. doi:10.1007/s10652-009-9163-2.
AMET will continue to grow and improve through internal ENVIRON, 2010. User’s Guide to the Comprehensive Air Quality Model with
development within AMAD, external development through Extensions (CAMx) Version 5.20. Available at. ENVIRON International Corpo-
ration, 773 San Marin Drive, Suite 2115Novato, California 94998. http://www.
collaborating partners, and through contributions and suggestions
camx.com/files/CAMxUsersGuide_v5.20.pdf.
from the user community. Several signinficant improvements to Foley, K.M., Roselle, S.J., Appel, K.W., Bhave, P.V., Pleim, J.E., Otte, T.L., Mathur, R.,
the AMET software are planned for the next release of the tool. One Sarwar, G., Young, J.O., Gilliam, R.C., Nolte, C.G., Kelly, J.T., Gilliland, A.B.,
improvement will be to combine the meteorological and air quality Bash, J.O., 2010. Incremental testing of the Community Multiscale Air Quality
(CMAQ) modeling system version 4.7. Geosci. Model. Dev. 3, 205e226.
modules into a single streamlined module that will allow for Gilliam, R.C., Pleim, J.E., 2010. Performance assessment of new land-surface and
greater cross-comparability of the meteorological and air quality planetary boundary layer physics in the WRF-ARW. J. Appl. Meteor. Climatol.
data. A second improvement focuses on greatly expanding the doi:10.1175/2009JAMC2126.1.
Grell, G.A., Dudhia, A.J., Stauffer, D.R., 1994. A description of the Fifth-Generation
types of datasets that can be used for model evaluation. These PennState/NCAR Mesoscale model (MM5). NCAR Technical Note NCAR/TN-
changes will allow the user to include non-standard observational 398þSTR. Available at. http://www.mmm.ucar.edu/mm5/doc1.html.
datasets, such as field campaign data or observations from outside Otte, T.L., 2008a. The impact of nudging in the meteorological model for retro-
spective air quality simulations. Part I: evaluation against national observation
of the U.S. with no changes to the underlying AMET code. Addi- networks. J. Appl. Meteor. Climatol. 47, 1853e1867.
tionally, the use of satellite data has been streamlined with the Otte, T.L., 2008b. The impact of nudging in the meteorological model for retro-
development of tools to regrid satellite data to regular grids, (e.g. spective air quality simulations. Part II: evaluating collocated meteorological
and air quality observations. J. Appl. Meteor. Climatol. 47, 1868e1887.
regrid swath data to the CMAQ lambert conformal grid), as well as Otte, T.L., Pleim, J.E., 2010. The meteorology-Chemistry interface processor (MCIP)
to calculate comparable variables (e.g. column aggregated aerosol for the CMAQ modeling system. Geosci. Model. Dev. 3, 243e256.
optical depth values from model output). Pleim, J.E., Gilliam, R.C., 2009. An Indirect data Assimilation Scheme for Deep Soil
temperature in the PleimeXiuLand surface model. J. Appl. Meteor. Climatol. 48,
Additional future enhancements to the AMET software will
1362e1376.
include implementing new analysis scripts (e.g. Taylor plots); adding Skamarock, W.C., Klemp, J.B., Dudhia, J., Gill, D.O., Barker, D.M., Duda, M.G., Huang,
non-parametric analysis techniques such as cumulative distribution X-Y, Wang, W., and Powers, J.G., 2008. A description of the advanced research
frequency plots, QeQ plots and histograms; adding temporal WRF version 3. NCAR Tech Note NCAR/TN 475 STR, 125 pp, Available from UCAR
Communications, P.O. Box 3000, Boulder, CO 80307.
decomposition capabilities such as Kolmogorov-Zurbenko filtering Zurbenko, I.G., 1986. The spectral analysis of time series. North-Holland Series in
(Zurbenko, 1986) and spectral density plots; adding the ability to Statistics and Probability.