You are on page 1of 7

AN ADAPTIVE DIAGNOSTIC MODEL

FOR AIR QUALITY MANAGEMENT


ROBERT CARBONE
Academic Faculty of Management Science
and
WILPEN L. GORR
School of Public Administration, The Ohio State University, Columbus, OH 43210, U.S.A.

(First received 1 Nouember 1976 and infinal form 13 January 1978)

Abstract - This paper draws attention to the program evaluation and regulation adjustment capabilities
required for implementing and maintaining air quality standards. A diagnostic approach is presented for
determining the causes of air quality problems based on a multivariate time series model. The model
formulation is based on routinely available data and the concept of time-varying parameters with estimation
via an adaptive tittering technique. The diagnostic approach is illustrated with some total suspended
particulates data from Allegheny County, Pennsylvania, recorded over a period of time during which a series
of pollution control milestone events occurred.

1. INTRODUCTION that it has the capability of continuously providing


diagnostics without the attention of systems analysts.
Implementation and maintenance of air quality stan- The next section of the paper describes the empirical
dards tend to be treated as once-only planning efforts setting for a demonstration of ADM using total
with the resulting pollutant emission regulations being suspended particulates (TSP) concentrations recorded
thought of as “permanent” (see Federal Register, 1971 over a period of five years at the Logans Ferry
and 1974). Ambient air quality resulting from plans monitoring site in Allegheny County (Pittsburgh)
may miss the targeted standards, however, for several Pennsylvania. In an earlier paper (Carbone and Gorr,
reasons. These include emission inventory errors, 1976) we reported initiat experience in applying AEP
forecasting errors, diffusion modeling prediction er- to the estimation of multivariate models of pollutant
rors, non-compfiance with regulations and unfavor- concentrations, specifically background concen-
able changes in uncontrollable variables such as trations, based in part on these data. These data are re-
background con~ntration. examined in this paper because of specific milestone
In some cases, then, it may be necessary to institute events of air pollution control which have occurred
feedback control systems to adjust regulations or over the period at the various point sources in the
make other corrections in air quality management vicinity of the site in question. An analysis of such data
according to the experienced air quality trends. The allows us to see whether or not ADM is capable of
purpose of this paper is to draw attention to the detecting and separating such events from the series of
adjustment-of-air-pollution-regulations problem and ambient concentration without using any a priori
to develop and test an appropriate diagnostic ap knowledge. Section 3 presents ADM and a brief
preach for the adjustment process - the Adaptive discussion of the estimation approach. Finally, the
Diagnostic Model (ADM). results of our study are summarized in Section 4.
Diagnostic models have the purpose of inexpen-
sively limiting the explanation of an event to a subset of
all possible causes of the event. Thus explanation 2. DATA
becomes a staged process with the first stage being
made with readily available information and routine Figure 1 shows the locations of major particulate
procedures. The ADM which is presented for this first sources in Allegheny County including sources of
stage analysis is a multivariate time series model based fugitive emissions (largely from by-product coke
on the concept of time-varying parameters with para- plants) and point sources (largely integrated steel
meter estimates calculated by the Adaptive Estimation plants and fossil-fueled power plants). Also in Fig. 1 is
Procedure (AEP) developed by Carbone (1975) and the monitoring site under study, Logans Ferry oper-
Carbone and Longini (1975, 1976). Through the ated by the Allegheny County Bureau of Air pollution
application of this adaptive filtering algorithm, ADM Control. Additional information on the Logans Ferry
is self-adaptive to changing structures of an airshed so monitor and nearby sources can be found in Rubin and

1785
1786 ROBERTCARBONEand WILPEN L. GORR

3. MODEL FORMULATION AND ESTIMATION


APPROACH

The formulation of a model which relates pollution


concentration, c(t), in real time at a monitoring site
during period t as a function of a set of meteorological
and other variables denoted by x,<t] or q(r) is guided
by several considerations. These include implications
of atmospheric diffusion theories, limitations due to
the type of data which is routinely available, the
(3M~sj~r~SWtce,~8ftii~t8)J requirement for an automated modeling process, ease
8missronrate0 Bs“ of model interpretation and application and overall
@ Fusl eombustii SOWCB ~rfo~ance.
@Industrial prOC8SS SOWGS The unavailability of real time data for source-
oriented causal variables of ambient concentration
Fig. 1. Distribution of particulate sources and site of Logans dictates the use of wind directional sectors and other
Ferry monitor in Allegheny County (1972). variables as surrogates. Thus the model presented
below is only capable of discriminating between
classes of causes of polIution concentration trends
such as sources to the west vs local sources, low level vs
high level release emissions etc. By applying the
concept of time-varying parameters it is possible to
Bloom (i975), Carbone and Gorr (19761, and Gorr
capture the effect on concentration levels of changes in
and Dunlap (1977).
the real causal variables which are not included in the
The data set from Logans Ferry has 365 obser-
equation. examination of the experienced patterns of
vations of 24-h average TSP taken by a hi-vol sampler
variation of the parameter estimates becomes the
from May 1970 to April 1975. A standard Unico principal tool for diagnostic analysis.
shelter and Gelman filter paper with a collection A series of trial model forms for TSP at Logans
emciency of 99.7% for particles of size 0.3 pm and Ferry has led to the following form.
larger ,are used for this instrument. Meteorological
data include surface measurements and Rawinsonde c(r) = {g a#““) [E Q)li(‘) (,C, i$Q)xI(t))
data taken at the Greater Pittsburgh Airport in the
western portion of Allegheny County. The Rawin-
sonde data were processed by Denardo and McFar-
land Weather Services, West MifIlin, Pennsylvania to
yield inversion-related variables.
Lending interest to this data set are air pollution where G is a set ofgeneral variables, i and J are sets-of
control activities of two large fossil-fuels power inversion related variables, and NI and NJ are sets of
plants, A and B, just to the west of Logans Ferry (see non-inversion related variables. The a,(t) and fidt) in
Fig. 1) which followed a trend ofdecreasing particulate (1) are corresponding time-varying parameters and
emissions over the five year study period : u(t) is an undefined error term.
All the variables in (1) are restricted to the values of0
Plant A or 1. Wind-directional sectors for the NJ subset and
(a) In 1970 there were five coal-fired boilers in “inversion” for the J subset correspond to the x-type
service with 322 MW capacity: variables. In the present context, inversion is assumed
Nos. 2,4 and 6 - 565 MMBTU h each ; to be a non wind~ir~tional phenomenon. Only one of
No. 77 - 1020 MMBTU h ; the x-type variables can be assigned the value 1 for my
No. 88 - 1450 MMBTU h. given period since observations are either classified as
(b) In January, 1973 Nos. 2,4 and 6 were derated a particular wind-direction sector or inversion. In our
and finally taken out of service. empirical study, we have defined 45-135”, 135-225”,
(c) In December. 1974 No. 77 was converted to low 225-315” and 315-45” as the wind-dir~tional sectors
sulfur oil with the plant serving a minor role to provide and a period t is determined to be an “inversion” if
peak power. there is a morning ground-based inversion with a
temperature differential of at least 2°C across the
Plant B thickness of the inversion.
(a) Old coal-fired units rated at 262 MW were The z-type variables by definition indicate the
gradually derated from 1970 and taken ouf of service occurrence of one of a finite number of nominal classes
in May 1971. of explanatory variables such as probation levels,
(b) A new coal-fired -nit with precipitators and a wind speed, etc. They are assumed to indirectly affect
tall stack commenced operation in June 1970 and by concentration by weighting or correcting the e&i-
May 1971 was operating at capacity. mated levels determined by the /? parameters. While
An adaptive diagnostic model for air quality management 1787

some adjustment factors in (1) are assumed to affect types of processes governing the change in parameter
both wind-directional and inversion classified obser- values. This aspect is crucial since it is generally
vations (the elements of G), others are viewed as having impossible to assume a priori knowledge of the
an adjusting effect only when one of the two types of processes governing structural changes in an airshed.
period prevail. In our study, four classes for pre- Note that no restriction in (2) and (3) is imposed as to
cipitation (none, O-O.254 cm, 0.254-0.889, and greater the type of parameter variation that may arise.
than 0.899cm in water equivalent), two classes for In reference to the empirical study, the following
degree day (“heating” if the 24-h average ambient non-unique steps were taken to estimate the a,(t) and
temperature is less than 18.3”C, “cooling” otherwise), a,(t) coefficients using F,!ZP :
and two classes reflecting general activity level for (a) “None” for prectpttation, “cooling” for degree
particulate sources (“weekday” and “weekend”) con- day, “weekend” for day and 2.25-4.47 m s- ’ for wind
stitute the elements of G. Moreover, wind speed speed were defined to be standard conditions for
adjustments are applied only to the four wind direc- normalization of the z-type variables. The Zi(t) vari-
tional estimates, and not to the inversion condition. ables associated with the standard classes are sup
Five discrete classes for the variable “24-h average pressed by setting their values to zero for all obser-
wind speed” (O-2.25 m s- ‘, 2.25-4.41 m s - I, vations. (Then the /3,(t) coefficients represent the TSP
4.47-6.71 m s-l, 6.71-8.94 m s-l, and greater than concentration of the given wind direction or inversion
8.94 m s-t) are the z-type variables contained in NI. during the standard condition.)
No z-type factors have been used for I. (b) At t =O,a,(t) = 1 Viand/3)(t) = 80pgrnm3Vj
Attention is now directed to the estimation problem (an approximation of average TSP concentration at
which is to capture the path that each parameter may Logans Ferry).
follow over time so as to adapt (1) to changing (c) fj(t) = 10+-j at t = 0 and 5, = 0.04.
structural conditions. This is accomplished by apply- (d) D = 40 was applied for updating the /lAt)‘s in
ing the AEP algorithm. According to this algorithm (2) and D = 10 for the a,(t)‘s in (3). A smaller value of D

:
updated parameter estimates of (1) are obtained with was chosen for the a,(t)‘s since they are updated only
each new observation via the following two recursive when the class of explanatory variable corresponding
formulae to i occurs.

sjw
=b,(t
-I)+
I&
-1)1
A
(e) The set of observations was then reiterated

(c(t)~-,,).g.,
1(2)
several times through the AEP equations (2) and (3)

[
via a forward-backward input procedure (see Longini
et al., 1975, and Carbone and Longini, 1975) to adjust
the value of D, if necessary, and to converge to the
for all j pattern of change in the parameter estimates. In this
case, no adjustments to D were found to be necessary.
and Note that because of the small values of D used, the
&(t) = oi,(t - 1) + a;# - 1) algorithm and parameter estimates have virtually no
memory of individual events of past iterations.

[
(c(t) - W)
Z(t)
z,(t).&
1
(3)
A “purged” data set was also created by deleting the
observations that resulted in an absolute estimation
error greater than 60 pg me3 after the above steps were
for all i
performed. This second data set was similarly pro-
where : cessed through the above steps. The purpose here is to
2(t) = a short run prediction of concentration investigate robustness properties of the estimated
calculated by applying the observed data parameter paths derived.
at t to the parameter estimates oft - 1;
D = a damping parameter > 1; 4. RESULTS
P = the number of variables described by z-
type indicators; Table 1 presents some measures of performance for
f,(t) = updated average for the j-th x variate. the full and purged data sets. The simple correlation
An exponential smoothing scheme is used to calculate coefficient in this table is between observed and
this latter average as follows: estimated TSP concentrations, and the serial cor-
q(t) = &.x,(t) + (1 - S&(t - 1) relation coefficient is for a first-order autoregressive
scheme.
where 0 c S, < 1 and depends on the “forgetting rate” The average observed and estimated concentrations
of past observations applied to the process. reported in Table 1 shows little error in central
This algorithm is easy to implement ; for example, it tendency for both data sets. The standard deviation of
requires no matrix inversion or calculation of cor- estimated concentrations is smaller than that of the
relation functions. The algorithm functions as a servo- observed in both cases, but this is to be expected when
mechanism. Given initial values for the parameters, it significant measurement errors exist as with hi-vol
is designed to automatically capture with some lag the data since the variance of observed concentrations
1788 ROISERT
CARBINEand WILPENL. GORR

Table 1. Some descriptive measures of model performance

Model from Model from


Statistic full data set purged data set
__-_- ^..__ _______ _ _.__~__ ~~___.
Average observed concentration 87.4 81.9
Average predicted concentration 88.9 83.2
Standard deviation observed 43.3 34.2
Standard deviation estimated 27.9 23.8
Root mean square error 36.0 26.3
Mean absoiute error 27.0 20.9
Simple correlation coefficient 0.563 0.643
Serial correlation coefficient - 0.022 - 0.052

includes the variance of true concentrations and the evidence of serial correlation is detected which is an
error distribution variance. Furthermore, we note that indication that the time-lag introduced by damping is
statistics reflecting closeness of fit (root mean square causing no adverse effects.
error, simple correlation coefficient and mean ab- Table 2 presents parameter estimates for the Logans
solute deviation) are comparable to those derived from Ferry TSP model. The first column of Table 2 is the ‘?;
other empirical modeling efforts. See for example occurrence of each class of a variable in the sample.
Islitzer and Slade (l%S), McColhster and Wilson The next two columns give parameter estimates for the
(1975), Bankoff and Hanzevack (1975), Samson et al., start and end of the five year study period as estimated
(19751, and Sidik and Neustader (1976). Finally no from the full data set. The last column contains the

Table 2. Parameter estimates for model with full data set and comparison with purged
data set

Mean absolute
error
Variable “/0of Parameter estimates full vs
class occurrence Start End purged

U ind direction (“)


45-135 7.4 43.1 52.9 1.20
135-225 8.5 119.0 97.3 11.44
225-315 40.6 118.6 95.1 2.62
315-45 14.2 63.7 54.3 2.51
Inversion 29.3 106.7 112.2 4.21
100.0
Wind speed (m s-‘)
O-2.24 6.3 1.092 1.106 0.116
2.24-4.47+ 51.6 1.ooo 1.000 O.ooO
4.47-6.71 36.4 1.013 0960 0.027
6.71-8.94 4.9 1.088 1.055 O.OQO
> 8.94 0.8 1.205 1.196 0.004
lao.0
Precipitation
none* 53.4 1.000 mlo 0.000
low 20.3 0.906 0.854 0.012
medium 15.9 0.873 0.851 0.034
heavy 10.4 0.890 0.895 0.018
100.0
Degree day
cooling* 28.5 l.oOO Moo O.OiKl
heating 71.5 0.854 0.807 0.012
100.0
W
weekend* 26.3 1.ooo l.ooo 0.000
weekday 73.7 1.250 1.127 0.038
100.0

* Parameter estimates for these classes are always equal to 1 because they were &bed
as standard classes w&b respect to which the el%cts of other multiplicative variabies are
compared.
An adaptive diagnostic model for air quality management 1789

mean absolute deviation between parameter estimates


from the full and purged data sets computed over
observations common to both data sets.
From this table, it is easy to calcuiate the range of
estimated concentrations which is 32.1-179.2 pg me3
at the start and 34.1-131.1j~grn-~ at the end. The
values reported give results as theoretically expected.
For example, the results reveal that concentration on a
weekday is 25% greater than on a weekend at the
beginning and about 12% at the end of the period
(which may provide some measure of the general
effectiveness of control policies); that precipitation
reduces concentration by about 12-15% in contrast to
no precipitation over the period ; that a resultant wind
direction from the major point sources located close to -I
75 150 225 300 375
the monitor and from the city (135-315”, see Fig. 1) for Observation number
days characterized by the standard condition leads to
concentration levels close to three times greater than Fig. 3. Parameter paths of the fi estimates for 225-3 15” wind
direction.
those for the background resultant wind direction
(45-135”) at the beginning and less than two times
greater at the end (which also provides some in-
dication of the effectiveness of control policies). In fact,
the results reveal that concentrations from the urban
area directions have decreased from 20-40% over the
period depending upon the type ofindicator condition.
Also of interest is the fact that the coefficient for the
highest wind speed class is approx 20% higher than the
three middle classes. This provides some evidence that
the Steady Wind Incident phenomenon of high pol-
lutant concentration exists for TSP as well as sulfur
dioxide (see Gorr and Dunlap, 1977).
To further explore the diagnostic capabilities of
ADM, Figs 2-6 present the patterns of variation over
time of the estimates of certain parameters. In all cases
examined, the paths derived from both data sets are
plotted with the heavier line corresponding to the full
0 75 I50 225 300 3
data set. Observations not common to both data sets Observation number
have also been deleted in the parameter plots. Since
Fig. 4. Parameter paths of the p estimates for inversion.

1.60

E”
; 1.20
z
I

p7 l-l
f o.60j----=-=--
L

CT
0.40

1971 1972 1973 1974 1975


i
0 75 150 225 300 3 1971 1972 1973 1974 1975
Observation number
0 75 150 225 300 3j! 5
Observation number
Fig. 2. Parameter paths of the b estimates for 45-135” wind
direction (heavy line indicates full data set, light line purged Fig. 5. Parameter paths of the ai estimates for mild
data set). precipitation.
1790 ROBERT CARBONE and WILPEN L. GORK

and “weekday”. Here again, we can visually observe


OAISI index , consistency between the patterns derived from the full
and purged data sets. Figure 5 reveals that the impact
of mild precipitation in contrast to no precipitation
has remained somewhat constant over the study
period as is expected. It is interesting to note from Fig.
6 that the path derived for weekday in contrast to
weekend which reflects changes due to general source
activity level follows a pattern similar to the inversion
trend portrayed in Fig. 4. Comparing this path to the
running annual AISI steel production index as in Fig.
4, we can easily observe the same phenomenon. The
same pattern of variation between these two paths
1971 1972 1973 1974 1975
seems to persist until 1973, and thereafter, a decreasing
0’ 7’5 150 225 360 3;5 trend in the weekday parameter exists as opposed to
Observation number
an increasing trend in the steel production index. This
Fig. 6. Parameter paths of the ai estimates for weekday. further supports our earlier contention as to when
local emission control policies show some sign of
effectiveness.
parameter estimates are only updated when a class of a
variable is observed, it is assumed in these figures that 5. CONCLUDING REMARKS
they persist in value until such an occurrence.
Figures 2,3 and 4 depict the paths of j? parameters The main thrust of this paper has been to propose an
respectively associated with the 45-135” (background) adaptive diagnostic framework for air quality manage-
and 225-315” (major point sources) wind directional ment based exclusively on routinely available infor-
classes, and inversion. The plotted values in these cases mation. It is apparent from the results reported in the
represent the patterns of variation of expected TSP previous section that this framework appears promis-
given the standard condition; i.e. no precipitation, ing. These results demonstrate the ability of the system
cooling, weekend, and 2.25-4.41 m s-l wind speed for for accurately capturing and analyzing changes in
non-inversion days. trends in concentration 1el:els without using any a
It is evident from these figures that the paths from priori knowledge.
the full and purged data sets follow nearly identical Large scale computational experiments with real
patterns. Our results portrayed in Fig. 2 reveal an time data from a telemetry network are planned. Also,
increasing trend in TSP advected into the region under we are currently exploring the use of this type model-
the standard condition. In contrast, we note from Fig. ing process for the dynamic calibration of monitoring
3 a decreasing trend for the same condition whenever instruments, long-run forecasting via the extrapo-
the wind was blowing from the direction of major lation of the estimated paths of the parameters, and
nearby point sources (power plants A and B in Fig. 1). finally, refinement of ADM for real time prediction of
The milestone events in particulate emission controls concentration levels at any point in an airshed given an
at these sources mentioned in Section 3 appear to be existing monitoring network.
evidenced in Fig. 3. Also plotted in Fig. 4 is a running
annual average (see Rubin and Bloom, 1975) com- Acknowledgments - Appreciation is expressed to members of
puted from monthly summaries of district steel pro- the Allegheny County Bureau of Air Pollution Control and
duction reported by the American Iron and Steel especially Deputy Director R. J. Chleboski and S. Feigen-
Institute (AISI) which combines Allegheny County baum whose efforts resulted in much of the information for
this study. Partial support was provided by an Ohio State
steel production with that of some more distant University Research Grant.
facilities. Significant contributions to inversion con-
centrations are more likely due to low level emissions
such as from by-product coke plants as opposed to
elevated sources such as power plants. A qualitative REFERENCES
comparison of these plots suggests that inversion
concentration followed a similar pattern of change as Bankoff S. G. and Hanzevack E. L. (1975) The adaptive-
filtering transport model for prediction and control of
the steel production trend until 1973. Thereafter a pollutant concentration in an urban airshed. Atmospheric
significant departure in opposite.direction is noticed. Environment 9, 792-808.
The year 1973 may then indicate the point in time Carbone R. (1975) The design of an automated mass
when local emission control policies for steel plants aooraisal svstem using feedback. Unpublished Ph.D.
disiertation; Carnegie-Mellon University, Pittsburgh.
impacted ambient air quality. Carbone R. and Gorr W. L. (1976) Environmental Modeling
Figures 5 and 6 present the patterns of variation of a& S&m&ion. (Edited dy Oti) pp- 478482. USEPA
the estimates of the multiplicative parameters re- 600/9-76-016, Washington,. D.C.
spectively assigned to the classes “mild precipitation” Carbone R. and Lontini R. L. (1975) An adaptive stochastic
An adaptive diagnostic model for air quality management 1791

approximation algorithm for estimating time-varying McCollister G.‘M. and Wilson K. R. (1975) Linear stochastic
parameters. College of Administrative Science, The Ohio models for forecasting daily maximum and hourly con-
State University WPS 75-56. centrations of air pollutants. Atmospheric Environment 9,
Carbone R. and Longini R. L. (1977) A feedback approach to 417-423.
automated real estate assessment. Mgt Sci. 24, 241-248. Rubin E. X. and Bloom H. T. (1975) Maintenance of ambient
Federal Register. (1971j Environmental Protection Agency particulate standards in an industrialized region. Presented
36, No. 158, 15486-15505. at the 68th Annual Meeting of the Air Pollution Control
Federal Register. (1974) Environmental Protection Agency Association, Boston.
39, No. 167,31000-31009. Samson P. J., Neighmond G. and Yencha A. J. (1975) The
Gorr W. L. and Dunlap R. W. (1976) Characterization of transport of suspended particulates as a function of wind
steady wind incidents for air quality management. Atmos- direction and atmospheric conditions. J. Air Polk. Con-
pheric Environment 11, 59-64. trol Ass. 25, 1232-1237.
Islitzer N. F. and Slade D. H. (1968) Meteorology and Atomic Sidik S. M. and Neustader H. E. (1976) Environmental
Energy. (Edited by Slade) pp. 132,143. U.S. Atomic Energy Modeling and Simulation. (Edited by Ott) pp. 678-682.
Commission TID-24190, Springfield, Virginia. USEPA 600/g-76-016, Washington, DC.
Longini R. et al. (1975) Filtering without phase shift. IEEE Widrow B. et al. (1975) Adaptive noise cancelling: principles
Trans. biomed. Engng 432-433. and applications. Proc. IEEE 63, 1692-1716.

You might also like