Professional Documents
Culture Documents
Herbert Robbins Sutton Monro: The Annals of Mathematical Statistics, Vol. 22, No. 3. (Sep., 1951), Pp. 400-407
Herbert Robbins Sutton Monro: The Annals of Mathematical Statistics, Vol. 22, No. 3. (Sep., 1951), Pp. 400-407
The Annals of Mathematical Statistics, Vol. 22, No. 3. (Sep., 1951), pp. 400-407.
Stable URL:
http://links.jstor.org/sici?sici=0003-4851%28195109%2922%3A3%3C400%3AASAM%3E2.0.CO%3B2-W
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained
prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in
the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/journals/ims.html.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
The JSTOR Archive is a trusted digital repository providing for long-term preservation and access to leading academic
journals and scholarly literature from around the world. The Archive is supported by libraries, scholarly societies, publishers,
and foundations. It is an initiative of JSTOR, a not-for-profit organization with a mission to help the scholarly community take
advantage of advances in technology. For more information regarding JSTOR, please contact support@jstor.org.
http://www.jstor.org
Mon Aug 27 15:50:48 2007
A STOCHASTIC APPROXIMATION METHOD'
has n unique loot n: = 0. Therc are many methods for determining the value of 8
by successive approximation. With any such method n-e begin by choosing one or
more values 1'1, . . . , z,more or less arbitrarily, and then successively obtain new
values 2, as certain fucctions of the previously obtained xl , . . , x,-I ,the values
.I() . . ill(.r,-I) , possibly those of the derivatives M ' ( q ) , - - . ,M'(x,-~),
and
etc. If
lim x, = 0,
n+w
- Y(x)with distribution
1s the expected value of Y for the given x. Neither the exact nature of H ( y / x)
nor that of M(x)is known to the experimenter, but it is assumed that equation ( 1 )
has a unique root 0, and it is desired to estimate 0 by making successive observa-
tions on Y a t levels x1 , x2 , . . . determined sequentially in accordance with some
definite experimental procedure. If (2) holds in probability irrespective of any
arbitrary initial values xl , . . . , x, , we shall, in conformity with usual statistical
terminology, call the procedure consisfcnf for the given H ( y / x) and vali~rn
This work was upp ported In part by t h e Office of Naval R ~ s r t i r c h
400
STOCHASTIC APPROXIMATION 40 1
(4) Prl 1 Y ( d 15 CI =
-C
dH(ylx1 = 1 for all .r.
I t follows in particular that for every x the expected value M(x) defined by (3)
exists and is finite. We suppose, moreover, t,hat there exist finite constants a,
0 such that.
(5) M(z) 5 a for x < 8, M(x) 2 cu for z > 0.
Whether M (8) a is, for the moment, immaterial.
=;
no matter what the initial value xl . As is wall known, (10) implies the convergence
in probability of x, to 8.
From (7) we have
402 HERBERT KOHBINS .4XD SUTTON MONK0
Setting
d. = E [ ( z n- O)(IM(x.) - a)],
we can write
bn+l - bn = anen - 2u, d,, .
2
(14)
Note t'hai from ( 5 )
dn 2 0,
while from (4)
0 5 en I [C + ( aj12 < 03.
Together with (6) this implies that the positive-term series ~Ia:e, converges.
Summing (14)we obtain
linl b,,
11-01
= b, + he. - 2 2 a, dn = b
exists; b >. 0.
Now suppose that tllere exists a sequence {kn} of nonnegative constants
such tjhat
E'rorn (20) a~ltlthe secolid part of (19) it follows that for any e > 0 there must
exist infinitely many values 12 such that b. < E. Since we already kno~vthat
b =; lim,,,,. b,, exists, it follo\vs that b = 0. Thus Itre have proved
LEMMAI . I j n sequence (k,,)of nonnegative constants esist.s satisfying ( 1 9 )
then b = 0.
Let
From (5) i t follow that L,, 2 b. Moreover, denoting by P,(z) the probahilit,y
distribiition of x, , we have
.I, = i ir Rl<A,,
(., - 6) (A/( a ) - a) CIP,,
(.I.)
(24)
-
k II 7. - 0 l2 dJ',?(.t.) En bw
I t follons that the particu1:tr sequcncc3 (z,,) defined by (23) sat isfics the first
part of (19).
111 orclel- t o c,st:?hlish the second part of (39) nr\'r shall m:ilir the following
assnmptinns:
:ind i hc. ~ c c o u dpart, (11 (19) folloi~s froin (291 :111(1 (26). '1.h;~: 111~ovr:-
J,mrv z U,f ( 2 5 ) ( ~ n f(26)
? hold ihrn h = 0.
404 HERBERT BOBBINS AND SUTTON MONRO
The hypotheses ( 6 ) and (26) concerning {a,} are satisfied by the sequence
a, = l/n, since
More generally, any sequence {a,] such that there exist two positive constants
C" for which
c',
will satisfy (6) and (26). We shall call any sequence {a,} which satisfies ( 6 )
and (26),whether or not it is of the form (30), a sequence of type l/n.
If {a,} is a sequence of type l/n it is easy to find functions M ( x ) which satisfy
( 5 ) and (25). Suppose, for example, that M ( x ) satisfies the following strength-
ened form of ( 5 ): for some 6 > 0 ,
(5') M(x)<a-6 for x < O , M(x)La+8 for a : > O .
Then for 0 <Ix - 0 I 5 A, we have
so that
6
5, 2 -
A,'
which is (25) with K = 6. From Lemma 2 we conclude
THEOREM 1. If (a,] is of type l/n, if ( 4 ) holds, and if iM(x) satisfies (5') then
b = 0.
A more interesting case occurs when iM(x) satisfies the following conditions:
(33) M ( x ) is nondecreasing,
We shall prove that (25) holds in this case,also. From (34) it follows that
(36 ) M ( x ) - a: = (X +
- e ) [ M f ( e ) e(x - e)],
where r(t) is a function such that
so that
We do not know whether Theorems 1 and 2 hold with (4) replaced by (4').
Likewise, the 'hypotheses (33), (34), and (35) of Theorem 2 could be weakened
somewhat, perhaps being replaced by
(5") M ~ X<
) for L < e, M(X) > for x > e.
4. Estirnatiori of a quantile using response, nonrespoxlse data. Let F(x) be
an unknown distribution function such that
(431 F(0) = a: (0 < a < I), E"(0) > 0,
and let (z,) be a sequence of independent random variables each with the
distribution function Pr[z, 5 x] = F(x). On the basis of {z,} we wish to estimate
8. However, as sometimes happens in ptnctice (bioassay, sensitivity data), we
are not allowed to know the values of zn themselves. Instead, me are free to
prescribe for each n a value x, and arc then given only thc values (y,,} where
11 ifzn<x, ("response"),
yn = 10 otherwise (((nonres~:onse").
Hon shall we choose the values {z,} and how shall we nse the sequcncil {y,]
to estimate 01
406 HERBERT ROBRINS .4ND SUTTON BIONRO
Let us proceed as follows. Choose xl as our best guess of the value 0 and
let ( a , ] be any sequence of constants of type I /n. Then choose vall~esx2 , x3 , . . .
seqilentially according t o thc rule
using the iotat ti on (44). J,ct fj, = arithmetic mean of the values (49). Then
setting
and deals with the estimation of one or both of the parameters Pi on the basis of
observations y, , yz , . . . , y, corresponding to observed values XI , 22 , . . . , x, .
The method of least squares, for example, yields the estimators bi which minimize
t'he expression