Professional Documents
Culture Documents
Decision-Making A Fuzzy Environment: Nasa Contractor
Decision-Making A Fuzzy Environment: Nasa Contractor
1.
Y
N A S A CONTRACTOR
REPORT
DECISION-MAKING I N A
FUZZY ENVIRONMENT
C&W'O'
J NASA CR-1594
J L/
By R. E. Bellman and L. A. Zadeh
..
, 7 . !, ,
,i/' .( -.
" t
c -
Issued by Originator as Report No ERL-69-8
'i
pJ
Prepared under Grant No. NGL-05-003-016 by
ELECTRONICS RESEARCH LABORATORY ; J C ~ C
/ - . ,..iL:L'!4;i<c'L:&
Ltc,3'
UNIVERSITY OF CALIFORNIA I
Berkeley,
Calif. 94720 J,,i! za", F b ,
for
NATIONAL AERONAUTICS AND SPACE ADMINISTRATION
For sale by the Clearinghouse far Federal Scientific and Technical Information
Springfield, Virginia 22151 -
CFSTI price $3.00
ABSTRACT
decision process in which the goals and/or the constraints, but not
necessarily the system under control, are fuzzy in nature. This means
-iii-
control is either deterministic or stochastic.By using dynamic pro-
-iv-
CONTENTS
Page No.
1. INTRODUCTION 1
5. A FUZZY ENVIRONMENT
STOCHASTIC SYSTEMS IN 31
7. CONCLUDING REMARKS 51
REFERENCES 53
FIGURES
-V-
1. INTRODUCTION
To deal quantita-
quences of possible actions are not known precisely.
whatever
itly accepting the premise that imprecision - its -
nature c.an
sumption.
we mean a type
of imprecision which is associated with the use of fuzzy
sets,’”that is, classes in which there is no sharp transition from
the classes in the real world do not have crisp boundaries which separate
those objects which belong to the classes in question from those which
is several
course between humans, statements such as “John inches taller
.
than Jim," "x is much larger than y, "Corporation
X has a bright
fact, it may be argued that the main distinction between human intel-
or non-membershipof an object in
a non-fuzzy set. Fuzziness, on the
other hand, has to do with classes in which there may be grades of mem-
To
bership intermediate between full membership and non-membership.
X has a modern
illustrate the point, the fuzzy assertion "Corporation
outlook." On the other hand, the statement "The probability that Cor-
poration X is operating at
a loss is0.8," is a measure of the uncer-
for dealing with fuzziness are quite different from those of probability
theory. They are simpler in many ways because to the notion of prob-
2
respondents ofa +b and ab, where
a and b are real numbers, are the
through the techniques provided by the theory of fuzzy sets rather than
note, our main concern is with introducing three basic concepts: fuzzy
goal, fuzzy constraint and fuzzy decision, and exploring the application
of these concepts to multistage decision processes in which the goals or
the constraints may be fuzzy, while the system under control may be eith
-
deterministic or stochastic but not fuzzy. This, however, is not an
20-25.’’
be approximately in the range The sources of fuzziness in these
3
A less trivial example is provideda deterministic
by
xn+l = xn + un , n=0,1,2,. ..
where x and u denote, respectively, the state and input at ntime
and
n n
in which for simplicity xn and un are assumed to be real-valued. Here
In what follows,
we shall consider in greater detail
a few
4
suggest tentative ways of attacking them, rather than to a develop
gen-
of a fuzzy -
algorithm3 a concept which may be of use in problems which
the sequel.
2.
""
A Brief
~ .. .
Introduction
~- ~~~
to Fuzzy Sets
~~
there is no sharp boundary between those objects that belong to the class
and those that do not.A more precise definition may be stated as fol-
lows.
A = {(x,pA(x))} , x E X (1)
of x
where pA (x) is termed the grade of membership A, and vA: X +. M
5
function becomes identical with the characteristic function
a non- of
fuzzy set.
A (x))
( 2 ) we list only those pairs (x,v
with the understanding that in
6
fication. The problem of pA from.
estimating knowledge of the
the. setof
contrary - x in X.
that p (x) is given for all
A
shall employ a wavy bar undera symbol (or symbols) which define x. For
example, if A is the set.of real numbers between2 and 5, i.e., x = {x I
2 <
”
x < 5}, then A = (x I 2 < X 5 5)
- is a fuzzy setof real numbers which
“3
and only if
p,(x) i 0. )
7
Equality: as A = B, if and onlyif
Two fuzzy sets are equal, written
VA
-- VB, that is,vA(x) = pB(x) for allx in X. (In the sequel, to
simplify the notation we shall omit when an equality or
the xargument
PA' = - pA. For example, the fuzzy sets: A =' {tall men} and A' =
{not tall men) are complements of one another if the negation
is "not"
and B.
fined as the largest fuzzy set contained in Aboth The membership
function ofA r
l B is given by
a
The n o t i o n of i n t e r s e c t i o n b e a r s a close r e l a t i o n t o t h e
-
and f a t .
n o t a l l o w any tradeoff between p (x) and p (x) so long as p (x) > (x).
~ . l '
A B A
For example, i f pA(x) = 0.8 and pB(x) = 0.!j9 then pA n ,(x) = 0.5 so long
whichonewants "and" t o b e i n t e r p r e t e d . F o r t h i s r e a s o n , i n t h a t f o l -
A u B i s given by
9
where Max (a,b) = a if a b and Max (a,b) = b if a < b. In infix form,
simply as
and B = (fat men), then A u B = (tall or fat men}. Also, we can differen-
A e B and is defined by
(9).
is defined by
defined by
10
It is easyto verify that
1
A 0 B = (A'B')
B are concave,so is A m ,
11
x = x1 X X2 X.. ..xx"is a fuzzy set X characterized by an n-variate
in
is a fuzzy relationin R
2
. A subjective % in this case
expression for
X.
fuzzy set in Then, the mappingf induces a fuzzy setB in Y whose
of points f
is taken over the set
where the supremum
-1(y) in X which are
is expressed by uB(ylx) .
x ranges
Suppose that the parameter overa space X, so
12
i n Y. Through t h i s mapping, any given fuzzy set A i n -X induces a ' fuzzy
t h a t of a c o n d i t i o n a l d i s t r i b u t i o n .
tation
c = A ~ B
or e q u i v a l e n t l y
where A and B are fuzzy sets w i t h membership f u n c t i o n s o f t h e form
number of spaces.
f i n e d by t h e L e b e s g u e - S t i e l t j es i n t e g r a l
Equivalently,
fuzzy event.
c o n c e p t sr e l a t i n gt of u z z y sets. I nt h ef o l l o w i n gs e c t i o n , we s h a l l u s e
14
3. Fuzzy Goals,'Constraints and Decisions
principal ingredients of
a decision process (a) A set of alterna-
are
alternative the gain (or loss) resulting from the choice of that alter-
native .
a &,
tives. Then, a fuzzy goal or simply G, in X will be identified
G in X.
with a given fuzzy set X = R'
For exampleg if (the real line),
in words as
then the fuzzy goal expressed x should be substantially
I'
1
larger than10" might be represented abyfuzzy setin R whose member-
PG(X) = 0 , x<10
15
= (1 + (x-10) -2)-1 , XLlO.
pG(x),
on the set of alternatives. Clearly, the membership function,
ment.
fo m
such a way as to reflect the sense in which the approximation to the in-
= 4 mand a =
terval [2,10] is to be understood. For example, if we set
5-4, then at x
= 2 and x= 10 we have approximately
1~ (x) = 0.8, while
C
at x = 1 and x = 11, IJC (x) = 0.5; and at x= 0 and x= 12, uc(x) is ap-
proximately equal to0 . 3 .
cepts of goal and constraint is that both are defined as fuzzy sets in
of a decision. By con-
can be treated identically in the formulation
17
*
straints. This similarity - indeed identity - is made explicit
in our
formulation,
I
given by (17)
pressed by (18).
the intersection of fuzzy sets. This implies that in the example under
by
or more explicitly
-2 -1 -4 -1
(x) = Min ((1
l’lGr?C:
+ (x-10) ) , (1 + (x - 15) ) ) for x > 10
-
12.
*For a very thorough discussionof these points see Ref.
18
Note thatG n C is a convex fuzzy set since both
G and C are convex
fuzzy sets.
G and
decision, D, which is a fuzzy set resulting from intersection ofC.
In symbols,
D = G n C
and correspondingly
Lf) = ?JG A PC
19
More generally, suppose that wen goals
have G190.09 G
n
and m constraints C190.e3 Cm. Then, the resultant decision is the inter-
D = ~~nG~ n... n G n
n
cp c2n o o o n cm
and correspondingly
D inforexactly
goals and the constraints enter into the expression
the same way. This is the basis for our earlier statement concerning
20
(8); and may be assigned other concrete meanings when a need f o r a spe-
As a n i l l u s t r a t i o n o f ( 2 1 ) ' we s h a l l c o n s i d e r a very
simple example i n which X = €1'2 $ o . ,101and G19 G2, C and C2 are de-
l
f i n e d below.
2 3 4 5 6 7 8 9 10
9 10
X I 1 2 3 4 5 6 7 8
0 0.1 0.4 0.7 0.8 0.6 0.4 0.2 0 0
D = {(2,O.1),(3,O.4),(4,O.7)9.(5,O.8)9(6yO~6)y(7,O~4)~(8,O~2~~
21
I
Note that nox in X has full (that is, unity grade) membership
D. in
This reflects,of course, the fact that the specified goals and con-
whose fuzziness is
a consequence of the imprecision of the given goals
ship function.
* 3.
The executionof fuzzy instructions is discussed in Ref.
22
M
More generally, let D be a fuzzy decision and let D be
set.
*
D as the intersection or
In defining a fuzzy decision -
- of the goals and constraints, we are
more generally, as the confluence
tacitly assuming that all of the goals and constraints that enter into
ever, in which some of the goals and perhaps some of the constraints are
D might be expressed
of greater importance than others. In such cases,
n m
23
where the Q
i
and 8J. are membership functions such that
n m
x . Qi(X) + c Bj(X) =1
i=1 j=1
(22) can G u C in
generate any fuzzy set which is contained and contains
which the goals and the constraints are fuzzy sets in space of
X, the
in which the goals and the constraints are fuzzy sets in different
24
X which induces Gi in
Y. Specifically, the membership function
of Gi
*
is given by the equality
,i = l,...,n
of
* *
G1 ,...,G and C1,. ..,Cm* Using (23), we can express
1-1 (x) more ex-
n D
plicitly as
where f: X-tY. In this way, the case where the goals and the constraints
are defined as fuzzy sets in different spaces can be reduced to the case
4. i.-
Multistage Decision
~ ~ Processes
of the concepts of fuzzy goal, fuzzy constraint and fuzzy decision, rather
25
which fuzziness enters in one way or another.
X =
t = 0,1,2,..., ranges over a finite set
the state, x at time t,
ty
{o1,.. .,u
n
1 , and the inputut, ranges over a finite Uset
= {al,.. ,a 1 .
m
.
The temporal evolution Aofis described by the state equation
x
t+l
= f (xt, Ut) , t = 0,1,2,. .. (25)
fromto X.
in which f is a given function XxU Thus, f(x u ) represents
t' t
the successor state of x for input u Note, that if f is a random
t t'
function, thenA is a stochastic system whose state at time
is at+l
I xt,ut).
~ ( x ~ + ~
*
Since we will not be concerned with such systems in
a fuzzy Ct , which
constraint is a fuzzy setU characterized
in by a
26
fuzzy setGN in X, which is characterized by a membership' function
1;
1':
/:I:
g
J Problem 1. In this case, the system is assumed to be characterized by
(25), with fa given non-random function. The termination time
N is
2 assumed to be fixed and specified, The initial state, xo, is assumed
a maximizing decision.
to be given. The problem is to find
27
Our problem, then,is to find a sequence of inputs
fo m
u = IT (x,)
t t
,t = 0,1,2,. ..,N-1 ,
28
where
’!
may be regarded
as the membership function
of a fuzzygoal at time t =
which yield the solution to the problem. Thus, the maximizing decision
M is given by the successive maximizing values
U
Of”” UN-19 %-v ofin
(311, with s-vdefined as a function
of xN.-V ’ V = l,..., N.
29
Example. As a simple illustration, consider
a system with three states
u1, cJ2, u3 and two inputs a1 and a 2 . Assume N = 2 for simplicity. Let
respectively by
f in (25) is
The state transition table which defines the function
assumed to be
a1 U
3
U
1
a2 *2
U
3
30
and the Corresponding maximizing decision
is given by
Similarly, for t = 0
and
P(X~+~
(xt,ut). The problem ist o maximize the probability
of attainment
31
"
C ,
N, subject to the fuzzy constraints
of the fuzzy goal at time
0
...,
cN-l
GN is regarded as a fuzzy
If the fuzzy goal event'' in X,
replaced by
32
where, as b e f o r e , 1-1 (x ) . d e n o t e tsh e membership f u n c t i o notfh e
G N-V P-v
fuzzy goal a t t = N -
V induced by t h e f u z z y g o a l at t = N - v + 1 ,
V = 1, ..., N . Theseequationsyield a s o l u t i o n t o theproblem, as i s
i l l u s t r a t e d by t h e f o l l o w i n g example.
t h r e e states
1’
u2, d
3
and two i n p u t s a a
1.’ 2’
N i s assumed t o be
equal
I. u = a 11. u = a
t 1 t 2
U U
Ol 2 u3 u1- 2 u3
%x
0.8 0.1 0.1 0.1 0.9 0
33
Thus , t h e e n t r y 0.8 i n t h e p o s i t i o n (o1, a2) . i n t h e f i r s t t a b l e s i g n i f i e s
t h a t i f thesystem i s i n s t a t e 0 a t time t andinput cx1 is a p p l i e d ,t h e n
1
with probability 0.8 t h e s t a t e a t t i m e t + 1w i l l b e a2.
thepreceding example, t h a t i s
a2
I 0.93 0.42 0.75
34
which correspond to the following values
of the maximal policy function
!
The final iteration v = 2 yields
with
t
function IT
t’
that is, u = IT^ (x,) ( t = 0, 1, xt:= 01, 02, a,, ut =
35
I: .
fuzzy goal induced at N-1
timebefore finding its intersection with CN-l,
the decision process. Althoughwe shall not dwell further upon this aspect
for the first time, a specified subset of the state space. In this case,
setT
the goal is defined as a fuzzyG in , rather X.
than in
A,
More concretely, assume that the system under control,
X
t+l
= f(x u )
t' t
, t = 0,1,2,. .. (34)
where x
t
ranges overX = {u1, .. ,G ,O
R R/bla***' n
u 1, in which =T{at+l,. ..,
0 }>constitutes the termination set.
As before, f is assumed to
be a
n
*In its conventional (non-fuzzy) formulation,
this case plays an important
role in the theoryof optimal control and Markoffian decision processes.
Some of the more relevant papers on this subject are cited in of the list
references.
36
given function from
XxU to X, where U = {al, ...,am} is the range of
u t = 0,1,2,.
t’
.. . Note that if(Si is an absorbing state, that
is, a
1-1 (x ), where
ized by a membership function Nis the time at which
x ET,
G N t
with xt$ T for t< N . As for the constraints on the input, we assume for
simplicity that they are independent of time but not necessarily the
be denoted by vC(utlxt).
is the complement of T X.
in To each such initial state will correspond
a decision,D(xo), given by
(34). Thus
37
....... e . . . . . . . . . .
D(xo).
tion which is satisfied by Specifically, in virtue
of the time-
38
or, using ( 3 4 ) ,
ship functionsof the sets in question, this equation assumes the follow-
'can be written
as p (x
D o
!IT>.Similarly, pc(u0 I x,) can be written as
39
With these substitutions,(41) assumes the more compact form
( 4 3 ) corresponding to thex
that the equations in which are in T1 yield
0
(43)
uniquely by successively solving subsets of the system of equations
40
whose ith component,- = 1,
i ...,ti, is the input which must be applied
when A is in state
0 . U = {al,.
Note that Tr((Ji) ranges over the set ..,
~1
m1 and thus that there are
mR distinct policiesin the policy space.
( 4 3 ) , let
With reference to the system of equations
an (corresponding
the valuesof the membership function of D at ul, ...,
to policy T ). It is naturalto define a partial ordering in the policy
and only if p
D
(Oili'r') 2 pD(aiI?') for i = 1, ..., n. Then, a policy
assertion can be proved rigorously,* but it will suffice for our purposes
41
to regard it as -a
a consequence of the alternation principle14. principle
'theorem.
or equal to both IT'and IT" From this and the finitenessof the
= Max p (IT)
IT
D
42
one i f and o n l y i f D = f (ai, n ( o i ) ) , t h a t is, t h e state u is the
j. j
immediate successorof 0
i
underpolicy T .
which i s t h e d e s i r e d f u n c t i o n a l e q u a t i o n f o r
VD
. A l t h o u g hd i f f e r e n t
d i s t r i b u t i v i t y of Max and A .
Specifically, l e t n
1
,..., r
7~ where r = m
R , d e n o t et h e
becomes
system of e q u a t i o n s i n t h e components of pM
D
, reads
M
where cc = r(ai) = input under policy IT in state oi ; pD (ai) = i th
j
component of the maximal goal attainment vector; f(a a ) = successor
i' j
state* of ai for inputa with f (ai,
j'
a.) = ai for i= ra+l,. ,n (that
J
..
is, fora in the termination set T); pC ( a j lai) = value of the member-
i
C in statea
ship function of the constraint for input a , with uc(a. loi) = 1
i 1 J
for i= R+l, ...,n; and for i= J!,+l,...,n, M
pD (ai) = pG(ai) =
value of the membership function of the given goal G 0at Thus, the:
i *
M .. uDM (ai) , I =
pD (cri), i = 1,. > R , are the unknowns in (51), while the
constants.
denoted by wi M
that is, wi = pD (ai) for i = 1, ... Furthermore,
let the product and plus symbols denote A and v, respectively. Then,
44
bik = of ui ;
0 if crk is not an immediate successor
and
with the understanding that 1-1 (d. ) = 0 for states outside the termina-
G i
tion setT.
w = B w
S
+y , s = 0,1,2,. .. (54)
45
I
the solutionof ( 5 2 1 , that is, to the first
fi components* of the maximal
p p ( a . l a ) cor-
(a.1) are shown alongside. The indicated values of the
G C J i
respond to the constraint sets
46
I
CLfi Ul “2 (5 3 “4 U5
CL a4 “3 U5 U4 U
1 5
a2 O2 (3 1 U4 ‘(55
cL2
1: 1
policy.Forexample,for IT = ( a2, al, a2 ), w e have
P ( a2 , a1, C 2L ) =
0 1 0
0
1
0
d i r e c t i o n of f l o w i n e a c h b r a n c h ( s e e F i g . 3 ) a n d t r e a t i n g t h e s t a t e s i n
47
in which A
Employing the simplified notation and v are
M
= pD
replaced by the product and sum, respectively, wiand (ai), i = 1,
u = B w + y (57)
where
0 1 0
B = 0 1 0.8
0.7 0 0
(0.6, 0 , 0.8).
u2 = (0.6, 0.8, 0 . 8 )
the sources in Fig.3 (which are the absorbing states in Fig.2) generates
48
of diameters ranging from0 to pG(ai). Furthermore, imagine that a
eters are indicated in the diagram and which can carry balls
of lesser
flow technique.
Now assume that it takes one unit of time for the balls
three units of time for any ball to travel from its source to any node
solution of (57).
49
of ( 4 3 ) and the alternation
Turning to the illustration
principle, consider the policy vector 7~ = (al, al, all. For this 7~ ,
the systemof equations ( 4 3 ) becomes
(5 U
1 O2 3
0.6 0 0.8
0.6 0 0.6
0 0 0
0 0 0.8
0 0 0
50
1
1;
I'
Ascheck
IT"
on the
=
alternation
principle,
take
let
us
1,;
!I Note that IT 2 IT'and IT 2 ITTT".
From inspection of the table, the maximal
policy is seen tobe (a,, al, al). which agrees with the result obtained
by iteration.
7. Concluding Remarks
such a theory.
51
o f execution of fuzzy decisions; the way in
these are the question
which the goals and the constraints must be combined when they are of
52
1:
r
References
1. t. A . Z a d e h", F u z z y S e t s , " I n f o r m a t i o na n dC o n t r o l ,v o l . 8 ,
p p . 3 3 8 - 3 5 3 J, u n e , 1965.
i:
> .
2. L. A . Zadeh,
"Toward a T h e o r y of F u z zSyy s t e m s ,I1
3. L. A. Z a d e h , " FIunzfzoyr m
C ao tnidtorno l ,
vo1.12, pp.99-102F
, e b r u a r y 1, 9 6 8
4. s. 5. L. C h a n gI, I F u z z yD y n a m i cP r o g r a m m i n ga n dt h eD e c i s i o n
M a k i n gP r o c e s s , "P r o c .3 dP r i n c e t o nC o n f e r e n c eo nI n f o r m a t i o n
S c i e n c e se n dS y s t e m s ,p p . 2 0 0 - 2 0 3 , 1969.
5. 5. FU a n d T . J . L i , "On t h e B e h a v i oorLf e a r n i n A
g utomata
L a f a y e t t e ,I n d i a n a , Aug. 1968.
p p . 1 4 5 - 1 7 4A
, pril, 1967.
B a l l i s t i c Research L a b o r a t o r i e s ,A b e r d e e n ,M a r y l a n d ,J a n u a r y , 1969.
53
8. A. B e l l m a n , R. K a l a b aa n d L. A . Z a d e h ," A b s t r a c t i o na n dP a t t e r n
C l a s s i f i c a t i o n , "J o u r .M a t h .A n a l .' a n dA p p l . ,v o 1 . 1 3 ,p p . 1 - 7 ,
J a n u a r y , 1966.
9. W. G. Wee, "On G e n e r a l i z a t i o no fA d a p t i v eA l g o r i t h m sa n dA p p l i c a -
t i o n o f t h eF u z z yS e tC o n c e p tt oP a t t e r nC l a s s i f i c a t i o n , "T e c h n i c a l
R e p o r t TR-EE-67-7, P u r d u eU n i v e r s i t y ,L a f a y e t t e ,I n d i a n a ;J u l y , 1 9 6 7 .
Appl.~
, 0 1 . 2 4 p, p . 1 8 2 - 1 9 0 , 1968.
S e r i e s D, p p . 2 3 - 2 9M
, a r c h1, 9 6 2 .
C o n t r o l ," A u t o m a t i c na n dR e m o t eC o n t r o l , v o l . 2 4 ,p p . 3 2 8 - 3 3 0 , M a r c h , 1 9 6 3 .
15. R . E. B e l l m a n ," D y n a m i cP r o g r a m m i n g , l lP r i n c e t o nU n i v .P r e s s ,
P r i n c e t o n , N.J., 1957.
54
1'ij 16. V. G. E o l t y a n s k ' i i , R. V. Gamkrelidze,
and L. S. P o n t r y a g i n , "On
1960.
I. -
24. D. B l a c k w e l l", D i s c o u n t e dD y n a m i cP r o g r a m m i n g , ' l Ann. M a t hS
, tat.,
~ 0 1 . 3 6 , pp.226-235,
1965.
pp.1284-1294,
1966.
27. E. V . D e n a r d o ", C o n t r a c t i o nM a p p i n g s i n t h eT h e o r yU n d e r l y i n g
56
constraint goal
. . . . .. .. .. .
57
Q~ 0.6
0.8
58
0.8
I
8
l
c5 0.8
.
NASA-Langley, 1970 - 19 CR-1594 59