Professional Documents
Culture Documents
I. T VN
H a tc t (multiagent system) l h thng trong
nhiu tc t t ch tng tc vi nhau thc hin
mt s nhim v no . Yu cu quan trng i vi
h a tc t l tc t phi c kh nng phi hp hnh
ng vi nhau sao cho ton h thng hot ng hiu
qu. C ch phi hp c th ci t sn khi xy dng
tc t di dng cc quy c, cc k hoch lp sn,
cc k thut ng b... [15]. Mt phng php m
bo phi hp khc l khng ci t sn c ch ng
b m tc t t hc cch phi hp hnh ng thng
qua kinh nghim thu c trong qu trnh tng tc
vi nhau. Phng php ny c mt s u im nh
trc quan, cho kt qu n nh. Vn t ra l cn
xy dng cc thut ton hc t ng ph hp vi tnh
r
t =0
t =0
R( st , ( st )) l ln nht.
la chn, Ri l hm thng ca tc t i Ri : A R ,
y A l tp cc hnh ng chung ca tt c tc t
0 1 v xc nh mc thay i gi tr Q sau
mi bc hc. Nu mi hnh ng c thc hin
mt s ln v hn cho mi trng thi th gi tr Q s
hi t ti gi tr ti u Q*. Chnh sch ti u * khi
c xc nh t Q* nh sau:
* ( s ) = arg max Q * ( s, a )
(2)
Khi to Q ( s, a ) = 0
Cp nht
Qi ( s, ai j ) =
a = ( a1 ,..., an ), ai = ai j
(Pr(a | ai )( R( s, a)
+ max Qi ( s ', ai ')))
(4)
ai ' Ai
max{Qi ( s, ai j ), R ( s, ai j )
Qi ( s, ai ) =
j
Qi ( s ', ai ')}, (ai Ai )
+ max
'
a
A
i
i
(5)
Qi ( s, ai j ) =
max
a =( a1 ,...,an ),ai = ai j
Q ( s, a )
(6)
a11
A12
a13
A2
10
0
k
A22
0
2
0
3
A2
k
0
10
trong k l tham s sao cho k<10. S dng quy tc
(5), tc t s xy dng bng Q ring nh sau
1
Q1(s0,ai)
Q2(s0,ai)
Nu s dng xc
t bng Q ring
i * ( s ) = arg max a
ai
ai
ai
10
2
10
10
2
10
nh chnh sch hnh ng ti u
da trn cng thc (2) ta c
Ai
(i=1,2) c th tc ng vo vt nng mt lc Fi ,
hnh ng ca tc t. Mt vng bt u t
v tr xut pht S v kt thc nu xy ra mt
trong ba tnh hung sau: 1) tc t y c
vt nng ti ch; 2) vt nng b y ra khi
khng gian cho php; 3) s hnh ng vt
qu mt gii hn cho trc m khng xy
ra hai tnh hung trc. Vic hn ch s
lng hnh ng cho php trnh tnh trng
tc t khng lm g c khi vt nng nm
trn qu o mong mun nhng cha t
ti ch.
Vi mi tc t i
Ti thi im 0:
Khi to
Qi 0 ( s, ai j ) = 0
0 ( s) A
i
Khi to ngu nhin i
Ti thi im t:
Thc hin hnh ng aij
Cm nhn trng thi mi trng s
Xc nh gi tr phn thng
Cp nht
R( s, ai j )
t +1
Nu
max
Qi ( s, ai ) > max
Qi ( s, ai )
j
j
j
ai Ai
th cp nht
v tr ch G
ai Ai
i ( s ) = ai j
Hnh 1. Thut ton hc Q phn tn
qu o
c th
qu o
tt nht P
r
0 | Fi | Fmax di mt gc i . Tng hp tc ng
ca hai tc t c xc nh bi vct lc
r r r
F = F1 + F2 . Lc ny lm vt nng dch chuyn theo
r
trc x v y khong cch tng ng l | F | cos( ) v
r
| F | sin( ) n v. Gi s v tr hin ti l ( x, y ) ,
Px ( y ) l ta x ca qu o P ti cng tung y,
x =| x Px ( y ) | l khong cch theo trc x gia vt
nng v qu o P. Khi phn hi (phn thng) m
mi tc t nhn c cho hnh ng cui cng l
S
Tc t 1
vt nng
Tc t 2
t nu Q t +1 Q t vi l mt s dng nh tu
Trong [10], bi ton di chuyn vt nng vi hai tc
t c gii quyt bng cch s dng thut ton hc
tng cng Bucket Brigade (BB) cho h a tc t. Kt
qu th nghim thut ton trnh by trn c so
snh vi kt qu thut ton BB nh trn th Hnh
3.
Gi tr theo trc tung l khong cch trung bnh t
qu o thc n qu o mong mun. Gi tr theo
trc honh l s lng vng hc ca tc t. Kt qu
thc hin Bucket brigade ly t [10] vi h s hc
=0,6 v cc tham s khc nh K, a, tham s dng
ri rc ho, kch thc khng gian hc, hn ch v s
bc trong mi vng c la chn ging nh trnh
by trn.
Thut ton trnh by trn hi t sau khong 50
vng. Sau khi hi t, tc t hnh ng rt n nh v
di chuyn vt nng theo ng qu o mong mun.
T l tham s v c gi
tr gim dn theo thi
gian. Mc ch chnh ca
phng php ny l cho
php tc t p dng s
dng kinh nghim trc
thu hp khng gian
kho st.
V. KT QU.
Tiu ch chnh nh
gi thut ton hc l s
vng hc cn thit trc
Khong cch ti qu o ti u
eQ ( a ) / T
12
a ' eQ ( a ') / T
10
thut ton Bucket
Brigade
8
6
4
2
0
0
20
40
60
80
S vng hc
100