Professional Documents
Culture Documents
The purpose of the paper is to prove that the family of languages accepted
by finite probabilistic automata is not closed under any of the operations
catenation, catenation closure and homomorphism.
I. INTRODUCTION
Very little is known about the closure properties of the family of stochastic
languages. This is partly due to the fact that methods have not been found
for investigating whether a given language is nonstochastic. Using the
characteristic polynomial of a transition matrix, Paz (1970) managed to find
a context-sensitive language which is not stochastic. The same idea has then
been used by Nasu and Honda (1970) who found a language which is context-
free but not stochastic. Using this language, we proved in (Turakainen, 1970)
that the family of stochastic languages is closed neither under catenation nor
under homomorphism. In this paper, we use another nonstochastic language
and prove that the family of stochastic languages is not closed under catenation
closure. The same fundamental language is applicable to the establishing
of the above results on catenation and homomorphism.
"½ o o o o o ½
o ~ o ooo~ 0
OOl 0 0 0 0 0
o o o 1 0 0 0 0
M(x) = 0 0 0 o]o~ 0 ,
000 o o ½½ 0
000 0 0 0 1 0
000 0 0 0 1 0
000 0 0 0 1 04
and
"0 0 ½ o ½ 0 0 0-
0 0 0 o 0 0
0 0 ½ o 0 0 0
0 0 0 ½ o ½ 0 0
M(y) = 0 0 0 00 00 0
000 00 0 0 1
000 00 01 0
000 00 01 0
000 00 01 0
THE FAMILY OF STOCHASTIC LANGUAGES 255
Denote
L 1 = {P ~ (x + y)* [ % M ( e ) f o -----0}.
If we draw the graph of GPA, we easily see that the regular language
~-~x*y(x*y)* x*y is a subset of L 1 . Denote this language by L a . Let P
be an arbitrary word not belonging to L 2 . I t is of the form P -~ x~yQx~y,
where k, l / > 0 and Q ~ ( x * y ) * . Denote by q the n u m b e r o f y ' s in Q.
F r o m the graph of G P A we now obtain
Proof. W e use the same method as Nasu and H o n d a (1970). Assume that
L = L ( P A , ~), where P A = (S, M, % , fo), and let the characteristic equation
of M ( x ) be
I I I . THEOREMS
Proof. Our theorem follows from L e m m a s 2 and 3 and from the fact
that mi(L) = (x + y)* mi(L~), because mi(Ls) is stochastic b u t mi(L) is not
(el. Turakainen, 1969b).
Clearly, ~roM(xip)fo > ~7 if and only if i is one of the numbers k 1 ,..., h~.
This leads to a contradiction in the same way as in L e m m a 3.
REFERENCES
Nasty, M., AND HONDA, N. (1970), A context-free language which is not accepted by
a probabilistic automaton, unpublished.
PAZ, A. (1970), "Formal Series, Finiteness Properties and Decision Problems,"
Technical Report No. 4, Israel Inst. of Technology, Dept. Comput. Sci., Haifa.
TURAKAINEN,P. (1968), On stochastic languages, Information and Control 12, 304-313.
TtmAKnINEN,P. (1969a), On languages representable in rational probabilisfic automata,
Ann. Acad. Sci. Fenn. Ser. A I 439.
TURAKAINEN,P. (1969b), Generalized automata and stochastic languages, Proc. Amer.
Math. Soc. 21, 303-309.
TDRAKAINEN,P. (1970), The family of stochastic languages is closed neither under
catenation nor under homomorphism, Ann. Univ. Turku. Set. A I 133.