Wei Dai’s Updateless Decision Theory
Tyrrell McAllister
Abstract
Wei Dai developed an approach to decision theory called UpdatelessDecision Theory (UDT). This note describes two versions, both due toDai, of the UDT formalism.
This note is just a more-pedantic version of Wei Dai’s original descrip-tions of Updateless Decision Theory (UDT). The first section gives the orig-inal version of UDT [2], which Dai now calls “UDT1”. In the next section,I give a similar description of UDT1.1, an improvement due to Dai[1]. Inboth cases, I’ve stripped out almost all motivation.
1 UDT1
Let
=
1
,
2
,...
be a sequence of programs. To specify a UDT1 algo-rithm, one must specify the following:
a set
X
of inputs that the UDT1 algorithm might receive;
a set
Y
of outputs that the UDT1 algorithm might produce in responseto an input;
for each program
i
in
, a set
E
i
of execution histories that
i
mightproduce (so far as the algorithm or its writers know);
a utility function
:
E
R
, where
E
:=
E
1
×
E
2
× ···
;
a
mathematical intuition
, which is a function that takes threearguments, namely
an input
X
,
an output
Y
, and
a sequence
=
1
,
2
,...
E
,1
 
and returns a probability
(
X,Y,E 
) that
i
would be the executionhistory of 
i
for all
i
if the UDT1 algorithm were to return output
upon receiving input
1
.(Note that, since
(
X,Y,E 
) is a probability, the function
must benonnegative and normalized for each choice of 
X
and
Y
. That is,for every
X
and
Y
, we must have that
E
(
X,Y,E 
) = 1.)Suppose that the algorithm receives input
X
. To determine itsresponse, the algorithm computes, for every possible output
Y
, theexpected utility
EU 
X
(
):=
E
(
X,Y,E 
)
(
)of producing output
. The algorithm then produces the output
Y
that maximized
EU 
X
. That is, the algorithm returns that output
Y
such that
EU 
X
(
) = max
Y
EU 
X
(
)
.
2 UDT1.1
Dai later suggested an improvement to UDT, which he dubbed “UDT1.1”(the original UDT now to be called “UDT1”).A UDT1.1 algorithm is specified in the same way as a UDT1 algorithm,with the following exceptions:
An additional set needs to be specified — namely, a set
I
of input-output mappings that the algorithm may (for all it knows) be imple-menting. Thus, an element
I
is a function
:
X
Y
.
The mathematical intuition function
now has a different domain:it now takes only
two
arguments, namely
an input-output mapping
I
and
a sequence
=
1
,
2
,...
E
,and returns a probability
(
f,
) that
i
will be the execution historyof 
i
for all
i
if the UDT1.1 algorithm were to implement the input-output mapping
:
X
Y
.
1
I believe that Wei Dai generally supposes that
has complete access to the sourcecodes of the programs
i
. This isn’t strictly necessary for the UDT1 formalism, however.The UDT1 algorithm only needs to know enough about the
i
to “intuit” probabilities forpossible execution histories as functions of how the algorithm responds to inputs. SinceDai does not go into precisely how the mathematical intuition works, the extent of itsaccess to the source codes of the
i
is not part of the UDT1 formalism
2
View on Scribd