Bayesian Parameter Estimation
•
Let
y
be distributed according to a
parametric family
:
y
∼
f
θ
(
y
)
. The goal is, given iid observations
{
y
i
}
, toestimate
θ
. For instance, let
{
y
i
}
be a series of coinflips where
y
i
= 1
denotes “heads” and
y
i
= 0
denotes “tails”. The coin is weighted, so
P
(
y
i
= 1)
can be other than
1
/
2
. Let us define
θ
=
P
(
y
i
= 1)
;our goal is to estimate
θ
. This simple distribution isgiven the name “Bernoulli”.
•
Without prior information, we use the
maximumlikelihood
approach. Let the observations be
y
1
...y
H
+
T
. Let
H
be the number of heads observedand
T
be the number of tails.
ˆ
θ
= argmax
f
θ
(
y
1:
H
+
T
)= argmax
θ
H
(1
−
θ
)
T
=
H/
(
H
+
T
)
•
Not surprisingly, the probability of heads is estimatedas the empirical frequency of heads in the datasample.
•
Suppose we remember that yesterday, using the samecoin, we recorded 10 heads and 20 tails. This is one
2
Leave a Comment
Those who know this subject, know how useful it is in any science. I have used Bayesian estimation for assessing diversity and obtained excellent results. There is nothing funny about my earlier comments except for ignorant
I believe this was given as a handout for a lecture in one of Julius Smith's or Bosi's courses at CCRMA. I don't recall which one though.