1
Each observation is binomial, with likelihood
$$p(y_i|\theta_i) \propto \theta_i^{y_i}(1-\theta_i)^{n_i-y_i}$$
The Laplace approximation for a binomial via moment matching is thus $y_i \sim N[n_i\theta_i,\, n_i\theta_i(1-\theta_i)]$. A Beta distribution has the Laplace approximation
$$\theta^{\alpha-1}(1-\theta)^{\beta-1} \approx N\left[\frac{\alpha-1}{\alpha+\beta-2},\; \frac{(\alpha-1)(\beta-1)}{(\alpha+\beta-2)^3}\right]$$
Thus our prior has the approximation
$$\theta_i \approx w_1\, N\left[\frac{\alpha_1-1}{\alpha_1+\beta_1-2},\; \frac{(\alpha_1-1)(\beta_1-1)}{(\alpha_1+\beta_1-2)^3}\right] + (1-w_1)\, N\left[\frac{\alpha_2-1}{\alpha_2+\beta_2-2},\; \frac{(\alpha_2-1)(\beta_2-1)}{(\alpha_2+\beta_2-2)^3}\right]$$
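As a quick sanity check of the Beta approximation, the exact and approximate densities can be overlaid in R (the alpha and beta values here are arbitrary, purely for illustration):

a <- 4; b <- 7
m <- (a - 1) / (a + b - 2)                 # mode of Beta(a, b)
v <- (a - 1) * (b - 1) / (a + b - 2)^3     # Laplace variance
th <- seq(0.01, 0.99, by = 0.01)
plot(th, dbeta(th, a, b), type = "l", xlab = "theta", ylab = "density")
lines(th, dnorm(th, m, sqrt(v)), lty = 2)  # normal approximation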
I believe we want to calculate the distribution $p(Y|\alpha_j, \beta_j, w_1)$:
$$p(Y|\alpha_j, \beta_j, w_1) = \prod_i^K \int N[n_i\theta_i,\, n_i\theta_i(1-\theta_i)]\; p(\theta_i|\alpha_j, \beta_j, w_1)\, d\theta$$
This would appear to be a complete mess that I spent far too much time attempting to sort out. However, if we multiply the likelihood and the prior together first:
$$\theta_i^{y_i}(1-\theta_i)^{n_i-y_i}\left[w_1\,\theta_i^{\alpha_1-1}(1-\theta_i)^{\beta_1-1} + (1-w_1)\,\theta_i^{\alpha_2-1}(1-\theta_i)^{\beta_2-1}\right] = \sum_j w_j\,\theta_i^{y_i+\alpha_j-1}(1-\theta_i)^{n_i-y_i+\beta_j-1}$$
which can then be approximated more easily by
$$w_1\, N\left(\frac{y_i+\alpha_1-1}{n_i+\alpha_1+\beta_1-2},\; \frac{(y_i+\alpha_1-1)(n_i-y_i+\beta_1-1)}{(n_i+\alpha_1+\beta_1-2)^3}\right) + (1-w_1)\, N\left(\frac{y_i+\alpha_2-1}{n_i+\alpha_2+\beta_2-2},\; \frac{(y_i+\alpha_2-1)(n_i-y_i+\beta_2-1)}{(n_i+\alpha_2+\beta_2-2)^3}\right)$$
This has total log-likelihood
$$\sum_i^n \ln \sum_j^2 w_j \left(\frac{(y_i+\alpha_j-1)(n_i-y_i+\beta_j-1)}{(n_i+\alpha_j+\beta_j-2)^3}\right)^{-1/2} \exp\left[-\left(\frac{y_i}{n_i} - \frac{y_i+\alpha_j-1}{n_i+\alpha_j+\beta_j-2}\right)^2 \bigg/\, 2\,\frac{(y_i+\alpha_j-1)(n_i-y_i+\beta_j-1)}{(n_i+\alpha_j+\beta_j-2)^3}\right]$$
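This log-likelihood is straightforward to evaluate numerically. A sketch in R (helper names are my own; dnorm carries the $2\pi$ constant the expression above drops):

# Mean and variance of the approximated posterior component j
comp_mean <- function(y, n, a, b) (y + a - 1) / (n + a + b - 2)
comp_var  <- function(y, n, a, b)
  (y + a - 1) * (n - y + b - 1) / (n + a + b - 2)^3

# Total log-likelihood of the two-component normal approximation
mix_loglik <- function(y, n, a, b, w1) {
  w <- c(w1, 1 - w1)
  dens <- sapply(1:2, function(j)
    w[j] * dnorm(y / n, comp_mean(y, n, a[j], b[j]),
                 sqrt(comp_var(y, n, a[j], b[j]))))
  sum(log(rowSums(dens)))
}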
For simplicity in further notes, let the mean be $u_j$ and the variance $\sigma_j^2$. We will want to set up an EM algorithm as follows for that mess:
0) Select initial estimates for $\alpha_j^0, \beta_j^0, w_1^0$
1) E-Step: Find the expected value of a hidden variable $\gamma_{i,j}$ for each observation based on current estimates of $\alpha_j^t, \beta_j^t, w_j^t$:
$$\gamma_{i,j} = \frac{(w_j^t/\sigma_j^t)\exp\left[-(y_i/n_i - u_j^t)^2/2(\sigma_j^t)^2\right]}{\sum_k (w_k^t/\sigma_k^t)\exp\left[-(y_i/n_i - u_k^t)^2/2(\sigma_k^t)^2\right]}$$
2) M-Step: Set $w_j^{t+1} = \sum_i^n \gamma_{i,j}/n$ and solve
$$\arg\max_{\alpha_j, \beta_j} \sum_i^n \sum_j^2 \gamma_{i,j}^t \ln\left(\frac{1}{\sigma_j}\exp\left[-(y_i/n_i - u_j)^2/2\sigma_j^2\right]\right)$$
where $u_j$ and $\sigma_j^2$ are the functions of $\alpha_j, \beta_j$ given above.
This is actually a massive pain to solve, and I have not managed to do so. As such, only skeleton code for the EM algorithm is submitted; a sketch of how it might be fleshed out follows.
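That skeleton, reusing comp_mean() and comp_var() from above (again, the names are my own, and the M-step falls back on numerical optimization with optim() in place of the analytic solution I could not find):

em_beta_mixture <- function(y, n, a, b, w1, max_iter = 100) {
  w <- c(w1, 1 - w1)
  for (iter in 1:max_iter) {
    # E-step: responsibilities gamma[i, j]
    dens <- sapply(1:2, function(j)
      dnorm(y / n, comp_mean(y, n, a[j], b[j]),
            sqrt(comp_var(y, n, a[j], b[j]))))
    gam <- sweep(dens, 2, w, "*")
    gam <- gam / rowSums(gam)
    # M-step: update the weights, then maximize the weighted
    # log-likelihood over (alpha_j, beta_j) numerically
    w <- colMeans(gam)
    for (j in 1:2) {
      obj <- function(par)
        -sum(gam[, j] *
             dnorm(y / n, comp_mean(y, n, par[1], par[2]),
                   sqrt(comp_var(y, n, par[1], par[2])), log = TRUE))
      fit <- optim(c(a[j], b[j]), obj, method = "L-BFGS-B",
                   lower = c(1 + 1e-6, 1 + 1e-6))  # keep alpha, beta > 1
      a[j] <- fit$par[1]
      b[j] <- fit$par[2]
    }
  }
  list(alpha = a, beta = b, w = w)
}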
2

$u_i$ is Normal$(u, \tau^2)$, so $v_i = a + bu_i$ is Normal$(a+bu,\, b^2\tau^2)$. With the constraint of $\Sigma$ being diagonal, and independent errors for $x$ and $y$ ($\sigma^2 = \sigma_y^2 = \sigma_x^2$), we thus have:
$$\begin{pmatrix} x_i \\ y_i \end{pmatrix} \sim N\left(\begin{pmatrix} u \\ a+bu \end{pmatrix},\; \begin{pmatrix} \tau^2+\sigma^2 & 0 \\ 0 & b^2\tau^2+\sigma^2 \end{pmatrix}\right)$$
Thus the likelihood given $n$ observations is
$$L[(x,y)|a,b,u,\tau,\sigma] = \left(\frac{1}{2\pi\sqrt{(\tau^2+\sigma^2)(b^2\tau^2+\sigma^2)}}\right)^n \exp\left[-\frac{1}{2}\left(\sum_i \frac{(x_i-u)^2}{\tau^2+\sigma^2} + \sum_i \frac{(y_i-a-bu)^2}{b^2\tau^2+\sigma^2}\right)\right]$$
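Although I did not manage to build a sampler around this form, the likelihood itself evaluates directly. A sketch in R (function name mine; dnorm again supplies the normalizing constants):

eiv_loglik <- function(par, x, y) {
  # par = (a, b, u, tau2, sig2): intercept, slope, latent mean, variances
  a <- par[1]; b <- par[2]; u <- par[3]; tau2 <- par[4]; sig2 <- par[5]
  vx <- tau2 + sig2            # marginal variance of x_i
  vy <- b^2 * tau2 + sig2      # marginal variance of y_i
  sum(dnorm(x, u, sqrt(vx), log = TRUE)) +
    sum(dnorm(y, a + b * u, sqrt(vy), log = TRUE))
}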
Given the structure of these two observations (MV Normal with marginal Normals), it makes sense to me to model this as a regression problem. In particular, we have
$$x_i = u_i + \epsilon_1, \qquad y_i = a + bu_i + \epsilon_2, \qquad \epsilon_i \sim N(0, \sigma^2)$$
$$y_i = a + bx_i + \epsilon_2 - b\epsilon_1$$
Thus we end up with:
$$y_i \sim N(a+bx_i,\; b^2\sigma^2+\sigma^2), \qquad Y|X,\sigma^2,a,b \sim N_{56}(a+bX,\; (b^2\sigma^2+\sigma^2)I)$$
However, after considerable issues attempting to sort out how to set this up, I give up and fall back to a regression model with Zellner's g-prior (thus the estimate will have an incorrect variance structure):
$$y|\beta,\sigma^2,X \sim N_{56}(X\beta,\, \sigma^2 I), \qquad P(\beta|\sigma^2,X) \sim N_2(0,\; c\,\sigma^2(X^TX)^{-1}), \qquad P(\sigma^2) \propto \sigma^{-2}$$
Posterior:
$$\propto N_{56}(X\beta,\, \sigma^2 I)\; N_2(0,\; c\,\sigma^2(X^TX)^{-1})\; \sigma^{-2}$$
Conditionals:
$$P(\beta|\sigma^2,y,X) \sim N_2\left(\frac{c}{c+1}\hat{B},\; \frac{c}{c+1}\sigma^2(X^TX)^{-1}\right)$$
$$P(\sigma^2|y,X,\beta) \sim IG\left(\frac{56}{2},\; \frac{s^2}{2} + \frac{1}{2(c+1)}(\beta-\hat{B})^T X^TX(\beta-\hat{B})\right)$$
Using a Gibbs sampler:
1) Select initial $\beta^0$, $\sigma^{2,0}$
2) Update with
$$\sigma^{2,t+1} \sim P(\sigma^2|\beta^t, y, X)$$
$$\beta^{t+1} \sim P(\beta|\sigma^{2,t+1}, y, X)$$
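A minimal sketch of that sampler in R (the function name is mine, and the default g = n for the g-prior constant c is an assumption, not something fixed above):

gibbs_gprior <- function(y, X, g = nrow(X), n_iter = 5000) {
  n    <- nrow(X); p <- ncol(X)
  XtX  <- crossprod(X)
  bhat <- drop(solve(XtX, crossprod(X, y)))  # least-squares estimate
  s2   <- sum((y - X %*% bhat)^2)            # residual sum of squares
  out  <- matrix(NA, n_iter, p + 1)
  colnames(out) <- c(paste0("beta", 1:p), "sigma2")
  beta <- bhat
  for (it in 1:n_iter) {
    # sigma^2 | beta, y, X ~ IG(n/2, s^2/2 + (beta-bhat)'X'X(beta-bhat)/(2(g+1)))
    d    <- beta - bhat
    rate <- s2 / 2 + drop(t(d) %*% XtX %*% d) / (2 * (g + 1))
    sig2 <- 1 / rgamma(1, shape = n / 2, rate = rate)
    # beta | sigma^2, y, X ~ N(g/(g+1) bhat, g/(g+1) sigma^2 (X'X)^{-1})
    beta <- MASS::mvrnorm(1, (g / (g + 1)) * bhat,
                          (g / (g + 1)) * sig2 * solve(XtX))
    out[it, ] <- c(beta, sig2)
  }
  out
}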
We arrive at the estimates:

             Estimate     Variance
a            5.1437016    0.051504521
b           -0.2699121    0.001409253
$\sigma^2$   1.1667646    0.051273862
Convergence plots are in p2.png.

3

Our K observations are $y_i$, the number of survivors out of $n_i$ total, with three factor variables: sex, age, and passenger class. Using a logistic model with a flat prior, we thus have the posterior (in a form that makes it easier to work with in R):
$$p[B|X,(Y,N)] \propto \prod_{i=1}^K (\theta_i)^{y_i}(1-\theta_i)^{n_i-y_i}$$
$$p[B|X,(Y,N)] \propto \prod_{i=1}^K \left(\frac{\exp[X_iB]}{1+\exp[X_iB]}\right)^{y_i} \left(\frac{1}{1+\exp[X_iB]}\right)^{n_i-y_i}$$
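Written that way, the log posterior transcribes almost directly into R (X, y, and n are assumed to already hold the model matrix, survivor counts, and group totals):

log_post <- function(B, X, y, n) {
  eta <- drop(X %*% B)
  # binomial log-likelihood under the logit link; the flat prior adds nothing
  sum(y * eta - n * log1p(exp(eta)))
}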
The initial estimate of B is the maximum likelihood estimator. Metropolis-Hastings then updates by (a sketch follows this list):
1) Generate $B^* \sim N_k(B^{t-1}, T\Sigma)$, with $T$ as a scale factor
2) Calculate $p = \min\left(1,\; \frac{p(B^*|y)}{p(B^{t-1}|y)}\right)$
3) Update $B^t = B^*$ with probability $p$, $B^{t-1}$ with probability $1-p$
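A minimal sketch of those updates, reusing log_post() from above (here Sigma is assumed to be the covariance of the MLE fit and Tscale the scale factor $T$, both my own names):

mh_logistic <- function(X, y, n, B0, Sigma, Tscale = 1, n_iter = 5000) {
  B   <- B0
  lp  <- log_post(B, X, y, n)
  out <- matrix(NA, n_iter, length(B0))
  for (it in 1:n_iter) {
    Bstar  <- MASS::mvrnorm(1, B, Tscale * Sigma)  # step 1: proposal
    lpstar <- log_post(Bstar, X, y, n)
    # steps 2-3: accept with probability min(1, posterior ratio)
    if (log(runif(1)) < lpstar - lp) {
      B  <- Bstar
      lp <- lpstar
    }
    out[it, ] <- B
  }
  out
}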
Our parameter estimates for a no-interaction model are:

              Estimate     Variance
(Intercept)   2.0629318    0.03059069
sexmale      -2.4375442    0.02177024
childkid      1.0813796    0.05988409
class2       -1.0128018    0.03982714
class3       -1.7921001    0.03021637
classC       -0.8602009    0.02758146
This is with 5000 samples after a burn-in of 10000. Graphs for convergence are in p3m1a.png. Convergence seems fine. And for the interaction model:
                   Estimate      Variance
(Intercept)         3.8332383     0.4698347
childkid           41.639925     85.9295259
sexmale            -4.5582154     0.4948292
class2             -1.9454472     0.6169035
class3             -3.9939029     0.4848646
classC             -1.8018233     1.031236
childkid:sexmale    0.6690028     0.2806819
childkid:class2   -28.870286    114.4440457
childkid:class3   -41.665613     85.8944413
sexmale:class2      0.2339378     0.7066308
sexmale:class3      3.0609728     0.5219417
sexmale:classC      1.2792995     1.0829465
This is with 10,000 samples after a burn-in of 100,000. Graphs for convergence are in p3m2a.png. We still do not have good convergence for several variables, particularly childkid and its interactions with class. While these are notably different from the MLE estimates, further inspection (i.e., plugging them into the link function and calculating the probability) indicates they make some sense. Effectively these estimates make a stronger statement for a high survival rate of the base case (female, first class) compared with the MLE estimate, which has the base-case probability more or less at 0.5, with the other estimates pulling down the probability (or drastically increasing it!) for various cases. I was not able to satisfactorily calculate Bayes factors - is this because of the flat prior or my mistake?