Outline
Background: Bayesian approaches to LV
Outline (continued)
Different sorts of spatial factor model
Case Studies
Social capital & mental health, multilevel
Background
SEM and factor models originate in (& are still
most widely used in) psychological,
educational & behavioural applications.
Recent Bayesian applications to
psychological & education testing data
include SEM (e.g. Lee & Song, 2003), LCA,
item analysis, and factor analysis per se (e.g.
Aitkin & Aitkin, 2005; Press & Shigemasu,
1998).
Also some work on automated Bayesian
model choice in normal linear factor model
standard
assumptions often built into classical
estimation methods (e.g. factor scores
multivariate normal & independent over
subjects)
Advantage in generalizations such as
nonlinear factor effects, multiplicative factor
schemes
Bayesian Computing
Many Bayesian applications to SEM and
WINBUGS
Despite acronym, WINBUGS employs
# PRIORS
for (j in 1:4) {
  alph[j] ~ dnorm(0, 0.001)
  # gamma prior on precisions
  tau[j] ~ dgamma(1, 0.001)
  # alternative prior starts with s.d. of residuals
  # sd.y[j] ~ dunif(0,100); tau[j] <- 1/(sd.y[j]*sd.y[j])
  # identifiability constraint on loadings to ensure
  # positive alienation measure
  lam[j] ~ dnorm(1, 1) I(0,)
}
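As a reading aid for the priors above: BUGS parameterises `dnorm` by precision (1/variance), and `I(0,)` truncates the distribution to positive values. The Python sketch below illustrates both points using only the standard library. The function names are hypothetical, and the rejection sampler is just one simple way to draw from a truncated normal, not what WINBUGS does internally.

```python
import math
import random

def precision_to_sd(tau):
    """Convert a BUGS-style precision (1 / variance) to a standard deviation."""
    return 1.0 / math.sqrt(tau)

def draw_truncated_normal(mean, tau, lower, rng):
    """Draw from N(mean, 1/tau) restricted to values above `lower` (rejection sampling)."""
    sd = precision_to_sd(tau)
    while True:
        x = rng.gauss(mean, sd)
        if x > lower:
            return x

rng = random.Random(1)

# Standard deviation implied by alph[j] ~ dnorm(0, 0.001)
alph_sd = precision_to_sd(0.001)

# Four draws mimicking lam[j] ~ dnorm(1, 1) I(0,)
lam = [draw_truncated_normal(1.0, 1.0, 0.0, rng) for _ in range(4)]

print(round(alph_sd, 2))
print(all(x > 0 for x in lam))
```

A precision of 0.001 corresponds to a standard deviation of about 31.6, so the prior on the intercepts is diffuse, while the loading prior is concentrated near 1 and constrained to be positive.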
Spatial Priors
My focus: CAR priors for lattices (e.g.
administrative areas)
These are priors for structured effects
(where labels of area units are important)
as opposed to unstructured effects
(exchangeable, i.e. unaffected by
relabelling of the areas)
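To make the structured/unstructured distinction concrete, here is a minimal Python sketch of the conditional distribution implied by an intrinsic CAR prior with binary adjacency weights: each area's effect is centred on the average of its neighbours, with variance inversely proportional to the number of neighbours. The four-area lattice, the values and the function name are assumptions made for illustration only.

```python
def icar_conditional(i, values, neighbours, sigma2=1.0):
    """Conditional mean and variance of area i's effect given the other areas,
    under an intrinsic CAR prior with binary adjacency weights."""
    nbrs = neighbours[i]
    mean = sum(values[j] for j in nbrs) / len(nbrs)
    var = sigma2 / len(nbrs)
    return mean, var

# Hypothetical lattice of four areas in a row: 0 - 1 - 2 - 3
neighbours = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
values = [0.5, 0.1, -0.2, -0.4]

mean_1, var_1 = icar_conditional(1, values, neighbours)

# Swapping the values of areas 0 and 3 changes area 1's conditional mean:
# under a structured prior the labelling of areas matters.
swapped = [values[3], values[1], values[2], values[0]]
mean_1_swapped, _ = icar_conditional(1, swapped, neighbours)

print(mean_1, var_1, mean_1_swapped)
```

Under an exchangeable (unstructured) prior the same swap would leave the prior for area 1 unchanged, which is the contrast drawn above.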
Substantive Basis
Generally taken to represent
DIFFERENT TYPES OF
COMMON SPATIAL FACTOR
MANIFEST VARIABLES:
AREA HEALTH VARIABLES
Types of Event
May be deaths, hospitalizations, incidence
For $\lambda_j s_i$
can have:
a) $\lambda_j$ all positive combined with $s_i$ acting as
positive measure of health risk (higher $s_i$ in
areas with higher cancer rates)
OR
b) $\lambda_j$ all negative combined with $s_i$ acting
as negative measure of health risk ($s_i$
higher in areas with lower cancer rates)
e.g.
$\log(\mu_{ij}) = \alpha_j + \lambda_j s_i + \phi_j s_i^2$
Or: spline for nonlinear effects in $s_i$
Linear Spline
Then linear spline
$\log(\mu_{ij}) = \alpha_j + \lambda_j s_i + \sum_k b_{jk} (s_i - \kappa_k)_+$
$b_{jk}$ might be random effects, but raises
identification issues?
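The truncated-line terms in a linear spline can be sketched in a few lines of Python. This assumes the usual "positive part" basis, where each term is zero below its knot and linear above it. The knot locations, coefficients and function names are hypothetical.

```python
def linear_spline_basis(s, knots):
    """Truncated linear basis terms (s - knot)_+ for a single factor score s."""
    return [max(s - k, 0.0) for k in knots]

def log_mean(s, intercept, slope, spline_coefs, knots):
    """Intercept + linear term + sum of spline coefficients times basis terms."""
    basis = linear_spline_basis(s, knots)
    return intercept + slope * s + sum(b * z for b, z in zip(spline_coefs, basis))

knots = [-1.0, 0.0, 1.0]            # hypothetical knot locations
basis = linear_spline_basis(0.5, knots)
value = log_mean(0.5, 0.0, 1.0, [0.2, -0.4, 0.1], knots)

print(basis)   # [1.5, 0.5, 0.0]
print(value)
```

Each spline coefficient changes the slope of the factor effect beyond its knot, which is why treating all of them as free random effects can strain identification alongside the main loading.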
INDICATOR BASED
SPATIAL CONSTRUCTS
(j=1,...,J); e.g.
mortality or incidence counts
Social Indicators $Z_{ik}$ (k=1,...,K); e.g. census
rates of unemployment
Typical Scenario: multiple common spatial
factors ($F_{1i},\ldots,F_{Qi}$) primarily measured by Z
variables (indicators established as
relevant).
2 class model
But Factors F also act to potentially
Example
$g(\nu_{i5}) = \gamma_5 F_{2i} + w_{i5}$
MODEL CHOICE
typically slow
in models with many random effects (such
as factor scores)
Slow convergence also applies to other
measures of fit, e.g. Monte Carlo
estimates of conditional predictive
ordinates
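For reference, one widely used Monte Carlo estimate of a conditional predictive ordinate is the harmonic mean of the observation's likelihood over retained MCMC draws. The slides do not say which estimator was used, so the Python sketch below is an assumption on that point. It works on the log scale for numerical stability, and the input values are made up.

```python
import math

def log_cpo(loglik_draws):
    """Harmonic-mean Monte Carlo estimate of log CPO for one observation,
    given its log-likelihood evaluated at each retained MCMC draw."""
    neg = [-ll for ll in loglik_draws]
    m = max(neg)
    # log of the average inverse likelihood, via a log-sum-exp for stability
    log_mean_inv = m + math.log(sum(math.exp(v - m) for v in neg)) - math.log(len(neg))
    return -log_mean_inv

# Hypothetical log-likelihood values for one area over four draws
draws = [math.log(0.2), math.log(0.25), math.log(0.2), math.log(0.25)]
cpo = math.exp(log_cpo(draws))
print(cpo)
```

Because the estimate averages inverse likelihoods, a few draws with very small likelihood can dominate it, which is one reason such estimates stabilise slowly.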
Model selection alternatives
factors
Within area covariance matrix in
MCAR prior denoted F
CASE STUDIES
Latent Risks
Finally Pr(Y=1) also related to latent
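Use
$\eta_i = \beta_{1,\mathrm{gend}[i]} + \beta_{2,\mathrm{eth}[i]} + \beta_{3,\mathrm{noqual}[i]} + \beta_{4,\mathrm{urb}[i]} + \beta_{5,\mathrm{reg}[i]} + \beta_{6,\mathrm{dep}[i]}$.
$\beta$: fixed effects parameters with reference
category (zero coefficient) for identification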
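The index-based form of this predictor, with one coefficient looked up per categorical covariate and a zero coefficient for the reference category, can be sketched in Python as follows. The coefficient values, the number of levels, and the choice of level 1 as the reference are all assumptions for illustration. Only three of the six covariates are shown.

```python
def linear_predictor(person, effects):
    """Sum one coefficient per categorical covariate, indexed by the person's level."""
    return sum(effects[name][person[name]] for name in effects)

# Hypothetical coefficients; level 1 of each factor is the reference (coefficient 0)
effects = {
    "gend": {1: 0.0, 2: 0.3},
    "eth": {1: 0.0, 2: -0.2, 3: 0.1},
    "noqual": {1: 0.0, 2: 0.5},
}

person = {"gend": 2, "eth": 3, "noqual": 1}
reference_person = {"gend": 1, "eth": 1, "noqual": 1}

eta = linear_predictor(person, effects)
eta_ref = linear_predictor(reference_person, effects)
print(eta, eta_ref)
```

Fixing one level per factor at zero means every other coefficient is read as a contrast with that reference level.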
Only small number of regions in HSE
If we had finer spatial detail, could take area
effects as spatially structured random effects
(but weak identification?)
Effect of F on Y
Multinomial Categories
Model Form
Model includes
(c=1,..,3142) to be
spatially correlated CAR
But $u_s$ (state effects, s=1,...,51) taken to be
unstructured.
Avoids confounding of two spatially
structured effects
indicators
Q=3 latent constructs (F1 fragmentation, F2
deprivation, F3 urbanicity). Converse of F3
is rurality. Common spatial factors.
Geographic Framework
N=1118 small areas (called wards,
Confirmatory Sub-Model
Confirmatory Z-on-F model
Each indicator $Z_k$ loads on a single
construct $F_q$.
Most indicators binomial. A few taken as
normal after transformation. Mostly 2001
Census, a few non-census (service
access score, proportion greenspace)
$G_k \in \{1,2,3\}$ denotes
which construct it loads on.
Regression with link g allows for
overdispersion via unique w effects
$g(\nu_{ik}) = \delta_k + \gamma_{k,G_k} F[G_k,i] + w_{ik}$
Expected Direction of
Confirmatory Model Loadings
Coefficient selection on
Redundant Coefficients
Some coefficients (e.g. urbanicity on male
More generally
Bayesian software options for latent