928
RICHARD SCHEINES
Figure 1. Causal graph.
have developed algorithms for determining the
set
of causal structuresthat are consistent with the independence relations assumed to hold overa set of measured variables in a non-experimental, or observational study,even causal structures that include latent, or unmeasured variables. Insome instances all the causal structures in an equivalence class agree onsome subset of the causal relations, and in some cases they all agree thatone variable
X
is a cause or direct cause of another
Y
. In these cases wecan, using basically the same assumptions as are made in experimentalstudies, infer that
X
is a cause of
Y
.Although we have no general characterization of the conditions underwhich a causal inference to can be made in observational studies,
X
r
Y
it turns out that when the inference is possible it is often driven by theexistence of what I call a
detectible instrumental variable
that stands inthe same relationship to
X
and
Y
in the observational study as does theideal intervention on
X
in the experimental study. In what follows,I brieflysketch the key ideas behind the representational system, I show how anexperimental causal inference works in this system, how the typical ob-servational causal inference works involving detectible instrumental var-iables, and the parallel between detectible instrumental variables and ex-perimental interventions.
2. Representing Causation.
2.1. Causal Graphs, Probability Distributions, and the Causal MarkovAxiom.
Recently from computer science, but as far back as Sewall Wrightin the early twentieth century (Wright 1934), the fundamental represen-tational device for causal systems is the directed graph. A directed graphis simply a collection of vertices and directed edges over pairs of thesevertices. In a directed graph interpreted as a causal graph, each directededge (or arrow) from one vertex
X
to another
Y
is taken to assert that
X
is a direct cause of
Y
relative to the set of vertices in the graph. Forexample, Figure 1 represents a graph , with vertices
V
p
G
p
A
V
,
E
S
{Exposure, Infection, Symptoms}, and edges
E
p
{
A
Exposure, Infection
S
,
A
Infection, Symptoms
S
}. We further assume that the vertices in such agraph can be interpreted as random variables with some probability dis-tribution
P
, and thatcausalprocesses situatedinsomedefinitebackgroundcontext generate probability distributions over these variables.
Add a Comment