Professional Documents
Culture Documents
Steven Kou
Department of Finance
Questrom School of Business
Boston University
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
Chapter 11
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
11.2 Bias and Causal Inference
Three major problems in statistics are: (1) How to design experiments and
collect data that better and e¢ ciently address the questions of interest (Exper-
imental Design and Sampling Design). (2) How to describe the major features
and detect patterns in the data (Exploratory Data Analysis and Data Min-
ing). (3) How to account for sampling variability and bias and draw reliable
conclusions from the data (Statistical Inference).
Much of the statistics is about investigate and control the variability in the
third problem. To study variability, one can construct 95% con…dence intervals,
and conduct statistical hypothesis testing.
The key goal of causal inference for observational data is, however, about
reducing the bias. The biased samples can appear in many di¤erent ways. For
example, in 1983, A national television news program invited its viewers to
participate in a “phone-in” on the issue whether the U.N. should continue to
be based in the U.S. The phone-in result was: yes, 33% and no 67%, with a
sample size 180,000. However, a more scienti…c survey several days later based
on only about 1,000 random samples revealed that about 78% of people in U.S.
thought that the U.N. should continue to be based in the U.S. The recent U.S.
presidential elections also revealed the potential bias in public opinion pools
In …nance, the self-selection biases refers to the biases arising when data
availability leads to certain subsets of stocks (assets) being excluding from the
analysis. For example, if we look at the stock returns within U.S., we may have
a survival bias. Namely failing stocks during the past are not included; thus
the observed return numbers will be somewhat higher. By the way, which was
the 3rd largest stock exchanges in world in 1900? The answer may surprise
some of you. Survival bias is of particular concern when we analysis high tech
stocks. For a detailed analysis of survival bias, see, Brown, Goetzmann, and
Ross (Journal of Finance, 1995), and Li and Xu (2002, Journal of Finance).
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
Example (b): Green …nance. Corporate social responsibility (X), …nancial
performance (Z), buy or sell by institutional investors (Y ).
3. Collider.
Example (a) Ascertainment bias. Beauty (X), acting talent (Y ), and Holly-
wood stars (Z).
Example (b) Selection bias. Economic conditions (X), heights of enlisted
men (Y ), civilian employment (Z).
Example (c) Non-response bias. Social status (X), marriage outcome (Y ),
migration to another city (Z).
If conditioning on collider, it may lead to selection bias.
4. Feedback. This is the most di¢ cult type for causal inference. More
research is needed here, as this is related to directed cyclic graphs (rather than
directed acyclic graphs) or non-recursive structural equation models.
More complicated causal can be shown in Figures 2 and 3.
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
(3) One has to be careful with the adjustment with conditioning variables and
do the conditioning carefully. Sometimes, one has to do conditioning (e.g. in the
presence of confounding variables), and other times, one cannot do conditioning
(e.g. in the presence of colliders).
(4) One has to separate direct and indirect e¤ects (as in the mediation
analysis).
(5) Statistics alone cannot solve causal inference problems. One has to draw
upon the expertise from the related domain …eld to propose proper causal struc-
tures (e.g. drawing DAG graphs, setting up structural equations), before choos-
ing proper statistical tools. In other words, draw (sometime literarily as in DAG
graphs) assumptions before reach conclusion.
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
There are two fund managers A and B, and suppose that manager A is a
much better stock picker than manager B, in every categories. More precisely,
However
11.6.1 D-separation
A path from X to Y is any consecutive sequence of nodes and edges that the
start and end notes are X and Y , ignoring the direction.
We de…ne a path to be either blocked or open according to the following
graphical rules.
1. If there are no variables being conditioned on, a path is blocked if and
only if two arrowheads on the path collide at some variable on the path (i.e.
there is a collider on the path and the collider is not conditioned). For example,
in Figure 2 the path X, Z1, Y is blocked.
2. Any path that contains a non-collider that has been conditioned on is
blocked. For example, in Figure 3 conditioning on Z1, the path X, Z1, and Y
is blocked.
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
3. A collider that has been conditioned on does not block a path. For
example, in Figure 2, conditioning on Z1 the path X, Z1, Y is open.
4. A collider that has a descendant that has been conditioned on does not
block a path. For example, conditioning on Z3 in Figure 3 the path X, Z2, and
Y is open.
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
OSX-L and (1 p2 ) in SOXX-L. Under what conditions on p1 and p2 does the
Simpson’s paradox appear?
2. For Example B (asset allocation), calculate the average causal e¤ect of
two portfolio managers on portfolio returns.
Y := + X + ":
Note that, unlike the standard equation Y = + X + ", this does not imply
that
X := 0 + 0 Y + "0 :
For example, Y can be the grade a student may get and X can be the hours
studied. By the equation, increase one hour of study can on average increase
the grade by . But the reversed causality is not true, i.e. if a teacher adds one
point to the score, does not change how many hours the student has studied.
See pp. 9-12 of Dablander (2020).
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
In other words, the overall tax rate increased, despite the tax rates were
lower in each category. The main reason is that there were a lot more people
belong to middle and high income categories, possibly due to the high in‡ation
during the time period, as the cuto¤ points for the tax brackets did not change
from 1974 to 1978. For example, due to the in‡ation, a person used to belong
to the low income category in 1974 might belong to the middle income category
in 1978, resulting in a tax rate at 15.9% in 1978 instead of 9.5% in 1974.
This is related to dynamic Simpson’s paradox, as we are comparing two
outcomes at di¤erent time periods. The causal inference for dynamic data is
much more complicated, as the causal relationship and the magnitude of causal
e¤ects (including the possibility of delayed e¤ects) can change from time to
time.
This study source was downloaded by 100000880261197 from CourseHero.com on 02-06-2024 06:17:54 GMT -06:00
https://www.coursehero.com/file/115499588/lec11Causal1pdf/
Powered by TCPDF (www.tcpdf.org)