Poli 30: Political Inquiry
Fall Quarter, 2012
Review
Correlation & Causation
Correlation is a relationship between two (or more) variables
Correlation does not imply causation
Requirements for establishing causality
Temporal ordering
Correlation
Mechanism
Confounds ruled out
Research Design & Hypotheses
Tests are designed in the face of constraints (e.g., data, time,
funding) and with different goals
Internal Validity vs. External Validity
Designs & validity
Controlled observational studies (- internal validity, + external validity)
Natural experiments (internal validity depends on selection; + external validity)
Randomized/lab experiments (+ internal validity, - external validity)
Hypotheses
Specific and measurable implications of theories
Variable Types & Measurement
Operation                        Nominal  Ordinal  Interval  Ratio
Count                            Yes      Yes      Yes       Yes
Calculate median & percentiles   No       Yes      Yes       Yes
Add or subtract                  No       No       Yes       Yes
Calculate mean & std. deviation  No       No       Yes       Yes
Calculate ratio                  No       No       No        Yes
Measurement error stems from differences between conceptual and
operational definitions
Systematic error: a chronic and consistent distortion in observations;
leads to bias in analysis
Random error: inconsistent distortions in observations; increases
noise but does not bias inferences
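The distinction can be illustrated with a quick simulation (a sketch with made-up numbers: a true value of 10, a constant bias of +2 for the systematic case, and pure noise for the random case):

```python
import random

random.seed(0)  # reproducible
true_value = 10.0

# Systematic error: every observation is shifted by a constant bias (+2 here),
# so the average is wrong no matter how much data we collect.
systematic = [true_value + 2.0 + random.gauss(0, 0.1) for _ in range(1000)]

# Random error: observations are noisy but unbiased,
# so the average stays close to the true value.
random_err = [true_value + random.gauss(0, 2.0) for _ in range(1000)]

mean_sys = sum(systematic) / len(systematic)  # near 12: biased
mean_rnd = sum(random_err) / len(random_err)  # near 10: just noisier
```

More data shrinks the random-error problem but never fixes the systematic one.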
Visualizations
Stem & Leaf Plots
Histograms
Bar Charts
Calculating a Confidence Interval
Confidence intervals contain a range of values based on a
sample that are likely to contain the actual population
parameter
CI for a mean: x̄ ± 1.96 × (s / √n)
CI for a proportion: p̂ ± 1.96 × √(p̂(1 − p̂) / n)
See Week 7 slides for more.
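The calculation can be wrapped in two short functions (a sketch; the inputs in the example, x̄ = 50, s = 10, n = 100, are made up):

```python
import math

def ci_mean(xbar, s, n, z=1.96):
    """95% confidence interval for a mean: xbar +/- z * (s / sqrt(n))."""
    se = s / math.sqrt(n)
    return (xbar - z * se, xbar + z * se)

def ci_proportion(p_hat, n, z=1.96):
    """95% confidence interval for a proportion."""
    se = math.sqrt(p_hat * (1 - p_hat) / n)
    return (p_hat - z * se, p_hat + z * se)

low, high = ci_mean(50.0, 10.0, 100)  # SE = 1, so 50 +/- 1.96
```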
T-Scores vs. Z-Scores
Z of 1.96 gives us 95% confidence with a two-tailed test
This means each tail contains 2.5% of data, given a normal
distribution
Use a t-score for small samples when comparing means
As sample size increases, t approaches z
At 100 or more observations, you can replace t with z
To find the correct value of t, calculate degrees of freedom
DF = Sample Size - 1
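The convergence of t toward z can be checked directly with scipy's quantile functions (an assumption: scipy is available):

```python
from scipy.stats import norm, t

z_crit = norm.ppf(0.975)            # two-tailed 95%: about 1.96
t_small = t.ppf(0.975, df=10 - 1)   # n = 10: about 2.26, noticeably wider
t_large = t.ppf(0.975, df=100 - 1)  # n = 100: about 1.98, close to z
```

At 100 observations the gap between t and z is already under 0.03, which is why the slides allow swapping in z.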
Z Table: note that this particular table gives the area in the body of the
distribution, rather than the tail. To get the tail, simply subtract this
value from 1.
T Table
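The body/tail relationship from the Z-table note can be verified with the normal CDF, built here from the standard-library error function:

```python
import math

def z_body(z):
    """Area under the standard normal curve below z (the 'body' in the table)."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

body = z_body(1.96)  # about 0.975
tail = 1 - body      # about 0.025: the 2.5% in each tail at 95% confidence
```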
Hypothesis Testing
One-sample proportion test (IV and DV are nominal)
Difference in proportions (IV and DV are nominal)
Difference in means (IV binary, DV interval)
Chi-square (IV and DV are categorical)
Regression framework (IV and DV are interval)
Steps
Identify H0 and H1
Choose test
Calculate key value (compare to z, χ², or t)
Interpret
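The four steps can be walked through for a difference-in-proportions test (made-up numbers: 55% support in one group of 400, 48% in another):

```python
import math

# Step 1: H0 is p1 = p2; H1 is p1 != p2
p1, n1 = 0.55, 400
p2, n2 = 0.48, 400

# Steps 2-3: difference-in-proportions z test with a pooled standard error
p_pool = (p1 * n1 + p2 * n2) / (n1 + n2)
se = math.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
z = (p1 - p2) / se

# Step 4: interpret by comparing |z| to 1.96 (95%, two-tailed)
reject_h0 = abs(z) > 1.96
```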
Crosstabs, Summary Tables & Controlled Comparisons
Cross-tabulation
Simple test where IV and DV are nominal or ordinal
Summary Table
IV is nominal or ordinal, DV is interval or ratio
Controlled Comparison
Create crosstabs based on different levels of confound
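With pandas (an assumption that it is available), a controlled comparison is just one crosstab per level of the confound; the tiny dataset here is invented for illustration:

```python
import pandas as pd

# Hypothetical data: vote choice (DV), income (IV), region (possible confound)
df = pd.DataFrame({
    "income": ["low", "low", "high", "high", "low", "high", "low", "high"],
    "vote":   ["A",   "A",   "B",    "B",    "A",   "B",    "B",   "A"],
    "region": ["N",   "N",   "N",    "N",    "S",   "S",    "S",   "S"],
})

# Simple crosstab: IV against DV
simple = pd.crosstab(df["income"], df["vote"])

# Controlled comparison: one crosstab for each level of the confound
controlled = {region: pd.crosstab(grp["income"], grp["vote"])
              for region, grp in df.groupby("region")}
```

If the income-vote pattern vanishes within each region, the original relationship is spurious; if it survives, region is not doing the work.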
Causal Relationships
Spurious
IV's effect on DV disappears when controlling for confound
Additive
IV's effect on DV remains stable when adding confound into
analysis
Interactive
IV's effect on DV increases or decreases (but remains significant)
when confound is included
Regression Analysis (I)
Y = a + b1·X1 + b2·X2 + e
Provides a way to analyze the effect of your IV on your DV,
especially when controlling for one or more additional IVs
In a typical regression table, coefficients (b), standard errors
(SE), t-values (t), and an indication of significance (P>|t|) are
given
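The coefficient b at the heart of the table can be computed directly; a minimal numpy sketch with invented data:

```python
import numpy as np

# Hypothetical data: one IV (x) and one DV (y)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

# OLS slope: b = cov(x, y) / var(x); intercept from the means
b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
a = y.mean() - b * x.mean()
# A 1-unit increase in x is associated with a b-unit change in y
```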
Regression Analysis (II)
Functional link between DV and IVs varies based on the type
of DV
Nominal DV: Probit or Logit/Logistic Regression
Ordinal DV: Ordered Probit/Logit OR Multinomial Probit/Logit
Interval/Ratio DV: Linear Regression
When in doubt, ASK
Regression Analysis (II)
Interpreting coefficients
For each of these, sign indicates the same directional
relationship, and determining significance is done in the same
way
Further interpretation
Linear regression: b is the slope, the change in Y attributed
to X
i.e., a 1-unit increase in X results in a b-unit change in Y
Other forms of regression: this is a huge mess, so don't worry
about it unless you want to take years and years of econometrics
Regression Analysis (III)
Determining significance of a coefficient
Compare t to relevant t/Z score for 95% significance
Other levels of significance are acceptable (e.g., 90%, 99%)
If presented with P>|t|, simply compare to 0.05 (1 − 0.95)
For other levels of significance, this might be 0.1 or 0.01
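The comparison is mechanical enough to write down; the two p-values below are taken from the regression tables that follow:

```python
def significant(p_value, level=0.95):
    """Reject H0 when the reported p-value is below 1 - level."""
    return p_value < (1 - level)

# P>|t| = 0.160 for Total Revenue in the linear regression: not significant
# P>|z| = 0.019 for Productivity in the logit: significant at 95%, not at 99%
```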
Regression Analysis (IV)
. regress exports revt emp ppent, robust cluster(naics)

Linear regression                               Number of obs =     6677
                                                F(3, 843)     =     1.44
                                                Prob > F      =   0.2285
                                                R-squared     =   0.0159
                                                Root MSE      =   312.23

                          (Std. Err. adjusted for 844 clusters in naics)
------------------------------------------------------------------------------
             |               Robust
     exports |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
Total Revenue|   .0029434   .0020906     1.41   0.160      -.00116    .0070467
   Employees |   .2512895   .4425897     0.57   0.570    -.6174177    1.119997
Phys. Capital|  -.0009527   .0034821    -0.27   0.784    -.0077873    .0058819
   Intercept |    6.60017   4.801907     1.37   0.170    -2.824926    16.02527
------------------------------------------------------------------------------
Regression Analysis (V)
. logit procafta atfp tsale exporter, robust cluster(naics)

Iteration 0:  log pseudolikelihood = -111.17378
Iteration 1:  log pseudolikelihood = -107.81316
Iteration 2:  log pseudolikelihood = -100.04385
Iteration 3:  log pseudolikelihood = -99.262411
Iteration 4:  log pseudolikelihood = -99.244544
Iteration 5:  log pseudolikelihood = -99.244511

Logistic regression                             Number of obs =     6138
                                                Wald chi2(3)  =    21.68
                                                Prob > chi2   =   0.0001
Log pseudolikelihood = -99.244511               Pseudo R2     =   0.1073

                          (Std. Err. adjusted for 832 clusters in naics)
------------------------------------------------------------------------------
             |               Robust
    procafta |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
Productivity |   .3844836    .164192     2.34   0.019     .0626732     .706294
 Total Sales |   .0188455   .0050635     3.72   0.000     .0089213    .0287697
    Exporter |   1.403025   .7771746     1.81   0.071    -.1202088     2.92626
   Intercept |  -7.956071    .753366   -10.56   0.000    -9.432642   -6.479501
------------------------------------------------------------------------------