Professional Documents
Culture Documents
Data Analysis:
Simple Statistical Tests
Goals
Understand confidence intervals and p-
values
Learn to use basic statistical tests
including chi square and ANOVA
Types of Variables
Types of variables indicate which estimates you can
calculate and which statistical tests you should use
Continuous variables:
Always numeric
Generally calculate measures such as the mean, median and
standard deviation
Categorical variables:
Information that can be sorted into categories
Field investigation – often interested in dichotomous or
binary (2-level) categorical variables
Cannot calculate mean or median but can calculate risk
Measures of Association
Strength of the association between two variables,
such as an exposure and a disease
Two measure of association used most often are the
relative risk, or risk ratio (RR), and the odds ratio
(OR)
The decision to calculate an RR or an OR depends on
the study design
Interpretation of RR and OR:
RR or OR = 1: exposure has no association with disease
RR or OR > 1: exposure may be positively associated with
disease
RR or OR < 1: exposure may be negatively associated with
disease
Risk Ratio or Odds Ratio?
Risk ratio
Used when comparing outcomes of those who were exposed
to something to those who were not exposed
Calculated in cohort studies
Cannot be calculated in case-control studies because the
entire population at risk is not included in the study
Odds ratio
Used in case-control studies
Odds of exposure among cases divided by odds of exposure
among controls
Provides a rough estimate of the risk ratio
Analysis Tool: 2x2 Table
Commonly used with dichotomous
variables to compare groups of people
Table puts one dichotomous variable
across the rows and another
dichotomous variable along the columns
Useful in determining the association
between a dichotomous exposure and a
dichotomous outcome
Calculating an Odds Ratio
Table 1. Sample 2x2 table for Hepatitis A at Restaurant A
Outcome
No Hepatitis
Hepatitis A Total
A
Ate salsa 218 45 263
Exposure
Did not eat
21 85 106
salsa
Total 239 130 369
Yes No Total
Tomatoes 41 89 130
Yes No Total
Tomatoes 130
No Tomatoes 170
Yes No Total
Yes No Total
YES NO
Difference between groups is Difference between groups is Difference between groups is Difference between groups is
statistically significant NOT statistically significant statistically significant NOT statistically significant
Conclusion
In field epidemiology a few calculations and
tests make up the core of analytic methods
Learning these methods will provide a good
set of field epidemiology skills.
Confidence intervals, p-values, chi-square tests,
ANOVA and their interpretations
Further data analysis may require methods to
control for confounding including matching
and logistic regression