Professional Documents
Culture Documents
Descriptive Statistics
• Descriptive statistics
• You collect data on the SAT scores of all 11th
graders in a school for three years. You can use
descriptive statistics to get a quick overview of
the school’s scores in those years. You can then
directly compare the mean SAT score with the
mean scores of other schools.
Inferential statistics
• Since the size of a sample is always smaller than the size of the
population, some of the population isn’t captured by sample
data.
• This creates sampling error, which is the difference between
the true population values (called parameters) and the
measured sample values (called statistics).
• Sampling error arises any time you use a sample, even if your
sample is random and unbiased. For this reason, there is always
some uncertainty in inferential statistics. However, using
probability sampling methods reduces this uncertainty.
Estimating population parameters from
sample statistics
• You want to know the average number of paid vacation days that
employees at an international company receive.
• After collecting survey responses from a random sample, you
calculate a point estimate and a confidence interval.
• Your point estimate of the population mean paid vacation days is
the sample mean of 19 paid vacation days.
• With random sampling, a 95% confidence interval of [16 22] means
you can be reasonably confident that the average number of
vacation days is between 16 and 22.
Hypothesis testing
• parametric or non-parametric.
Parametric tests
• Parametric? Yes
• What’s being compared? Means
• Samples: 2
ANOVA
• Parametric? Yes
• What’s being compared? Means
• Samples: 3+ samples
Mood’s Median
• Parametric? No
• What’s being compared? Medians
• Samples: 2+ samples
Wilcoxon Signed-rank
• Parametric? No
• What’s being compared? Distributions
• Samples: 2
Wilcoxon rank-sum (Mann-Whitney U)
• Parametric? No
• What’s being compared? Sums of Rankings
• Samples: 2
Kruskal-Wallis H
• Parametric? No
• What’s being compared? Mean rankings
• Samples: 3+ samples
Correlation tests
• Parametric? Yes
• Variables: Interval/ratio variables
Spearman’s r
• Parametric? No
• Variables: Ordinal/Interval/ratio variables
Chi square test of independence
• Parametric? No
• Varaibles: Nominal/ordinal variables
Regression Tests