Professional Documents
Culture Documents
Exam 3 Review Final LPC
Exam 3 Review Final LPC
H C D 3 0 0 – B I O S TA S T I S T I C
WHAT WILL BE COVERED?
• UNIT 6:
– CHAPTER 11 & 12: TWO SAMPLE T-TEST (6.1 & 6.2)
– CHAPTER 13 & 14: ANOVA (6.3 & 6.4)
• UNIT 7:
– CHAPTER 15: CORRELATION AND SHARED VARIANCE (7.1)
– CHAPTER 16: LINEAR REGRESSION (7.2)
– CHAPTER 17: CHI-SQUARE (7.3 & 7.4)
T-TEST OR STUDENT’S T-TEST
• One of the most commonly
used techniques for testing Z – TEST T - TEST
a hypothesis * Sample size over 30 * Sample size under 30
• Determines a probability * σ is known * σ is unknown
that two groups are the * not used often in * used most often in
same with respect to the practice, because σ is practice
variable tested rarely known, and
when sample size is
• Can be used to compare >30, the t-distribution
the mean of one sample mirrors z- distribution
and a population or to
compare the mean of two
samples
TWO SAMPLE T-TEST
• A two-sample t-test is used to examine • Independent Samples: If values in one
differences between two samples sample reveal no information about or
• Two-sample t-tests have the following are unrelated to those of the other
assumptions: sample. For independent samples, you
have to specify variance of the two
– Random sampling or selection
groups.
– Interval or ratio scale of
• Dependent samples: If the values in
measurement
one sample affect or are related to the
– Normality values in the other sample.
• Sample type : Independent or • As a general rule, you can assume
dependent sample equal variance if the ratio of the
• Variance : Equal or unequal sample variances (s12/s22) is between
0.5 and 2.
TWO-SAMPLE TEST FOR
INDEPENDENT SAMPLES
TWO-SAMPLE T-TEST: INDEPENDENT SAMPLES
• Compare the difference in means between two groups
• Groups do not need to be equal sizes
• Can be calculated by hand, excel, and online calculators
X1 X 2
t
(n1 1) s12 (n2 1) s22 n1 n2
n1 n2 2 nn
1 2
• X 1 mean for Group 1
• X 2 mean for Group 2
• n1 number of participants in Group 1
• n2 number of participants in Group 2
• s12 variance for Group 1
• s22 variance for Group 2
CALCULATED BY EXCEL FUNCTION
• Three Methods:
– Scatterplot
– Functions
• CORREL
– Data Analysis Toolpak
THE CORRELATION COEFFICIENT
ŷ b0 b1 x
LINEAR REGRESSION - ASSUMPTIONS
• Used for continuous variables
• Assumptions:
– Linear relationship
– Multivariate normality
• Data is normally distributed
– No or little multicollinearity
• Two independent variables are not highly correlated
– No auto-correlation
• Correlation at different time points (usually from multiple observations of the same subject)
– Homoscedasticity
• Similar variance among observations
CALCULATING LINEAR REGRESSION: EXCEL -
SCATTERPLOTS
Step 1 Step 2
Step 3
C A L C U L AT I N G L I N E A R
REGRESSION: EXCEL
FUNCTIONS: SLOPE
Computes the slope of the line
Step 1 Step 2
Step 3
Step 4
C A L C U L AT I N G L I N E A R
REGRESSION: EXCEL
FUNCTIONS: INTERCEPT
Computes the
location where the
regression line
crosses the y-axis
* The bO in the linear
regression equation
Step 1
Step 2
Step 3
Step 4
CALCULATING LINEAR REGRESSION:
EXCEL : DATA ANALYSIS TOOLPAK
Step 1 Step 2
Step 3
LINEAR REGRESSION INTERPRETATION:
• For every one unit increase in x, there is a corresponding
_____ increase/reduction in y
PREDICTION ERRORS AND MULTIPLE
REGRESSION
PREDICTION ERRORS MULTIPLE REGRESSION
Standard error of estimate Multiple Regression formula:
The measure of how much each Y ' b1 X 1 b2 X 2 a
data point (on average) differs from Y’: the value of the predicted score
the predicted data point or a
X1: the value of the first independent variable
standard deviation of all the error
scores X2: the value of the second independent variable
Tells you how wrong the regression b: the regression weight for each variable
model is on average using the units When using multiple predictors your independent
of the response variable (y, variables (X) should be related to the dependent
dependent variable) variable (Y)
The higher the correlation between The independent variables should not be related
two variable (and the better the
prediction), the lower the error will
be
CHI SQUARE
PARAMETRIC VS. NONPARAMETRIC TESTS
Parametric Tests Nonparametric Tests
• Make assumptions: • Do not make same
– Tests being used to assumptions:
estimate a population – Not attempting to
parameter estimate parameters
– Populations based on – No assumption of
normal distributions normality “distribution
– Samples are also free”
normally distributed – Can be used for nominal
• Used for interval/ratio or data
ordinal data
SO FAR WE HAVE LEARNED
Parametric Test Nonparametric Test
O E
2
Degrees of freedom:
2
Degrees of freedom approximate the number
E
of categories in which data has been
X2: chi-square value organized
𝜮: summation sign For chi square:
df = c – 1; where c is the number of
O: observed frequency categories
E: expected frequency Expected values (two methods):
Evenly divide the total among
categories
Based on population distribution
CALCUL ATING ONE-WAY CHI-SQUARE:
EXCEL FUNCTIONS
CALCUL ATING ONE-WAY CHI-SQUARE:
EXCEL FUNCTIONS
CALCUL ATING ONE-WAY CHI-SQUARE:
EXCEL FUNCTIONS
CALCUL ATING ONE-WAY CHI-SQUARE:
EXCEL FUNCTIONS
*note that the CHISQ.TEST result is not the chi-square statistic, it is the
p-value*
TWO-WAY CHI SQUARE – TWO DIMENSIONS
O E
2
2
• Degrees of freedom:
E • Degrees of freedom approximate the number
of categories in which data has been organized
X2: chi-square value
• For chi square: df = (r – 1)(c – 1); here r is
𝜮: summation sign the number of rows and c is the number of
O: observed frequency categories
*note that the CHISQ.TEST result is not the chi-square statistic, it is the p-value*
STATISTICAL SIGNIFICANCE AND
INTERPRETATION OF CHI-SQUARE
STATISTICAL SIGNIFICANCE INTERPRETATION
Compare p-value to _____. (2)
2
20.6, p .05
Compare critical value to _____. X2(2): represent the test statistic
Can look up critical values and p- with two degrees of freedom
values in a table or use an online 2: number of degrees of freedom
calculator 20.6: the obtained value
p<0.05: the probability
MEASURE OF EFFECT FOR CHI-SQUARE
PHI CRAMER’S V
o Measures the relationship between o Used for tables which have more
two binary variables than 2x2 rows and columns
o Applies to 2x2 nominal tables only o n = total observations
o Interpreted like a correlation o k = number of columns or rows
coefficient (whichever is smaller)
o Can determine the amount of
shared variance
o N = total observation
INTERPRETING PHI AND CRAMER’S V
PHI CRAMER’S V
Effect Size for Phi Effect Size for Cramer’s V
df Small Medium Large
small .10 to .29 1 .10 to .29 .30 to .49 ≥ .50
2 .07 to .20 .21 to .34 ≥ .35
medium .30 to .49
3 .06 to .16 .17 to .28 ≥ .29