Professional Documents
Culture Documents
Presentation Topics
1. Sampling Methods Population Sample Sampling Methods 2. Inferential Statistics Parametric Tests Nonparametric Tests
What is a population?
A population is the complete collection of specific types of elements such as scores, people, and other shared variables to be studied.
The first-year SUT undergraduate students enrolled in English I course in Trimester 1/2010.
What is sampling?
Sampling is the process of selecting a small number of elements from a larger target group of such elements so that the data gathered from the small group will allow judgments or claims to be made about the populations.
Sampling Frame
A sampling frame is an actual set of units from which a sample has been identified, and should cover all the sampling units in the population of interest.
Sampling Methods
Probability sampling
Nonprobability sampling
Probability Sampling
A sampling in which members of the population have equal chance (probability) of being selected. Nonprobability Sampling A sampling in which the chances (probability) of selecting members from the population are not equal.
Probability Sampling
Simple random sampling Systematic random sampling Stratified random sampling Cluster sampling
Nonprobability Sampling
Convenience sampling Judgment sampling Quota sampling
Advantages of SRS
Fair Unbiased
Disadvantages of SRS
over- or under-sampling no guarantee of getting good representatives
Stratified Sampling
The population is separated into homogeneous groups/segments/strata and a sample is taken from each. The results are then combined to get the picture of the total population.
Cluster sampling
method by which the population is divided into groups (clusters), any of which can be considered a representative sample
Convenience Sampling
A sample is obtained by selecting individual participants who are easy to approach.
Purposive Sampling
This method starts with a purpose in the researcher s mind, and the sample is thus selected to include participants of interest and exclude those who do not suit the purpose.
Quota Sampling
A sample is obtained by identifying subgroups to be included, then establishing quotas for individuals to be selected through convenience for each subgroup.
INFERENTIAL STATISTICS
Hypothesis and Hypothesis Testing Level of Significance Directional and Non-directional Hypothesis Testing Type I and Type II Error Parametric and Nonparametric Tests
Research Hypothesis
A hypothesis is an assumption about the population parameter.
A parameter is a characteristic of the population, like its mean or variance. The parameter must be identified before analysis.
Hypothesis Testing
Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test:
Null hypothesis (H0) Alternative hypothesis (HA) Test statistic Rejection region (the alpha level)
H 0 : Q1 ! Q 2
HA :
Q1 { Q 2
the most unlikely 5% (or 1%) of the sample means (the extreme values) is separated from the most likely 95% (99%) of the sample means (the central values).
Critical Region
Critical Value
Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis
Critical Value
Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis
Critical Value
Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis
Reject H0 Fail to reject H0
Two-tailed Test
Two-tailed Test
Two-tailed Test
H0: = 100 H1: { 100
Means less than or greater than
Two-tailed Test
H0: = 100 H1: { 100
E is divided equally between
the two tails of the critical region
100
Right-tailed Test
H0: e 100 H1: > 100
Right-tailed Test
Right-tailed Test
H0: e 100 H1: > 100
Points Right
Fail to reject H0 Reject H0
100
Left-tailed Test
Left-tailed Test
H0: u 100 H1: < 100
Points Left
Left-tailed Test
H0: u 100 H1: < 100
Points Left
Reject H0 Fail to reject H0
100
Type I Error
The mistake of rejecting the null hypothesis when it is true.
Example: Rejecting a claim that the group mean score equals 96 when the mean really does equal 96
Type II Error
the mistake of failing to reject the null hypothesis when it is false.
Example: Failing to reject the claim that the group mean score is 96 when the mean is really different from 96
Inferential Statistics
Parametric Tests
normal distribution ratio or interval scale random sampling T-test ANOVA Pearson s Chi-square
Nonparametric Tests
do not require normality ordinal or nominal scale
t-tests
Compute two sets of mean values 1. one sample t-test 2. two independent samples t-test 3. two paired (dependent) samples ttest
H0: The students TOEFL mean score is about 500. HA: The students TOEFL mean score is different from 500.
Output Data
Significant at p-value = .011, p < .05 Reject H0 The students TOEFL mean score is different from 500
Dependent-sample t-test
compares the means of individual participants in one group. pre-test posttest design
Example:
H0: There is no difference between the mean scores of the pre-test and posttest. HA: The students mean scores in the posttest is higher than those in the pre-test
Independent-sample t-test
examines whether the mean values of two independent groups are significantly different.
A researcher wants to know whether the students of his class perform better or worse than students in another class in an English final examination.
Research Hypothesis
H0 : There is no difference between the mean scores of the two classes. HA: The mean scores between two classes are different
One-Way ANOVA
The response variable is the variable you re comparing The factor variable is the categorical variable being used to define the groups We will assume k samples (groups) The one-way is because each value is classified in exactly one way Examples include comparisons by gender, race, political party, color, etc.
One-Way ANOVA
determines whether there is any significant difference of the mean values among sample groups
Why not repeated t-tests? 1. One-way ANOVA can handle the comparison for more than two groups in one time. 2. More tests done, higher risk of Type-I error.
Research Hypothesis
H0:
Non-parametric Test
Pearson s Chi-square
- Goodness-of-fit
test
Goodness-of-Fit Test
Compares observed frequencies within groups to their expected frequencies. HO = observed frequencies are not different from the expected frequencies. Research hypothesis: They are different.
Test of Independence
Review cross-tabulations (= contingency tables) Are the differences in responses of two groups statistically significantly different? One-way = observed vs expected Two-way = one set of observed frequencies vs another set.