Professional Documents
Culture Documents
A. Chapter 9 (SD) (Hybrid)
A. Chapter 9 (SD) (Hybrid)
The Good News? You already did a lot of this in Methods One,
so most of this chapter will simply be a review!
Overview of this Chapter
In this chapter, we will focus on the underlying mechanics of study
data, including …
Descriptive Statistics
Descriptive Statistics
Descriptive Statistics
This chapter will give a brief overview of the statistics:
– Statistics is a branch of mathematics that involves the
collection, analysis, and interpretation of data
– Descriptive Statistics involve those procedures used to
summarize or “describe” data
– Inferential Statistics are procedures used to analyze data
after an experiment is completed in order to determine
whether the independent variable has a significant effect
Descriptive Statistics
Descriptive Statistics – In this section, we will look at some new
items and review some old items.
The new items include …
– 1). Scales of Measurement
Nominal Scale
Ordinal Scale
Interval Scale
Ratio Scale
Descriptive Statistics
Descriptive Statistics – In this section, we will look at some new
items and review some old items.
The old items include …
– 2). Measures of Central Tendency
Mean
Median
Mode
Those expecting
2 10 18
a mild shock
Descriptive Statistics
1). Scales of Measurement – NOIR
A. Nominal scales
– In the Schracter study, we could randomly assign values to
the levels of the dependent variable
1 = Wait alone, 2 = Wait with others, 3 = Don’t care
– But keep in mind that these numbers are arbitrary in a
nominal scale. We could just as easily have
1 = Don’t care, 2 = Wait alone, 3 = Wait with others OR
1 = Wait with others, 2 = Wait alone, 3 = Don’t case
– Levels differ in quality (type), not value
Descriptive Statistics
1). Scales of Measurement – NOIR
B. Ordinal scales involve variable levels that differ in terms of
rank (value!). For example, think about Marathon Racers
– In a marathon, you have first, second, and third place. You
know the finishing order, but ordinal scales do not give you
any info beyond rank order.
– The first and second place runners
may have finished seconds apart
while second place finished a full
five minutes ahead of third place
All you know is rank order. Distance between is unknown
Descriptive Statistics
1). Scales of Measurement – NOIR
B. Ordinal scales – You might ask a question like:
– If a child ranks 5 toys and is given the one ranked # 3, will he still
play with it in a week?
The original ranking doesn’t tell you how good the toy is
– What about small appliances? Brehm was curious about
appliance rankings, so he conducted a study …
A woman ranked eight small household appliances and then
received an appliance of her choice as a gift (albeit a mid-
ranked appliance, one she ranked 3rd 4th or 5th)
Descriptive Statistics – Ordinal Scale
1). Scales of Measurement – Brehm & Appliance Attractiveness
{
Scores on the ABCs Quiz S4 16
S5 18
What is the Median for the S7 22
# Correct? Yeah, let’s put S1 23
them in order to make this S3 24
easier …
S2 24
{
S8 25
The median is 24
S9 26
– Half the scores fall
S6 27
above; half below
S10 29
The Median # Correct
{
Scores on the ABCs Quiz S4 16
S5 18
What is the Median for the S7 22
# Correct? Yeah, let’s put S1 23
them in order to make this S3 24
easier …
S2 24
{
S8 25
Now, let’s change one of
S9 26
those middle numbers …
S6 27
S10 29
The Median # Correct
Scores on the ABCs Quiz S4 16
S5 18
Now what is the Median S7 22
for the # Correct? S1 23
S3 23
S2 24
S8 25
S9 26
S6 27
S10 29
The Median # Correct
{
Scores on the ABCs Quiz S4 16
S5 18
Now what is the Median S7 22
for the # Correct? S1 23
– Add middle numbers S3 23
23 + 24 = 47 S2 24
{
– Divide by two S8 25
47 / 2 = 23.5 S9 26
S6 27
S10 29
The Median # Correct
Scores on the ABCs Quiz S4 16
S5 18
Okay, back to our original S7 22
data set (median of 24). S1 23
S3 24
Let’s add another student S2 24
this time … S8 25
S9 26
S6 27
S10 29
# Correct
The Median S11 8
Scores on the ABCs Quiz S4 16
S5 18
Now, given an 11th student, S7 22
what is the Median for the #
S1 23
Correct =
S3 24
S2 24
S8 25
S9 26
S6 27
S10 29
# Correct
{
The Median S11 8
Scores on the ABCs Quiz S4 16
S5 18
Now, given an 11th student, S7 22
what is the Median for the #
S1 23
Correct = 24
S3 24
{
S2 24
S8 25
S9 26
S6 27
S10 29
# Correct
The Median S11 8
Scores on the ABCs Quiz
S4 16
S5 18
Let’s focus on that new kid
(S4), who just didn’t seem to S7 22
get it. S1 23
S3 24
Does his score of 8 impact our S2 24
distribution when it comes to
the median?
S8 25
S9 26
S6 27
S10 29
New Original
The Median
S11 8 –
Scores on the ABCs Quiz
S4 16 16
Let’s focus on that new kid (S4), S5 18 18
who just didn’t seem to get it. S7 22 22
S1 23 23
Not really! The new median, like
the original, is still 24 S3 24 24
S2 24 24
Consider bigger differences S8 25 25
between two data sets …
S9 26 26
S6 27 27
S10 29 29
Varied Unvaried
The Median S11 2 19
Scores on the ABCs Quiz S4 3 20
S5 3 21
These two data sets are S7 5 22
very different, but they still
S1 7 23
have the same median (24)
S3 24 24
Consider both data sets S2 29 25
from the perspective of the S8 30 26
mean (You remember the S9 30 27
mean, right?) S6 30 28
S10 30 29
Descriptive Statistics
2). Measures of Central Tendency – The Mean
C. The mean is the average of all scores in a distribution
– This value is dependent on each score in a distribution
– The mean is the most widely used and informative measure
of center, and is often expressed as:
∑χ
M=
n
Non-normal data
140
120
100
80
60
40
20
0
None 1 Year 2 years 3 years 4 years 5 years
Histogram Showing a Negative Skew
Non-normal data
140
120
100
80
60
40
20
0
None 1 Year 2 years 3 years 4 years 5 years
Histogram Showing a Bimodal Pattern
Bimodal data
Descriptive Statistics
2). Measures of Central Tendency – When should you use each?
Recall this example from last semester. Imagine a real estate
agent has the following sales (30 = 30,000)
– Set A: 30 40 50 60 70. Mean = 50 and median = 50
– Set B: 30 40 50 60 700. Mean = 176 and median = 50
#2
Graphing Your Results?
Why did I skip “Graphing Your Results”, even though it is covered
in Smith and Davis (Chapter 9)?
Measures of Variability
Measures of Variability
Measures of Variability (or Measures of Spread)
In research, it is rare and unlikely that all participants produce
identical data. Even if two conditions give you similar means,
the distributions within those samples may vary a great deal
Who is the better real estate agent? Think about houses sold
– Ted’s sales: $100,000, $550,000, $75,000, $300,000, $110,000
– Bill’s sales: $200,000, $250,000, $225,000, $240,000, $220,000
Who is the better real estate agent? Think about houses sold
– Ted’s sales: $100,000, $550,000, $75,000, $300,000, $110,000
– Bill’s sales: $200,000, $250,000, $225,000, $240,000, $220,000
– You see that Ted’s sales have huge outliers (a very high and a
very low amount). It has more variability than Bill’s sales
Measures of Variability
Calculating and Computing Statistics
While measures of center are easy, measures of variability (as
you saw last semester) are a bit more complicated to calculate
It’s time to remind you about the range, variance, and standard
deviation!
Measures of Variability
Measures of Variability (or Measures of Spread)
Variability refers to the extent to which scores in a distribution
spread out around the mean
– When variability is small, the scores are clustered together
– When variability is large, there are big differences between
individuals, and scores are spread across a range of values
Measures of Variability
Measures of Variability (or Measures of Spread)
Variability refers to the extent to which scores in a distribution
spread out around the mean
-5 Range +5 Range
Measures of Variability
1). The Range
So, about that graph on the prior slide …
– All three curves (black, blue, and red) have the same range
(-5 to +5), but their distributions look different. This should
give us some pause about relying solely on the range …
∑ (X – M)2
S2 =
n–1
Measures of Variability
2). Variance
Variance is a single number that represents the total amount of
variation. The higher the number, the greater the spread
∑ (X – X)2
S2 =
n–1
Measures of Variability
2). Variance
Variance is a single number that represents the total amount of
variation. The higher the number, the greater the spread
∑ (X – X)2
S2 =
n–1
Pop Quiz: Pre-Class Quiz Question #13
When do we use N and when do we use n – 1 for the standard
deviation calculation?
A). Use N for the sample and n – 1 for the population
B). Use N for the population and n – 1 for the sample
C). You can use either, as they result in the same outcome
D). You should actually use n + 1 over either of these options
E). Use N when your data is randomly assigned and n – 1 when
the data is not randomly assigned
Measures of Variability
2). Variance
Variance is a single number that represents the total amount of
variation. The higher the number, the greater the spread
∑ (X – X)2
S2 =
n–1
Measures of Variability
2). Variance
Variance is a single number that represents the total amount of
variation. The higher the number, the greater the spread
SD = √ s2
SD = √ 11
SD = 3.32
Measures of Variability
3). Standard Deviation
The standard deviation is the square root of the variance
– Unlike variance, the standard deviation is expressed in the
same unit as the original numbers (and is thus more useful)
This is the most widely used measure of spread
A). 30 to 40
B). 40 to 50
C). 40 to 60
D). 50 to 60
E). 50 to 70
Pop Quiz: Pre-Class Quiz Question #16
Do you want the standard deviation to be a big number or a small
number?
A). Big
B). Small
C). It doesn’t really matter
D). None of the above
Measures of Variability
Pause-Problem #3 (Outliers)
We saw earlier in this presentation (as well as in Methods One)
that high or low outliers can screw up the mean score. I want
you to come up with two possible ways that you can reduce the
impact of outliers
– This one is tough, so feel free to get creative!
#3
Measures of Variability
4). The Semi-Interquartile Range
As you probably know, both the range and standard deviation are
sensitive to extreme scores
– In such cases, researchers may choose to look at the semi-
interquartile range instead. The S-IR is …
less sensitive than the range to extreme scores
used when you want a simple, rough estimate of spread
Minimum 19
Measures of Variability
4). The Semi-Interquartile Range and The Boxplot
Some of you may have some across a Boxplot before. This is a
graphic representation of the five number summary
– First and third quartile numbers define the ends of the box
– A line inside the box represents the median
– Vertical “whiskers” extending above and below the box and
represent the maximum and minimum scores (respectively)
– For example …
Example of a Boxplot
Example of a Boxplot
Median
Median
Minimum Score
Measures of Variability
4). The Semi-Interquartile Range and The Boxplot
Some of you may have some across a Boxplot before. This is a
graphic representation of the five number summary
– Here is data from multiple treatments, represented by side-
by-side boxplots
Example of a Boxplot (left) and
Side-By-Side Boxplot (right)
A Prior Quiz - Five Number Summary
Maximum 20
Median (Middle)) 15
Minimum 9
Box Plot – A Prior Methods Two Quiz!
Pop Quiz: Pre-Class Quiz Question #17
In this boxplot, what is the median?
A). 150
B). 350
C). 420
D). 490
E). 920
Pop Quiz: Pre-Class Quiz Question #18
In this boxplot, what is the total range?
A). 140
B). 300
C). 650
D). 770
E). 1000
Pop Quiz: Pre-Class Quiz Question #19
In this boxplot, what is the semi-interquartile range?
A). 140
B). 300
C). 650
D). 770
E). 1000
Pop Quiz: Pre-Class Quiz Question #19
In this boxplot, what is the semi-interquartile range?
A). 140
B). 300
C). 650
D). 770 Hint!
E). 1000
Part Three
Correlations (Brief)
Correlation
I know Smith and Davis talk about correlations in Chapter 9, but
you already know a lot about correlations (
Reread Chapter 5 in Salkind “Ice-Cream and Crime” if you
need a quick reminder!)
No test questions on correlations for this chapter, but we will
return to correlations later this semester when we look at
Salkind Chapter 16
Part Four
Inferential Statistics
Inferential Statistics
Inferential Statistics
Thus far, we have talked about ways to describe data (through
measures or spread and center) using descriptive statistics
A t-value of 1.80 is significant for a one tailed but not a two-tailed test
Inferential Statistics
3). One vs. Two Tail Tests of Significance
So, if you have a better shot at finding significance with a one-
tailed test, why bother with a two-tailed test?
– One-tailed tests give up a lot of valuable information about
the untested “other” end of the curve.
– If your hypothesis is that the new GRE program does better
than an alternative program, you cannot later conclude that
it did worse than the alternative program (all you can say is
that it did no better than the alternative)
Thus you reject the alternative hypothesis and retain the
null hypothesis
Inferential Statistics
3). One vs. Two Tail Tests of Significance
So, if you have a better shot at finding significance with a one-
tailed test, why bother with a two-tailed test?
– Often, psychologists do not know what the outcome will be
(which is why they did the study!), so a two-tailed test lets
them fully test all potential study outcomes
A clinician testing a new ADHD therapy, for example,
may want to know if the new therapy helps the patient
while also knowing if it harms the patient. She would thus
need to run a two-tailed test
Inferential Statistics
3). One vs. Two Tail Tests of Significance
Choosing a test
– It is highly recommended that researchers pick which test
they will be using BEFORE data collection begins
“Fraud” issues
– But what if Time 1 you got 1140 and Time 2 you got 1340?
Something other than chance is probably going on here, right?
Like what?
Inferential Statistics
4). The Logic of Significance Testing
The logic behind significance testing is that we want to see if
differences between our control group and our experimental group are
due to chance or due to the addition of our IV
– That is …
Inferential Statistics
4). The Logic of Significance Testing
You can tell if something is statistically significant by looking at
the probability that two different samples drawn from the same
population would differ by the observed amounts (means)
– If the chance that they would differ is very small (that is, it is
unlikely that two samples this different could be drawn from
the same population), then you have statistical significance
– To put it another way, two groups drawn from the same
population will act a lot alike, but there will be some natural
variation. However, if you give a treatment to only one of the
groups, variation will emerge beyond “natural” fluctuations
Inferential Statistics
4). The Logic of Significance Testing
Suppose we had a “population” of Homer Simpsons
Inferential Statistics
4). The Logic of Significance Testing
If we drew two Homer Simpsons at random, they should look a
lot alike (although probably not identical – remember, there will
be some natural variation among population members, so the
sample members will also have some natural variation). Or …
Inferential Statistics
4). The Logic of Significance Testing
The Population
The Sample
– Sample “features” fall within the
range of population “features”
– Any differences in the sample
members (1 & 2) are by chance
Inferential Statistics
4). The Logic of Significance Testing
The Population
The Sample
– Sample “features” fall within the
range of population “features”
– Any differences in the sample
members occur by chance
Inferential Statistics
4). The Logic of Significance Testing
Now …
The Sample
– Expose them to the IV (control
vs. treatment groups)
– The IV should create different
outcomes based on IV features
Inferential Statistics
4). The Logic of Significance Testing
The Population
– Population should reflect the
same differences IF exposed
to the IV as well
The Sample
– Expose them to the IV (control
vs. treatment groups)
– The IV should create different
outcomes based on IV features
Inferential Statistics
4). The Logic of Significance Testing
The point here is that we to try to
generalize from the sample back
to the population (increasing our
external validity)
– The effect size is becoming just as important (if not more so)
as the significance testing
Inferential Statistics
6). Effect Size
Statistical testing only tells us that there was an effect. It does
not really tell us much about the magnitude of that effect.
– Effect sizes can be calculated using Cohen’s d (for two
sample t-tests) or Pearson’s r
Next up, we are going to head into Smith and Davis Chapter
10, where we will go into a bit more detail about the t-Test (both
dependent and independent).