Professional Documents
Culture Documents
How To Use SPSS PDF
How To Use SPSS PDF
Brian C. Cronk
I
ll
L
:,-
Choosing the Appropriafe Sfafistical lesf
Ytrh.t b Yq
I*l
QraJoi?
Dtfbsr
h
ProportdE
Mo.s Tha 1
lnd€Fndont lldr Tho 2 L6Eb
Varidl6 d li(bsxlq*
Vdidb
I
Notice
SPSSis a registeredtrademarkof SPSS,Inc. Screenimages@ by SPSS,Inc.
and MicrosoftCorporation.Usedwith permission.
This book is not approvedor sponsoredby SPSS.
A CaliforniaCorporation.
"PyrczakPublishing"is an imprintof FredPyrczak,Publisher,
Althoughtheauthorandpublisherhavemadeeveryeffortto ensuretheaccuracyand
completeness of informationcontainedin thisbook,we assumeno responsibilityfor
errors,inaccuracies,omissions,
or anyinconsistencyherein.Any slightsof people,
places,or organizations
areunintentional.
ProjectDirector:MonicaLopez.
of Americaby Malloy,Inc.
Printedin theUnitedStates
r s B N l -8 8 4 s8 5 -79 -5
Tableof Contents
Introduction
to theFifth Edition v
What'sNew?
v
Audience v
Organization v
SPSSVersions vi
Availabilityof SPSS
vi
Conventions vi
Screenshots vi
PracticeExercises
vii
Acknowledgments'/
vii
ChapterI GettingStarted I
Ll StartingSPSS I
t.2 EnteringData I
1.3 DefiningVariables 2
1.4 LoadingandSavingDataFiles 5
1.5 RunningYour FirstAnalysis 6
1.6 ExaminingandPrintingOutputFiles
1.7 Modi$ing DataFiles 8
Chapter2 EnteringandModifyingData ll
2.1 VariablesandDataRepresentation ll
') ') Transformation
andSelection
of Data t2
Chapter3 Descriptive
Statistics l7
3.1 Frequency andpercentileRanksfor a singlevariable
Distributions t7
3.2 Frequency andpercentileRanksfor Multille variables
Distributions 20
3.3 Measuresof CentralTendencyandMeasuresof Dispersion
for a SingleGroup 2l
3.4 Measures of CentralTendency
andMeasures of Dispersion
for MultipleGroups 24
3.5 StandardScores )7
Chapter4 GraphingData 29
4.1 GraphingBasics 29
4.2 TheNew SPSSChartBuilder 29
4.3 Bar Charts,Pie Charts,andHistograms 3l
4.4 Scatterplots 33
4.5 AdvancedBarCharts 36
4.6 EditingSPSSGraphs 39
Chapter5 Predictionand Association 4l
5.1 PearsonCorrelation Coeffi cient 4l
5.2 SpearmanCorrelation Coeffi cient 43
5.3 Simple Linear Regression 45
5.4 Multiple Linear Regression 49
u,
Chapter6 Parametric
InferentialStatistics 53
6.1 Reviewof BasicHypothesis Testing 53
6.2 Single-Sample t Test ))
6.3 Independent-Samples I Test 58
6.4 Paired-Samples t Test 6l
6.5 One-WayANOVA 65
6.6 FactorialANOVA 69
6.7 Repeated-Measures ANOVA 72
6.8 Mixed-Design ANOVA 75
6.9 Analysisof Covariance 79
6.10 MultivariateAnalysisof Variance(MANOVA) 8l
Chapter7 Nonparametric
InferentialStatistics 85
7.1 Chi-SquareGoodness of Fit 85
7.2 Chi-SquareTestof Independence 87
7.3 Mann-Whitney UTest .90
7.4 WilcoxonTest 93
7.5 Kruskal-Wallis,F/Test 95
7.6 FriedmanTest 97
Chapter8 TestConstruction 99
8.1 Item-TotalAnalysis 99
8.2 Cronbach'sAlpha 100
8.3 Test-RetestReliability l0l
8.4 Criterion-Related
Validiw t02
AppendixB PracticeExerciseDataSets r 09
PracticeDataSet I 109
PracticeDataSet2 ll0
PracticeDataSet3 ll0
tv
ChapterI
GettingStarted
Section1.1 StartingSPSS
Startup proceduresfor SPSS will differ
slightly, dependingon the exact configurationof
the machine on which it is installed.On most
ffi$ t't**** computers,you can start SPSS by clicking on
c rrnoitllttt
ffi Start, then clicking on Programs, then on SPSS.
(- lhoari{irgqrory
On many installations,there will be an SPSSicon
r,Crcrt*rsrcq.,y urhgDd.b6.Wbrd on the desktopthat you can double-click to start
(i lpanrnaridirgdataura the program.
When SPSS is started,you may be pre-
sentedwith the dialog box to the left, depending
on the optionsyour systemadministratorselected
for your version of the program. If you have the
dialog box, click Type in data and OK, which
will presenta blank data window.'
If you were not presentedwith the dialog
box to the left, SPSSshould open automatically
with a blank data window.
f- Dml*ro* fe tf*E h lholifrra
The data window and the output win-
dow provide the basic interface for SPSS. A
blank data window is shownbelow.
Section1.2 EnteringData
One of the keys to success
with SPSSis knowing how it stores
.:g H*n-g:fH"gxr__}rry".**
r! *9*_r1_*9lt
and uses your data. To illustrate the rtlxlel&l *'.1
rtl ale| lgj'Slfil Hl*lml sl el*l I
basics of data entry with SPSS,we
will useExample1.2.1.
Example 1.2.1
A surveywas given to several
students from four different
classes (Tues/Thurs mom-
ings, Tues/Thurs afternoons,
Mon/Wed/Fri mornings, and
Mon/Wed/Fri afternoons).
The students were asked
' Items that appearin the glossaryare presentedin bold. Italics areusedto indicatemenu items.
ChapterI GeningStarted
whether or not they were "morning people" and whether or not they worked. This
survey also asked for their final grade in the class (100% being the highest gade
possible).The responsesheetsfrom two studentsarepresentedbelow:
Response SheetI
ID: 4593
Dayof class: MWF X TTh
Classtime: Morning X Aftemoon
Are you a morningperson? Yes X No
Finalgradein class: 8s%
Do youwork outsideschool? Full-time Part{ime
XNo
ResponseSheet2
ID: l90l
Dayof class: x MwF _ TTh
Classtime: X Morning - Afternoon
Are you a morningperson? X Yes - No
Finalgradein class: 83%
Do vou work outsideschool? Full-time X Part-time
No
Our goal is to enterthe data from the two studentsinto SPSSfor use in future
analyses. The first stepis to determinethe variablesthatneedto be entered.Any informa-
tion that can vary amongparticipantsis a variablethat needsto be considered. Example
1.2.2liststhevariables we will use.
Example1.2.2
ID
Dayof class
Classtime
Morningperson
Finalgrade
Whetheror not the studentworksoutsideschool
In the SPSSdata window,columnsrepresent variablesandrowsrepresentpartici-
pants.Therefore,we will be creatinga datafile with six columns(variables)
andtwo rows
(students/participants).
Thus, the variable name "Q7" is acceptable,while the variable name "7Q" is not.
Similarly, the variable name "PRE_TEST" is acceptable, but the variable name
"PRE TEST" is not. Capitalizationdoes not matter, but variable namesare capitalizedin
this text to make it clear when we are referring to a variable name, even if the variable
name is not necessarilycapitalizedin screenshots.
To define a variable.click on the Variable View tab at
thebottomofthema inscre e n .Th is wills h o wy o u t h e V a ri-@
able View window. To return to the Data View window. click
on the Data View tab.
.lt-*l*lr"$,c"x.l
From the Variable View screen,SPSSallows you to createand edit all of the vari-
ables in your data file. Each column representssome property of a variable, and each row
representsa variable. All variablesmust be given a name. To do that, click on the first
empty cell in the Name column and type a valid SPSSvariable name. The program will
then fill in default valuesfor most of the other properties.
One usefulfunctionof SPSSis the ability to definevariableand value labels.Vari-
able labels allow you to associatea descriptionwith each variable.Thesedescriptionscan
describethe variablesthemselvesor the valuesof the variables.
Value labelsallow you to associatea descriptionwith eachvalue of a variable.For
example,for most procedures,SPSSrequiresnumerical values.Thus, for data such as the
day of the class (i.e., Mon/Wed/Fri and Tues/Thurs),we need to first code the values as
numbers.We can assignthe number I to Mon/Wed/Friand the number2to Tues/Thurs.
To help us keep track of the numberswe have assignedto the values,we use value labels.
To assignvalue labels,click in the cell you want to assignvaluesto in the Values
column. This will bring up a small gray button (seeanow, below at left). Click on that but-
ton to bring up the Value Labelsdialog box.
When you enter a iv*rl** ---
v& 12 -Jil
value label, you must click L.b.f ll6rhl| s*l
Add after eachentry. This will !!+ |
J::::*.-,.Tl mOVe the value and itS
associated label into the bottom section of
the window. When all labels have been
added, click OK to return to the Variable
View window.
ChapterI Gening Starred
Practice Exercise
Createa data file for the six variablesand two samplestudentspresentedin Exam-
ple 1.2.1.Name your variables:ID, DAY, TIME, MORNING, GRADE, and WORK. You
should code DAY as I : Mon/Wed/Fri,2 = Tues/Thurs.Code TIME as I : morning, 2 :
afternoon.CodeMORNING as 0 = No, I : Yes. Code WORK as 0: No, I : Part-Time,2
: Full-Time. Be sure you enter value labels for the different variables.Note that because
value labelsare not appropriatefor ID and GRADE, theseare not coded.When done,your
Variable View window should look like the screenshotbelow:
J -rtrr,d
r9"o'ldq${:ilpt"?- "*- .? --
{!,_q,
ru.g
Click on the Data View tab to open the data-entryscreen.Enter data horizontally,
beginningwith the first student'sID number.Enter the code for eachvariable in the appro-
priate column; to enterthe GRADE variablevalue,enterthe student'sclassgrade.
2
Dependingupon your versionof SPSS,it may be displayedas 2.08 + 009.
Chapter I Getting Started
: [[o|vrwl vrkQ!9try /
)1
*rn*to*u*J----.--
Instead of clicking the Value Labels icon, you may tu
l{il Ddr lrm#m Anrfrrr Cr6l! ,t
,t1
optionallytogglebetweenviews by clicking valueLaiels under r
the Viewmenu. ti
il
'i. I
r lii
Section1.4 Loading and SavingData Files
|:
Onceyou haveenteredyour data,you will need
to saveit with a uniquenamefor later useso that you
canretrieveit whennecessary.
Loadingand savingSpSSdatafiles worksin the
sameway as most Windows-based software.Underthe
File menu, there are Open, Save, and Save As
commands.SPSSdata files have a .,.sav" extension.
which is addedby defaultto the end of the filename.
ThistellsWindowsthatthefile is anSpSSdatafile.
H-
Save Your Data
When you save your data file (by clicking File, then clicking Save
or SaveAs to
specify a unique name),pay specialattentionto where you saveit. trrtist
systemsdefault to
the.location<c:\programfiles\spss>.You will probably want to
saveyour data on a floppy
disk, cD-R, or removableUSB drive so that you can taie the file withvou.
Load YourData
When you load your data (by clicking File, then
clicking Open, thenData, or by clicking the open file folder
icon), you get a similar window. This window lists all files
with the ".sav" extension.If you have trouble locating your
D{l lriifqffi
saved file, make sure you are
looking in the right directory.
ChapterI GeningStarted
Practice Exercise
To be surethat you havemasteredsav-
ing and openingdata files, nameyour sample FilE Edt $ew Data Transform Annhze @al
datafile "SAMPLE" andsaveit to a removable
storagemedium. Once it is saved,SPSSwill display the name of the file at the top of the
data window. It is wise to save your work frequently,in caseof computer crashes.Note
that filenamesmay be upper- or lowercase.In this text, uppercaseis usedfor clarity.
After you have saved your data, exit SPSS (by clicking File, then Exit). Restart
SPSSand load your databy selectingthe "SAMPLE.sav"file you just created.
j rttrtJJ
I nnafzc Gretrs UUtias gdFrdov*Help
File Edlt Vbw Data Transform
r ktlml lff al
El tlorl rl(llnl
Cottpsr Milns ) ol
lVisible:6
GanoralHnnarf&dd ) i Eipbrc,,.
Corr*lrtr ,) CrogstSr,..
't901.00 Re$$r$on ) Rdio,.,
Classfy ir l. P-Pflok,.,
OdrRrdrrtMr ,. ), . Q€ Phs.,,
Scab )
Norparimetrlclcrtt l
Tirna5arl6t )
Q.rlty Corfrd )
, I tj\g*r*qgudrr,*ts"uss-Rff(trve,.,
I -d * lnl-Xj
irllliliirrillliirrr
H. Ee lbw A*t lra'dorm Craphr ,Ufr!3 Uhdo'N Udp
-qg*g!r*!e!|ro_
slsl*gl elsl*letssJ
sl#_#rl o
+ l *l + l - l &hj l :l ql el ,
ornt * Descrlptlves
El Pccc**tvs*
r'fi Trb
6r** f]aiagarll l: \ lrrs datc\ra&ple.lav
lS Adi\€D*ard
ffi Dcscrtfhcsdkdics
lle*crhlurr Sl.*liilca
Practice Exercise
Load the sampledata file you createdearlier (SAMPLE.sav). Run the Descriptives
command for the variable GRADE and print the output. Your output should look like the
exampleon page7. Next, selectthe data window and print it.
Example1.7.1
Two morestudents
provideyou with surveys.
Theirinformationis:
ResponseSheet3
ID: 8734
Day of class: MWF X TTh
Classtime: Morning Afternoon
Are you a morningperson? Yes XNo
Final gradein class: 80%
Do you work outsideschool? Full-time Part-time
No
ResponseSheet4
ID: 1909
Day of class: X MWF TTH
Classtime: X Morning Afternoon
Are you a morning person? X Yes No
Final gradein class: 73%
Do you work outsideschool? Full+ime X Part-time
No
Chapter I Getting Started
To add thesedata, simply place two additionalrows in the Data View window (af-
ter loading your sampledata).Notice that as new participantsare added,the row numbers
becomebold. when done,the screenshouldlook like the screenshothere.
j .l lrrl vl
'..,
nh E*__$*'_P$f_I'Sgr &1{1zc Omhr t$*ues $ilndonHug_
Tffiffi
ID DAY TIME MORNING GRADE WORK var \^
1 4593.00 Tueffhu aternoon No 85.00 No
2 1gnl.B0MonMed/ m0rnrng Yes 83.00 Part-Time
3 8734.00 Tue/Thu mornrng No 80,00 No
1909.00MonAfVed/ mornrng Yeg 73.00 Part-Time
)
Procus*rlsready '.-
15P55 I i ,4
New variables can also be added.For example, if the first two participantswere
given specialtraining on time management,and the two new participantswere not, the data
file can be changedto reflect this additionalinformation.The new variable could be called
TRAINING (whether or not the participant receivedtraining), and it would be coded so
that 0 : No and I : Yes. Thus, the first two participantswould be assigneda "1" and the
Iast two participantsa "0." To do this, switch to the Variable View window, then add the
TRAINING variable to the bottom of the list. Then switch back to the Data View window
to updatethe data.
f+rilf,t - tt Inl vl
Sa E& Uew Qpta lransform &rpFzc gaphs Lffitcs t/itFdd^,SE__--
14:TRAINING l0 i of
lvGbt€r
t0 NAY TIME MORNING GRADE woRK I mruruwe 1r
1 4593.0f1 Tueffhu aterncon No 85.0u No l Yes
I 1901.OCIManA/Ved/ m0rnrng Yes ffi.0n iiart?mel- yes
3 8734"00 Tueffhu momtng No 80.n0 Noi No
4 1909.00M onrlVed/ morning Yes 73.00 Part-TimeI No '
s
I
(l) .r View { Vari$c Vlew
. l-.1 =J " rll'l
isPssW ,i
Adding data and adding variablesare just logical extensionsof the procedureswe
used to originally createthe data file. Save this new data file. We will be using it again
later in the book.
Chapter I Getting Started
Practice Exercise
Follow the exampleabove(whereTRAINING is the new variable).Make the
modifications
to your SAMPLE.savdatafile andsaveit.
l0
Chapter2
EnteringandModifying Data
In Chapter 1, we learnedhow to createa simple data file, save it, perform a basic
analysis,and examine the output. In this section,we will go into more detail about vari-
ablesand data.
Measurement Scales
There are four types of measurementscales:nominal, ordinal, interval, and ratio.
While the measurementscalewill determinewhich statisticaltechniqueis appropriatefor a
given set of data, SPSSgenerally does not discriminate.Thus, we start this section with
this warning: If you ask it to, SPSSmay conduct an analysis that is not appropriatefor
your data.For a more completedescriptionof thesefour measurementscales,consultyour
statisticstext or the glossaryin Appendix C.
Newer versionsof SPSS allow you to indicate which types of
data you have when you define your variable. You do this using the Measure
Measurecolumn. You can indicateNominal, Ordinal, or Scale(SPSS @Nv
doesnot distinguishbetweeninterval and ratio scales). f $cale
.sriltr
Look at the sampledata file we createdin Chapter l. We calcu-
lated a mean for the variable GRADE. GRADE was measuredon a ra- r Nominal
tio scale,andthe mean is an acceptablesummarystatistic(assumingthat the distribution
is normal).
We could have had SPSScalculatea mean for the variableTIME insteadof
GRADE.If we did, we wouldgettheoutputpresented here.
The outputindicatesthat the averageTIME was 1.25.Rememberthat TIME was
coded as an ordinal variable ( I =
m or ni ngcla ss,2-a fte rnoon
trlllql eilr $l-g
class).Thus, the mean is not an *lq]eH"N-ql*l
appropriatestatisticfor an ordinal :* Sl astts
.l.:D
scale,but SPSScalculatedit any- gtb
:$sh
way. The importance of consider- .6M6.ffi
$arlrba"t S#(|
ing the type of data cannot be
LS 2.qg Lt@
ll
Chapter2 Enteringand Modifying Data
Missing Data
Often, participantsdo not provide completedata.For some students,you may have
a pretestscore but not a posttestscore.Perhapsone studentleft one question blank on a
survey,or perhapsshe did not stateher age.Missing data can weakenany analysis.Often,
a singlemissingquestioncan eliminatea sub-
ql total ject from all analyses.
If you have missing data in your data
2.00 2.Bn 4.00 set, leave that cell blank. In the example to
3.00 1.0 0 4.00 the left, the fourth subject did not complete
Question2. Note that the total score(which is
4.00 3.00 7.00 calculatedfrom both questions)is also blank
becauseof the missing data for Question 2.
2.00 SPSS representsmissing data in the data
1 .0 0 2.UB 3.00 window with a period (althoughyou should
not enter a period-just leave it blank).
Selectinga Subset
We can use the SelectCasescommandto specify
F|! Ed vl6{ , O*. lr{lrfum An*/& e+hr (
a subset of our data. The Select Cases command is
t'llitl&JE O*fFV{ldrr PrS!tU6.,.
CoptO.tafropc,tir3,.. located under the Data menu. When you select this
il :id l,j.l,/r,:irrlrr! lif l ll:L*s,,.
command,the dialog box below will appear.
Hh.o*rr,.,
Dsfti fi*blc Rc*pon$5ct5,,,
q*d-:-"-- "-"""-*--*--**-""*-^*l
6 Alce
a llgdinlctidod
,r l
ConyD*S r irCmu*dcaa ]
i*np* | i:
{^ lccdotincoarrpr
sd.rt Csat
;.,* |
-:--J
c llaffrvci*lc
C6ttSldrDonoan!.ffi
t2
Chapter2 Enteringand Modifying Data
casenumber. For example,for our sampledata, the first and third casesare not 1
,1
selected.only the secondand fourth casesare selectedfor this subset. 'l
,.,:r. 1
EffEN'EEEgl''EEE'o rt lnl vl I
!k_l** 1
:
-#gdd.i.&l t
Flib'- 'l
ID TIME
MORNING ERADE WORK TRAINING 1
/,-< 4533.m Tueffhui affsrnoon No ffi.m Na Yes Not Selected
'l
An additional variable will also be createdin your data file. The new variable is
called FILTER_$ and indicateswhethera casewas selectedor not.
If we calculatea mean
DescripthreStailstics
GRADE using the subset we
just selected,we will receive std.
N Minimum Maximum M e a n Deviation
the output at right. Notice that UKAUE 2 73.00 83.00 78.0000 7 . 0 7 1 1
we now have a mean of 78.00 Va lid N
IliclwisP'l 2
with a samplesize (M) of 2 in-
steadof 4.
l3
Chapter2 Enteringand Modifying Data
Be careful when you selectsubsets.The subsetremains in ffict until you run the
commandagain and selectall cases.You can tell if you have a subsetselectedbecausethe
bottom of the data window will indicatethat a filter is on. In addition, when you examine
your output, N will be less than the total number of recordsin your data set if a subsetis
selected.The diagonal lines through some caseswill also be evident when a subsetis se-
lected.Be careful not to saveyour data file with a subsetselected,as this can causeconsid-
erableconfusionlater.
l4
Chapter2 Enteringand Modifying Data
t::,, - ltrl-Xl
Eile gdit SEw Qata lransform $nalyza 9aphs [tilities Add'gns Sindow Help
l5
Ch a p te 2
r En te ri n ga n d Mo d i fy i n gD ata
$q I '*J
codedas I or 2, basedon TOTAL.
*u"'." -ltrlIl
Flc Ed Yl.ly Drt! Tr{lform ce|6.,||tf^,!!!ry
{*!c I+
l6
Chapter3
DescriptiveStatistics
manyof the optionsavailablein SPSSfor dealingwith
ln Chapter2, we discussed
data.Now we will discusswaysto summarize our data.The proceduresusedto describe
andsummarizedataarecalleddescriptivestatistics.
Description
The Frequencies commandproducesfrequencydistributionsfor the specifiedvari-
ables.The outputincludesthe numberof occurrences, percentages,
valid percentages,
and
cumulativepercentages. The valid percentages and the cumulativepercentagescomprise
only thedatathatarenot designated asmissing.
TheFrequencies commandis usefulfor describingsampleswherethe meanis not
useful(e.g.,nominalor ordinal scales). It is alsousefulasa methodof gettingthefeelof
your data.It providesmoreinformationthanjust a meanandstandarddeviationandcan
be usefulin determiningskewandidentifyingoutliers.A specialfeatureof the command
percentileranks.
is its abilityto determine
Assumptions
Cumulativepercentages and percentilesare valid only for datathat are measured
on at leastan ordinal scale.Becausethe outputcontainsoneline for eachvalueof a vari-
able,thiscommandworksbeston variableswith a relativelysmallnumberof values.
Drawing Conclusions
TheFrequencies commandproduces outputthatindicatesboththenumberof cases
in the sampleof a particularvalueandthe percentage of caseswith that value.Thus,con-
clusionsdrawnshouldrelateonly to describingthe numbersor percentages of casesin the
regardingthe cumulativeper-
sample.If the dataareat leastordinalin nature,conclusions
centageand/orpercentilescanbe drawn.
.SPSSData Format
The SPSSdatafile for obtainingfrequencydistributions
requiresonly onevariable,
andthatvariablecanbeof anytype.
tt
C h a p te 3
r D e s c ri p ti v e
Sta ti s ti c s
I 4
Valid P6rc€nl
I 4 E4
The secondcolumn gives the frequency 7.1 7.2 15.6
72 28 6.9 6.9 22.5
of eachvalue,includingmissingvalues. 73 40 9.9 9.9 32.3
The third columngivesthe percentage of 74 27 6.7 6.7 39.0
75 30 7.4 46.4
all records (including records with 76 34 8.4 8.4 54.8
77 28 6.9 61.7
missingdata) for eachvalue.The fourth 6.9
8.9 8.9 70.6
column, labeled Valid Percenl, gives the 79 29 7.1 f.2 77.8
80 29 7.1 7.2 84.9
percentage of records(without including 81 30 7.4 7.4 92.3
records with missing data) for each 82 31 7.6 7.7 | 00.0
Total 405 99.8 100 .0
value. If there were any missingvalues, Missing 0 (Missing) 1
these values would be larger than the Total 406 100.0
r8
Chapter3 DescriptiveStatistics
l9
Ch a p re ,1
r D e s c ri p tier S ta ti s ti cs
Practice Exercise
UsingPracticeDataSet I in AppendixB, createa frequencydistributiontablefor
the mathematicsskills scores.Determine
the mathematicsskills scoreat which the 60th
percentile
lies.
Description
The Crosslabscommandproducesfrequencydistributionsfor multiple variables.
The outputincludesthenumberof occurrences of eachcombinationof levelJof eachvari-
able.It is possibleto havethecommandgivepercentages for anyor all variables.
The Crosslabscommandis usefulfor describingsampleswherethe mean is not
useful(e'g.,nominalor ordinal scales).
It is alsousefulasa methodfor gettinga feelfor
your data.
Assumptions
Becausethe outputcontainsa row or columnfor eachvalue of a variable.this
command
worksbeston variables
with a relativelysmallnumberof values.
,SPSSData Format
The SPSS data file for the Crosstabs
lnalyzc Orphn Ut||Uot
commandrequirestwo or more variables.Those I
RcF*r )
variablescanbe of anytype.
(orprycrllcEnr )
Runningthe CrosstabsCommand G*ncral llrgar Flodcl )
,
This exampleusesthe SAMpLE.sav
data ;ilffi; )
file, which you createdin Chapter l. To run the chrfy )
procedure, ctick Analyze, then Descriptive DttaRcd.Etbn )
Statistics,then Crosstabs.This will bring up ttt. scah )
main Crosstabsdialog box, below.
20
Chapter3 DescriptiveStatistics
TRAINING'
WURKCross|nl)tilntlo|l
WORK
NO Parl-Time Tolal
TRAINING Yes Count I 1
% withinTRAININO 50.0% 50.0% 100.0% Interpreting Cros stabs Output
% withinwoRK 50. 0% 50.0% 50.0%
% ofTolal 25. 0% 25.0% 50.0% The output consistsof a
No Count 1 1 ? contingencytable. Each level of
% withinTRAINING 50.0% 50.0% 1000%
% withinWORK 50. 0% 50.0% 50.0%
WORK is given a column. Each
% ofTolal 25.0% 25.0% 50.0% level of TRAINING is given a
Total Count a
4 row. In addition, a row is added
% withinTRA|NtNo 50.0% 5 00 % r 00.0%
% wilhinWORK 100. 0% 100.0% 100.0%
for total, and a column is added
% ofTolal 50. 0% 50.0% 100.0% for total.
Practice Exercise
Using PracticeData Set I in AppendixB, createa contingencytable using the
Crosstabscommand.Determinethe numberof participantsin eachcombinationof the
variablesSEX andMARITAL. Whatpercentage of participants
is married?Whatpercent-
ageof participants
is maleandmarried?
Description
Measuresof centraltendencyare valuesthat representa typical memberof the
sampleor population.Thethreeprimarytypesarethemean,median,andmode.Measures
of dispersiontell you the variabilityof your scores.The primarytypesare the range and
the standarddeviation.Together,a measure of centraltendencyanda measureof disper-
sionprovidea greatdealof informationabouttheentiredataset.
2l
Chapter,l DescriptiveStatistics
Assumptions
Each measureof central tendencyand measureof dispersionhas different assump-
tions associatedwith it. The mean is the most powerful measureof centraltendency,and it
has the most assumptions.For example,to calculatea mean, the data must be measuredon
an interval or ratio scale.In addition,the distribution shouldbe normally distributedor, at
least,not highly skewed.The median requiresat leastordinal data.Becausethe median
indicatesonly the middle score (when scoresare arrangedin order), there are no assump-
tions about the shapeof the distribution.The mode is the weakestmeasureof central ten-
dency.There are no assumptionsfor the mode.
The standard deviation is the most powerful measureof dispersion,but it, too, has
severalrequirements.It is a mathematicaltransformationof the variance (the standard
deviation is the squareroot of the variance).Thus, if one is appropriate,the other is also.
The standard deviation requiresdata measuredon an interval or ratio scale.In addition,
the distribution should be normal. The range is the weakestmeasureof dispersion.To cal-
culatea range, the variablemust be at leastordinal. For nominal scale data,the entire
frequencydistribution shouldbe presentedas a measureof dispersion.
Drawing Conclusions
A measureof central tendencyshouldbe accompaniedby a measureof dispersion,
Thus, when reporting a mean, you should also report a standard deviation. When pre-
sentinga median, you shouldalso statethe range or interquartilerange.
.SPSSData Format
Only one variable is required.
22
Chapter3 DescriptiveStatistics
,l t
cases/participants),the minimum value, the maximum F Morr l- Slm r@t il
value, the mean, and the standard deviation. Note that qq..' I
F su aa**n F, Mi*ilm 'i
some of thesemay not be appropriatefor the type of data
f u"or- F7 Maiilrn ,|' ?bl I
I
DescriptiveStatistics
N Minimum Maximum Mean Std.Deviation
graoe 4 73.00 85.00 80.2500 5.25198
ValidN (listwise) 4
23
Chapter3 DescriptiveStatistics
Practice Exercise
Using PracticeData Set I in AppendixB, obtainthe descriptive statisticsfor the
ageof the participants.
What is the mean?The median?The mode?What is the standard
deviation?Minimum? Maximum?The range?
Description
The measuresof centraltendencydiscussedearlierare often needednot only for
the entiredataset,but also for severalsubsets.One way to obtainthesevaluesfor subsets
would be to use the data-selection techniquesdiscussedin Chapter2 and apply the De-
scriptivescommandto each subset.An easierway to perform this task is to use the Means
command.The Means commandis designedto provide descriptive statisticsfor subsets
ofyour data.
Assumptions
The assumptionsdiscussedin the sectionon Measuresof Central Tendencyand
Measuresof Dispersionfor a SingleGroup(Section3.3) alsoapply to multiplegroups.
Drawing Conclusions
A measureof centraltendencyshouldbe accompanied by a measureof dispersion.
Thus,when giving a mean, you shouldalsoreporta standard deviation. When presenting
a median,you shouldalsostatethe range or interquartilerange.
SPSSData Format
Two variablesin the SPSSdata file are required.One representsthe dependent
variable and will be the variablefor which you receivethe descriptive statistics.The
other is the independentvariable and will be usedin creatingthe subsets.Note that while
SPSScalls this variablean independentvariable, it may not meet the strict criteriathat
definea true independentvariable (e.g.,treatmentmanipulation).Thus, someSPSSpro-
ceduresreferto it as the grouping variable.
Runningthe Command
This example uses the ! RnalyzeGraphs Utilities WindowHetp I
SAMPLE.sav data file you created in F r.l
-l
Firulbgt5il |
nsportt
Chapterl. The Means commandis run by ' DescriptiveStatistirs )
clicking Analyze, then Compare Means,
thenMeans. General Linearftladel F Ona-Sarnple
f feft.
This will bring up the main dialog ' Csrrelata ) Independent-Samdes T Te
-
box for the Means command. Place the . Regression I T Test,,,
Falred-SarnplEs
' (fassify F
selectedvariablein the blank field labeled Ons-Way*|iJOVA,,,
DependentList.
1A
LA
Chapter3 DescriptiveStatistics
I
:I
tu
€ arv r T ril
List
Dependant
ryl
,du** ll".i I
/wqrk Heset
I
€tr"ining
lLayarl al1*- Cancel
I
I :'r:rrt| Ii
..!'l?It.
I Independent
Li$:
l"rpI
i r:ffi
lr-, tffi,
r l*i.rl I
L-:-
VsianNc
GaseProcessingSummary
Cases
lncluded Excluded Total
N Percent N Percent N Percent
grade - morning 4 100.0% 0 .OYo 4 | 100.0%
25
Chapter3 DescriptiveStatistics
26
Chapter3 DescriptiveStatistics
Practice Exercise
UsingPracticeDataSet I in AppendixB, computethe meanandstandarddevia-
tion of agesfor eachvalueof maritalstatus.What is the averageageof the marriedpar-
Thedivorcedparticipants?
ticipants?The singleparticipants?
Description
Standardscoresallow thecomparison of differentscalesby transforming
thescores
into a commonscale.The mostcommonstandardscoreis the z-score.A z-scoreis based
on a standardnormal distribution(e.g.,a meanof 0 anda standarddeviationof l). A
z-score,therefore,representsthenumberof standarddeviationsaboveor belowthe mean
(e.9.,a z-scoreof -1.5 represents
a scoreI % standarddeviationsbelowthemean).
Assumptions
Z-scoresare basedon the standardnormal distribution. Therefore,the distribu-
tionsthatareconverted shouldbe normallydistributed,
to z-scores andthescalesshouldbe
eitherinterval or ratio.
Drawing Conclusions
Conclusionsbasedon z-scoresconsistof thenumberof standarddeviationsabove
or belowthe mean.For example,a studentscores85 on a mathematics examin a classthat
hasa meanof 70 andstandarddeviationof 5. The student'stestscoreis l5 pointsabove
the classmean(85 - 70: l5). The student's z-scoreis 3 because shescored3 standard
deviationsabovethe mean(15 + 5 :3). If the samestudentscores90 on a readingexam,
with a classmeanof 80 anda standarddeviationof 10,thez-scorewill be I .0 because
she is one standard deviation abovethe mean. Thus,even thoughher raw scorewas
higheron thereadingtest,sheactuallydid betterin relationto otherstudentson the mathe-
maticstestbecauseherz-scorewashigheron thattest.
.SPSSData Format
Calculatingz-scoresrequiresonly a singlevariablein SPSS.That variablemustbe
numerical.
27
Chapter3 DescriptiveStatistics
lr| -tsJX
Eb E* S€w Qpt. lrnsfam end/2. gr$t6
Practice Exercise
Using PracticeData Set 2 in Appendix B, determinethe z-scorethat correspondsto
each employee's salary.Determinethe mean z-scoresfor salariesof male employeesand
femaleemployees.Determinethe mean z-scorefor salariesof the total sample.
28
Chapter4
GraphingData
Section4.1 GraphingBasics
In addition to the frequency distributions,the measuresof central tendency and
measuresof dispersiondiscussedin Chapter3, graphingis a useful way to summarize,or-
ganize,and reduceyour data. It has been said that a picture is worth a thousandwords. In
the caseof complicateddatasets,this is certainlytrue.
With Version 15.0of SPSS,it is now possibleto make publication-qualitygraphs
using only SPSS.One importantadvantageof using SPSSto createyour graphsinsteadof
other software (e.g., Excel or SigmaPlot)is that the data have alreadybeen entered.Thus,
duplicationis eliminated,and the chanceof makinga transcriptionerror is reduced.
29
Chapter4 GraphingData
Make sure you have enteredthe data correctly by calculating a mean for each of
the three variables(click Analyze,thenDescriptive Statistics,thenDescriptives).Compare
your resultswith thosein the tablebelow.
Descrlptlve Statistics
srd.
N Minimum Maximum Mean Dpvi2lion
l-ttstuFl
I 16 60.00 74.00 66.9375 J.9Ub//
WEIGHT 16 06 nn 165.00 129.0625 26.3451
SEX 16 1.00 2.00 1.5000 .5164
ValidN
(listwise) 16
The Chart Builder allows you to make any kind of graph that
cc[ffy
is normally used in publication or presentation,and much of it is be-
yond the scopeof this text. This text, however,will go over the basics Eesknotnents
of the Chart Builder so that you can understandits mechanics.
On the left side of the Chart Builder window are the four main
l"ry{Y:_
litleo/Footndar
tabs that let you control the graphsyou are making. The first one is
the Gallery tab. The Gallery tab allows you to choosethe basic format
ofyour graph.
30
Chapter4 Graphing Data
Drawing Conclusions
TheFrequencies commandproduces outputthatindicatesboththenumberof cases
in the samplewith a particular valueand the percentage of caseswith that value.Thus,
conclusionsdrawn shouldrelateonly to describingthe numbersor percentages for the
regardingthe cumulativeper-
sample.If the dataareat leastordinal in nature,conclusions
centages and/orpercentilescanalsobedrawn.
SPSSData Format
You needonlv onevariableto usethiscommand.
3l
Chapter4 GraphingData
Output
The bar chart consistsof a I'axis, representingthe
frequency,and an Xaxis, representingeachscore.Note that
the only values representedon the X axis are those values
with nonzero frequencies(61, 62, and 7l are not repre-
sented).
h.lgtrt
G
,
a
I
a
L
hclght
Practice Exercise
Use PracticeData Set I in Appendix B. After you have enteredthe data,constructa
histogramthat representsthe mathematicsskills scoresand displaysa normal curve, and a
bar chart that representsthe frequenciesfor the variableAGE.
Section4.4 Scatterplots
Description
Scatterplots(also called scattergramsor scatterdiagrams)display two values for
eachcasewith a mark on the graph.The Xaxis representsthe value for one variable.The I
axis representsthe value for the secondvariable.
JJ
Ch a p te -1
r Gra p h i n gD a ta
Assumptions
Both variablesshouldbe interval or ratio scales.If nominalor ordinal dataare
used,be cautiousaboutyour interpretation
of thescattergram.
.SPSSData Format
You needtwo variablesto performthiscommand.
8n
Lh
PlrifsLa
Scfflnal
xbbrs
Hg||rd
iEbM{ Ffip*t!4.,
opbr.,
34
Chapter4 Graphing Data
Output
Theoutputwill consistof a markfor eachparticipant
at theappropriate
X and
levels.
35
Chapter4 GraphingData
Now our output will have two different setsof marks. One set representsthe male
participants,and the secondset representsthe female participants.Thesetwo setswill ap-
pear in two different colors on your screen.You can use the SPSSchart editor (seeSection
4.6) to make them different shapes,as shown in the examplebelow.
sPx
iil
os
Practice Exercise
to examinetherela-
UsePracticeDataSet2 in AppendixB. Constructa scatterplot
tionshipbetweenSALARY andEDUCATION.
Description
Bar chartscan be producedwith the Frequencie.s command(see Section4.3).
Sometimes.however.we are interestedin a bar chartwherethe I/ axis is not a frequency.
To producesucha chart,we needto usetheBar chartscommand.
36
Chapter4 GraphingData
l?i;ffitF.t-
select the Clustered Bar Chart example
d'd{4rfr trrd...
/ft,Jthd)
from the middle of the top row.
/l*n*|ts,.,
dq*oAtrm, ,
h4 | G.lary ahd lsr to @ t 6 p
Drag the example to the top work-
cfwxry
ing area. Once you do, the working area
m
ffi * $r 0* dds t bto h.td. drr
should look like the screenshotbelow.
drrrl by.lr!*
(Note that you will need to open the data
file you would like to graph in order to run
this command.)
9{
m
hlpd{
sc.ffp/Dat
tffotm
tldrtff
60elot
oidA#
:gi
Cdtfry
LSdrl
If you are using a repeated-measures design like our example here using
GRADES.sav from Chapter 6 (three different variablesrepresentingthe i values that we
want), you needto selectall threevariables(you can <Ctrl>-click them to selectmultiple
variables)and then drag all threevariablenamesto the Y-Axisarea.When you do. vou will
be given the warning messageabove. Click OK.
JI
Chapter4 GraphingData
Output
inilrut
lr
&t:
nt
Practice Exercise
Use PracticeData Set I in Appendix B. Constructa clusteredbar graph examining
the relationshipbetweenMATHEMATICS SKILLS scores(as the OepenOentvariabtej
and MARITAL STATUS and SEX (as independentvariables). Make sure you classify
both SEX and MARITAL STATUS as nominalvariables.
38
Chapter4 Graphing Data
IE :,- r--."1
Ittttr
tlttIr
tllrwel
w&&$!{!rJ
r 9-,rt JJJJ-JJ
I JJJJJJ
.nlqrl
cnl,f,,!sl
fil mlryl
OnceChart Editor is open,you can easily edit eachelementof the graph.To select
an element,just click on the relevantspot on the graph. For example,if you have addeda
title to your graph ("Histogram" in the examplethat follows), you may selectthe element
representingthe title of the graphby clicking anywhereon the title.
39
Chapter4 GraphingData
trTT.":.TJ *"
^' ir sGssir ' . '*t A:
r::-'
:J*r o :l A I 3 *l.A-I,-- whether the correct element is
selectedbecauseit will have
handlesaroundit.
If the item you have
selectedis a text element(e.g.,
the title of the graph), a cursor
will be presentand you can edit
the text as you would in a word
processing program. If you
would like to change another
attribute of the element (e.g.,
the color or font size), use the
Propertiesbox. (Text properties
are shownbelow.)
With a linle practice,
you can make excellent graphs
using SPSS.Once your graph
is formatted the way you want
it, simply select File, Save,
then Close.
o,tl*" ffiln*fot*.1
P?*l!r h ?frtmd Sa . .
$o gdt lbw gsion Ek
AaBbCc123 $vr {hat Trm$tr,,,
Spdy$a*Tmpt*c.,.
flpoft {bdt rf'.|1,,,
gltaridfu; Ua*tr$Sie
40
Chapter5
PredictionandAssociation
Section5.1 PearsonCorrelation Coefficient
Description
The Pearson correlationcoefficient(sometimes
calledthe Pearson product-moment
correlationcoefficientor simplythe Pearsonr) determinesthe strengthof the linearrela-
tionshipbetween two variables.
Assumptions
4
Both variablesshouldbe measured on intervalor ratio scales(or a dichotomous n
nominalvariable).If a relationship
existsbetween them,thatrelationship
shouldbe linear.
Becausethe Pearsoncorrelationcoefficientis computedwith z-scores,both variables 1
SP.SSData Format
Two variablesarerequiredin your SPSSdatafile. Eachsubjectmusthavedatafor
bothvariables.
Vdri.blcr
9@,. 1
4l
Chapter5 Predictionand Association
lsffi -N-
OX I
Drawing Conclusions
The correlationcoefficientwill be between-1.0 and +1.0. Coefficientscloseto 0.0
representa weak relationship.Coefficientscloseto 1.0or-1.0 representa strongrelation-
ship. Generally,correlationsgreaterthan 0.7 are consideredstrong. Correlationsless than
0.3 are consideredweak. Correlationsbetween0.3 and 0.7 are consideredmoderate.
Significant correlationsare flagged with asterisks.A significant correlation indi-
catesa reliable relationship,but not necessarilya strong correlation.With enoughpartici-
pants,a very small correlationcan be significant.PleaseseeAppendix A for a discussion
of effect sizesfor correlations.
Phrasinga SignificantResult
In the example above, we obtained a correlation of .806 between HEIGHT and
WEIGHT. A correlationof .806 is a strongpositivecorrelation,and it is significantat the
.001level.Thus,we could statethe following in a resultssection:
4/
Chapter5 Predictionand Association
Practice Exercise
Use PracticeData Set 2 in Appendix B. Determinethe value of the Pearsonconela-
tion coefficient for the relationshipbetweenSALARY and YEARS OF EDUCATION.
Section5.2 SpearmanCorrelationCoeflicient
Description
The Spearmancorrelationcoefficientdeterminesthe strengthof the relationshipbe-
tween two variables.It is a nonparametricprocedure.Therefore,it is weakerthan the Pear-
son correlationcoefficient.but it can be usedin more situations.
Assumptions
Becausethe Spearmancorrelationcoefficient functions on the basisof the ranks of
data,it requiresordinal (or interval or ratio) data for both variables.They do not needto
be normally distributed.
43
Cha p te r5 Pre d i c ti o na n dA s s o c i a ti on
SP.SSData Format
Two variablesarerequiredin your SPSSdatafile. Eachsubjectmustprovidedata
for bothvariables.
Drawing Conclusions
The correlationwill be between-1.0 and +1.0. Scorescloseto 0.0 representa weak
relationship.Scorescloseto 1.0or -1.0 representa strongrelationship.Significantcorrela-
tions are flagged with asterisks.A significant correlation indicatesa reliable relationship,
but not necessarilya strong correlation.With enoughparticipants,a very small correlation
can be significant. Generally,correlationsgreaterthan 0.7 are consideredstrong. Correla-
tions less than 0.3 are consideredweak. Correlationsbetween0.3 and 0.7 arc considered
moderate.
44
Chapter5 Predictionand Association
PhrasingResultsThatAre Significant
In the exampleabove,we obtaineda correlationof .883 betweenHEIGHT and
WEIGHT. A correlationof .883 is a strongpositivecorrelation,and it is significantat the
.001level.Thus,we could statethe following in a resultssection:
A Spearmanrho correlationcoefficient was calculatedfor the relationship
betweenparticipants'height and weight. A strong positive correlationwas
found (rh o (1 4 ):.883, p <. 0 0 1 ), in d ic a t in ga s ig n if ic a n re
t la t io n s h ip
betweenthe two variables.Taller participants tend to weigh more.
Practice Exercise
Use PracticeData Set 2 in AppendixB. Determinethe strengthof the relationship
betweensalaryandjob classification
by calculatingthe Spearmanr&o correlation.
Description
allowsthe predictionof one variablefrom another.
Simplelinearregression
Assumptions
Simple linear regressionassumesthat both variablesare interval- or ratio-scaled.
In addition,the dependentvariable shouldbe normally distributedaroundthe prediction
line. This, of course,assumesthat the variablesare relatedto each other linearly.Typi-
45
Ch a p te 5
r P re d i c ti o na n dA s s o c i ati on
.SPSSData Format
Two variablesare required in the SPSSdata file. Each subject must contribute
to
both values.
46
Chapter5 Predictionand Association
one standard error of estimate (predicted)value. Just over 95ohwill fall within two stan-
dard errors.Thus, in the previous example,95o/oof the time, our estimatedweight will be
within 32.296poundsof beingcorrect(i.e.,2 x 16.148:32.296).
ANOVAb
Sumof Mean
Model Sorrares df Souare F Sio.
1 Kegressron 6760.323 I 6760.323 25.926 .0004
Residual 3650.614 14 260.758
Total 10410.938 15
a' Predictors:
(Constant),
HEIGHT
b. Dependent
Variable:
WEIGHT
The secondpart of the output that we are interestedin is the ANOVA summaryta-
ble, as shown above.The important numberhere is the significance level in the rightmost
column. If that value is lessthan .05, then we have a significantlinear regression.If it is
/:
largerthan .05,we do not.
The final sectionof the output is the table of coefficients.This is wherethe actual "
predictionequationcan be found. I
Coefficientt'
Unstandardized Standardized
Coefficients Coefficients
Model B Std.Error Beta t S i o.
1 (Constant) -234.681 71.552 -3.280 .005
height 5.434 1.067 .806 5.092 .000
a. Dependent weight
Variable:
47
Chapter5 Predictionand Association
Drawing Conclusions
Conclusionsfrom regressionanalysesindicate(a) whether or not a significant pre-
diction equation was obtained,(b) the direction of the relationship,and (c) the equation
itself.
Note that for resultsthat are not significant,the ANOVA results and R2resultsare
given,but the regressionequationis not.
Practice Exercise
Use PracticeData Set 2 in Appendix B. If we want to predict salary from years of
education,what salary would you predict for someonewith l2 years of education?What
salarywould you predict for someonewith a collegeeducation(16 years)?
48
Chapter5 Predictionand Association
Description
The multiple linear regressionanalysisallows the prediction of one variable from
severalother variables.
Assumptions
Multiple linear regressionassumesthat all variablesare interval- or ratio-scaled.
In addition, the dependent variable should be normally distributedaround the prediction
line. This, of course,assumesthat the variablesare relatedto eachother linearly. All vari-
ablesshouldbe normally distributed.Dichotomousvariables are also acceptableas inde-
pendentvariables.
,SP,S,S
Data Format
At least three variablesare required in the SPSSdata file. Each subject must con-
tributeto all values.
Dmmd*
We are interested in predicting
l-...G
.roj I someone'sweight basedon his or her height
LLI l&-*rt
n{.rI and sex. We believe that both sex and
ryl height influence weight. Thus, we should
tb.l place the dependent variable WEIGHT in
the Dependentblock and the independent
variables HEIGHT and SEX in the Inde-
S€lcdirnVdir*
pendent(s)block. Enterboth in Block l.
fn f*---*-- ,it'r:,I This will perform an analysisto de-
Cs Lrbr&:
Er- termine if WEIGHT can be predicted from
ti4svlit{ SEX and/or HEIGHT. There are several
'---
Li-J r- methods SPSS can use to conduct this
sr"u*t. I Pr,rr...
I s* | oei*. I
analysis. These can be selected with the
Method box. Method Enter. the most widely
49
Cha p te 5
r Pre d i c ti o n
a n dA s s o c i a ti on
used,puts all variablesin the equation, whether they are significant or not. The other
methodsuse various meansto enter only those variables that are significant predictors.
Click OK to run the analvsis.
UethodlE,rt-rl
Readingthe Output
For multiplelinearregres- ModelSummary
sion,therearethreecomponents of Adjusted Std.Errorof
the output in which we are inter- Model R R Souare R Square the Estimate
ested.The first is calledthe Model .99 .993 .992 2.29571
Summary,which is foundafterthe a. Predictors:
(Constant),
sex,height
VariablesEntered/Removedsection.For our example,you should get the output above.R
Square(calledthe coefficientof determination) tells you the proportionof the variance
in the dependentvariable (WEIGHT) that can be explainedby variationin the independ-
ent variables (HEIGHT and SEX, in this case).Thus, 99.3%of the variationin weight can
be explained by differencesin height and sex (taller individuals weigh more, and men
weigh more). Note that when a secondvariable is added,our R Squaregoes up from .649
to .993.The .649was obtainedusingthe SimpleLinear Regressionexamplein Section5.3.
The StandardError of the Estimategives you a margin of error for the prediction
equation.Using the predictionequation,68%o of the datawill fall within one standard er-
ror of estimate (predicted)value.Just over 95% will fall within two standard errors of
estimates.Thus, in the exampleabove,95ohof the time, our estimatedweight will be
within 4.591 (2.296 x 2) poundsof being correct.In our Simple Linear Regressionexam-
ple in Section5.3, this numberwas 32.296.Note the higherdegreeof accuracy.
The secondpart of the output that we are interestedin is the ANOVA summaryta-
ble. For more information on readingANOVA tables,refer to the sectionson ANOVA in
Chapter6. For now, the importantnumberis the significancein the rightmostcolumn. If
that value is lessthan .05,we havea significantlinearregression. If it is largerthan .05,we
do not.
eHoveb
Sum of
Model Souares df Mean Souare F S i o.
xegresslon 03424 24 z 5171.212 v61.ZUZ .0000
Residual 68.514 13 5.270
Total 10410.938 15
a. Predictors:(Constant),sex, height
b. DependentVariable:weight
50
Chapter5 Predictionand Association
Coefficientf
Unstandardized Standardized
Coefficients Coefficients
In most texts, you learn that Y' = a + bX is the regressionequation.For multiple re-
gression,our equationchangesto l" = Bs + B1X1+ BzXz+ ... + B.X.(where z is the number
of IndependentVariables). I/' is your dependent variable, and the Xs are your independ-
ent variables. The Bs are listed in a column. Thus, our predictionequationfor the example
aboveis WEIGHT' :47.138 - 39.133(SEX)+ 2.101(HEIGHT)(whereSEX is codedas
I : Male, 2 = Female,and HEIGHT is in inches).In other words, the averagedifferencein
weight for participantswho differ by one inch in height is 2.101 pounds.Males tend to
weigh 39.133 pounds more than females.A female who is 60 inchestall should weigh
47.138- 39.133(2)+ 2.101(60):94.932 pounds.Given our earlierdiscussionof the stan-
dard error of estimate,95o/oof femaleswho are 60 inchestall will weigh between90.341
(94.932- 4.591: 90.341)and99.523(94.932+ 4.591= 99.523)pounds.
Drawing Conclusions
Conclusionsfrom regressionanalysesindicate(a) whether or not a significant pre-
diction equation was obtained,(b) the direction of the relationship,and (c) the equation
itself. Multiple regressionis generallymuch more powerful than simple linear regression.
Compareour two examples.
you must alsoconsiderthe significancelevel of eachin-
With multiple regression,
dependentvariable. In the exampleabove,the significancelevel of both independent
variables is lessthan .001.
Morbl Sratrtny
Unslanda.dizsd Slandardizad
5l
Chapter5 Predictionand Association
Practice Exercise
UsePracticeDataSet2 in AppendixB. Determinethe predictionequationfor pre-
dictingsalarybasedon education,yearsof service,andsex.Whichvariablesaresignificant
predictors?If you believethat men were paid more than womenwere,what would you
concludeafterconducting thisanalysis?
52
Chapter6
ParametricInferentialStatistics
Parametricstatisticalproceduresallow you to draw inferencesabout populations
basedon samplesof those populations.To make theseinferences,you must be able to
makecertainassumptions aboutthe shapeof the distributionsof the populationsamples.
TheNull Hypothesis
In hypothesistesting,we createtwo hypothesesthat are mutually exclusive(i.e.,
both cannotbe true at the sametime) and all inclusive(i.e.,one of them must be true).We
refer to thosetwo hypothesesas the null hypothesisand the alternative hypothesis.The
null hypothesis generallystatesthat any differencewe observeis causedby random error.
The alternative hypothesis generallystatesthat any differencewe observeis causedby a
systematicdifferencebetweengroups.
53
Chapter6 ParametricInferentialStatistics
sampleof boys. This result may have been obtainedmerely becausethe chanceenors as-
sociatedwith random sampling createdthe observeddifference (this is what the null hy-
pothesis asserts).If there is a sufficiently low probability that random errors were the
cause(as determinedby a significance test),we can statethat the differencebetweenboys
and girls is statisticallysignificant.
SignificanceLevelsvs.Critical Values
Moststatistics
textbooks
present
hypothesis
testingby usingtheconcept
of a criti-
cal value. With such an approach,we obtain a value for a test statisticand compareit to a
critical value we look up in a table.If the obtainedvalue is larger than the critical value, we
reject the null hypothesis and concludethat we have found a significant difference(or re-
lationship).If the obtainedvalue is lessthan the critical value, we fail to reject the null hy-
pothesisand concludethat there is not a significantdifference.
The critical-value approachis well suited to hand calculations.Tables that give
critical valuesfor alpha levels of .001, .01, .05, etc., can be created.tt is not practicalto
createa table for every possiblealpha level.
On the other hand, SPSScan determinethe exact alpha level associatedwith any
value of a test statistic.Thus, looking up a critical value in a table is not necessary. This,
however,doeschangethe basic procedurefor determiningwhetheror not to reject the null
hypothesis.
The sectionof SPSSoutput labeledSrg.(sometimesp or alpha) indicatesthe likeli-
hood of making a Type I error if we rejectthe null hypothesis.A value of .05 or lessin-
dicatesthat we should reject the null hypothesis(assumingan alpha level of .05). A value
greaterthan .05 indicatesthat we shouldfail to reject the null hypothesis.
In other words, when using SPSS,we normally reject the null hypothesis if the
output value under Srg. is equal to or smaller than .05, and we fail to reject the null hy-
pothesisif the outputvalueis largerthan .05.
Phrasing Results
Resultsof hypothesistestingcan be statedin different ways, dependingon the con-
ventionsspecifiedby your institution.The following examplesillustrate someof thesedif-
ferences.
54
Chapter6 ParametricInferentialStatistics
Degreesof Freedom
Sometimesthe degreesof freedom are given in parenthesesimmediately after the
symbol representingthe test,as in this example:
(3):7.0 0 ,p<.0 1
Other times, the degreesof freedomare given within the statementof results,as in
this example:
t:7.0 0 , df :3, p < .01
SignificanceLevel
When you obtain results that are significant, they can be describedin different
ways. For example,if you obtaineda significancelevel of .006 on a t test, you
could describeit in any of the following threeways:
(3 ):7 .00,p <.05
(3 ):7 .00,p <.01
(3 ) : 7.0 0 ,p : .006
Notice that becausethe exactprobabilityis .006,both .05 and .01 are alsocorrect. I
Description
The single-sampleI test comparesthe mean of a single sampleto a known popula-
tion mean. It is useful for determiningif the current set of data has changedfrom a long-
55
Chapter6 ParametricInferentialStatistics
term value (e.g., comparing the curent year's temperaturesto a historical averageto de-
termine if global wanning is occuning).
Assumptions
The distributions from which the scoresare taken should be normally distributed.
However, the t test is robust and can handleviolations of the assumptionof a normal dis-
tribution. The dependentvariable must be measuredon an interval or ratio scale.
.SP,SS
Data Format
The SPSSdata file for the single-sample / test requiresa single variablein SPSS.
That variablerepresentsthe set of scoresin the samplethat we will compareto the popula-
tion mean.
Readingthe Output
The output for the single-samplet test consistsof two sections.The first section
lists the samplevariable and somebasic descriptive statistics (N, mean, standard devia-
tion, and standarderror).
56
Chapter6 ParametricInferentialStatistics
T-Test
One-SampleS'tatistics
std. Std.Enor
N Mean Deviation Mean
LENGTH 10 35.9000 1.1972 .3786
One-SampleTes{
TeslValue= 35
95%gsnl i 6sngg
Interual
ofthe
sis. Mean Difierence
? df (2-tailed) DitTerence Lowef U pner
LENGTH 2.377 s .041 .9000 4.356E -02 7564
The secondsection of output containsthe results of the t test. The example pre-
sentedhere indicatesa / value of 2.377,with 9 degreesof freedomand a significance level
of .041.The mean differenceof .9000is the differencebetweenthe sampleaverage(35.90)
and the populationaveragewe enteredin the dialog box to conductthe test (35.00).
Drawing Conclusions
The I test assumesan equality of means.Therefore, a significant result indicates
that the sample mean is not equivalentto the population mean (hence the term "signifi-
cantly different"). A result that is not significantmeansthat there is not a significantdiffer-
ence betweenthe means.It doesnot mean that they are equal. Refer to your statisticstext
for the sectionon failure to reject the null hypothesis.
PhrasingResultsThatAre Significant
The aboveexamplefound a significantdifferencebetweenthe populationmean and
the samplemean.Thus, we could statethe following:
A single-samplet test compared the mean length of the sample to a
populationvalue of 35.00. A significantdifferencewas found (t(9) = 2.377,
p <.05).The samplemeanof 35.90(sd: 1.197)was significantlygreater
than the population mean.
57
Chapter6 ParametricInferentialStatistics
TsstValue= 57.1
A single-sample / test 95% Contldsnc6
comparedthe mean temp- llean
Inlsryalofthe
Oit8rsnce
erature over the past year t dt Sro (2-lail€d) DilTer€nce Lowsr UDOEI
lemp€latute I 688 1 ?6667 - 5.736 2 8.2696
to the long-term average.
The difference was not significant (r(8) = .417, p > .05). The mean
temperatureover the past year was 68.67(sd = 9.1l) comparedto the long-
term averageof 67.4.
Practice Exercise
The averagesalary in the U.S. is $25,000.Determine if the averagesalary of the
participantsin PracticeData Set 2 (Appendix B) is significantlygreaterthan this value.
Note that this is a one-tailedhypothesis.
Section6.3 Independent-samples
I Test
Description
The independent-samples I test comparesthe meansof two samples.The two sam-
ples are normally from randomly assignedgroups.
Assumptions
The two groupsbeing comparedshouldbe independentof eachother. Observations
are independentwhen information about one is unrelated to the other. Normally, this
meansthat one group of participantsprovidesdata for one sampleand a different group of
participants provides data for the other sample (and individuals in one group are not
matchedwith individualsin the other group).One way to accomplishthis is throughran-
dom assignmentto form two groups.
The scoresshould be normally distributed,but the I test is robust and can handle
violationsof the assumptionof a normal distribution.
The dependent variable must be measuredon an interval or ratio scale.The in-
dependentvariable shouldhaveonly two discretelevels.
SP,S,S
Data Format
The SPSSdata file for the independentI test requirestwo variables.One variable,
the grouping variable, representsthe value of the independentvariable. The grouping
variable should have two distinct values (e.g., 0 for a control group and I for an experi-
58
Chapter6 ParametricInferentialStatistics
Group Statistics
Std.Error
mornrnq N Mean Std.Deviation Mean
grade No 2 82.5000 3.53553 2.50000
Yes 2 78.0000 7.07107 5.00000
59
Chapter6 ParametricInferentialStatistics
Next, there will be a sectionwith the resultsof the I test. It should look like the
outputbelow.
The columns labeledt, df, and Sig.(2-tailed) provide the standard"answer" for the
I test.They provide the value of t, the degreesof freedom(number of participants,minus 2,
in this case),and the significancelevel (oftencalledp). Normally, we usethe "Equal vari-
ancesassumed"row.
Drawing Conclusions
Recall from the previous section that the , test assumesan equality of means.
Therefore,a significant result indicatesthat the means are not equivalent.When drawing
conclusionsabouta I test,you must statethe directionof the difference(i.e., which mean
was larger than the other). You should also include information about the value of t, the
degreesof freedom,the significancelevel,and the meansand standard deviationsfor the
two groups.
60
Chapter6 ParametricInferentialStatistics
Practice Exercise
Use PracticeData Set I (AppendixB) to solvethis problem.We believethat young
individualshave lower mathematicsskills than older individuals.We would test this hy-
pothesisby comparingparticipants25 or younger(the "young" group) with participants26
or older (the "old" group). Hint: You may need to createa new variable that represents
which age group they are in. SeeChapter2 for help.
/ Test
Section6.4 Paired-Samples
Description
The paired-samplesI test (also called a dependent/ test) comparesthe means of
two scoresfrom relatedsamples.For example,comparinga pretestand a posttestscorefor
a group of participantswould requirea paired-samples
I test.
Assumptions
The paired-samplesI test assumesthat both variablesare at the interval or ratio
levels and are normally distributed.The two variablesshould also be measuredwith the
samescale.If the scalesare different. the scoresshouldbe convertedto z-scoresbefore the
, testis conducted.
.SPSSData Format
Two variablesin the SPSSdata file are required.These variablesshould represent
two measurementsfrom eachparticipant.
6l
Chapter6 ParametricInferentialStatistics
6l 83 94 J
t
58 75 92 J
58 74 92 J
i c*@
i lryt'sn
i'w
62
Chapter6 ParametricInferentialStatistics
fc
Within the third part of the output (on the next page),the sectioncalled paired Dif-
ferences contains information about the differencesbetweenthe two variables.you miy
have learnedin your statisticsclassthat the paired-samples / test is essentiallya single-
samplet testcalculatedon the differencesbetweenthe scores.The final threecolumnscon-
tain the value of /, the degreesof freedom,and the probability level. In the examplepre-
sentedhere,we obtaineda I of -11.646,with 20 degreesof freedomand a significance
level of lessthan.00l. Note that this is a two-tailedsignificancelevel. Seethe startof this
chapterfor more detailson computinga one-tailedtest.
63
Chapter6 ParametricInferentialStatistics
Paired Differences
95%Confidence
Inlervalof the
srd, Std.Errot l)iffcrcncc
sig
Mean Dcvirl Maan Lower U D oer I df l)Jailatl\
Pair1 PRETEST
. FINAL -22.8095 8.9756 1.9586 -26.8952 -18.7239 '|1.646 20 .000
Drawing Conclusions
Paired-samplesI testsdeterminewhetheror not two scoresare significantly differ-
ent from each other. Significant values indicate that the two scoresare different. Values
that are not significant indicatethat the scoresare not significantly different.
PhrasingResultsThatAre Significant
When statingthe resultsof a paired-samplesI test,you shouldgive the value of t,
the degreesof freedom,and the significancelevel. You should also give the mean and
standard deviation for each variable.as well as a statementof results that indicates
whetheryou conducteda one- or two-tailedtest.Our exampleabovewas significant,so we
could statethe following:
A paired-samples I test was calculatedto comparethe mean pretestscoreto
the mean final exam score.The mean on the pretestwas 63.33 (sd : 8.93),
and the mean on the posttestwas 86.14(sd:9.63). A significantincrease
from pretestto final was found (t(20) : -ll .646,p < .001).
Sld.Etrof
x6an N Sl d D d a l l o n
Patr midlsrm r B t1a3 7 I 9.509 3 75889
1 nnal 79.571 1 7 95523
I 3.00680
N Cotrelali 5 to
ts4trI mtdt€rm&flnal I 96S 000
ParredOrflerences
95%ConidBncs
lnlsNrlotth€
Std.Eror DilYerenca
x s an Std Oryiation X€an Lmt u008r tl Sio (2-lailad)
. 8571a 2 96808 I 12183 1 88788 - 76a b a7a
64
Chapter6 ParametricInferentialStatistics
Practice Exercise
Usethe sameGRADES.sav datafile, andcomputea paired-samples
/ testto deter-
from midtermto final.
mineif scoresincreased
Description
Analysisof variance(ANOVA) is a procedurethat determines the proportionof
variabilityattributedto eachof severalcomponents. It is oneof the mostusefulandadapt-
ablestatisticaltechniques available.
The one-wayANOVA compares the meansof two or moregroupsof participants
that vary on a singleindependentvariable (thus,the one-waydesignation). When we
havethreegroups,we couldusea I testto determinedifferencesbetweengroups,but we
wouldhaveto conductthreet tests(GroupI comparedto Group2, Group I comparedto
Group3, andGroup2 compared to Group3). Whenwe conductmultipleI tests,we inflate
the Type I error rate and increaseour chanceof drawingan inappropriate conclusion.
ANOVA compensates for thesemultiplecomparisons and gives us a singleanswerthat
tellsus if anyof thegroupsis differentfrom anyof theothergroups.
Assumptions
The one-wayANOVA requiresa singledependentvariable and a singleinde-
pendentvariable. Which groupparticipants
belongto is determinedby the valueof the
independentvariable. Groupsshouldbe independent of eachother.If our participants
belong to more than one group each,we will have to conducta repeated-measures
ANOVA. If we havemorethanone independentvariable,we would conducta factorial
ANOVA.
ANOVA alsoassumes thatthedependentvariableis at the intervalor ratio lev-
is
elsand normallydistributed.
SP,S,S
Data Format
Two variablesare requiredin the SPSSdatafile. One variableservesas the de-
pendentvariable andtheotherasthe independentvariable.Eachparticipantshouldpro-
videonly onescorefor thedependentvariable.
65
Chapter6 ParametricInferentialStatistics
To conduct a one-way ANOVA, click Ana- i An$tea &ryhr l.{lith; wr$|f $alp
R;pa*tr )
lyze, then Compare Means, then One-I7ay ANOVA. I
DrrsFfivsft#l|'
This will bring up the main dialog box for the One-
lhcd friodd ] orn-5dtFh f f64...
lltay ANOVA command. I
Csnalda l|1|J46tddg.5arylarT16t,.,
R!E'l'd6al ) P*G+tunpbs I IcJt..,
l ':l i rl :.,.
:J cl#y rffillEil
You should place the independ-
xrded
f *wm
f,'',I ent variable in the Factor box. For our
dtin"t
S instt.,ct R*g I example, INSTRUCT representsthree
/ rcqulad
c*"d I different instructors.and it will be used
Hdp I
as our independentvariable.
Our dependentvariable will be
miqq|4^__
FINAL. This test will allow us to de-
!qt5,l fqiryt,,,l odiors"'
I termine if the instructor has any effect
on final sradesin the course.
-rl
pela$t
ToKl
/n*Jterm
d f- Fncdrdr*dcndlscls
'cqfuda
Ro"t I l* Honroga&olvairre,* j *_It J
c*ll
H& l
mrffi,.*-
Pqyg*-I odi"*.'.I
"?s'qt,.,l
66
Chapter6 ParametricInferentialStatistics
Readingthe Output
Descriptivestatisticswill be givenfor eachinstructor(i.e.,levelof the independ-
ent variable)andthe total.For example,in hisftrerclass,InstructorI hadan averagefinal
examscoreof 79.57.
Descriptives
final
95% ConfidenceIntervalfor
Mean
N Mean Std. Deviation Std.Enor LowerBound UooerBound Minimum Maximum
UU 79.5714 7.95523 3.00680 72.2',t41 86 9288 tiv.uu 6Y.UU
2.00 7 86.4286 1 0 .9 2180 4.12805 76.3276 96.5296 69.00 100.00
3.00 7 92.4286 5.50325 2.08003 87.3389 97.5182 83.00 100.00
Total 2'l 86. 1429 9.63476 2.10248 81.7572 90.5285 69.00 100.00
67
Chapter6 ParametricInferentialStatistics
Drawing Conclusions
Drawing conclusionsfor ANOVA requiresthat we indicate the value of F, the de-
greesof freedom,and the significance level. A significantANOVA should be followed by
the resultsof a post-hocanalysisand a verbal statementof the results.
our dependent variable in- N Sld Ddaton Sld Eiror LMf Bo u n d Uooer gound
stead of FINAL, we would 200 63.1r 29 10.605r8 4.00819 5333r a 7? 95r 3 1700 78 00
300 1 592857 7
6.5501 2.at5t1 53 2278 55 3a36 r9.00 71 00
have received the following Tolel 21 633333 I 92935 I ga€sa 592587 67 3979 at 0 0 79 00
output:
All:'Vl
The ANOVA was not signifi-
cant. so there is no need to re- sumof
68
Chapter6 ParametricInferentialStatistics
Practice Exercise
UsingPracticeDataSet I in AppendixB, determineif the averagemathscoresof
of
are significantlydifferent.Write a statement
single,married,and divorcedparticipants
results.
Description
The factorialANOVA is one in which thereis morethan one independentvari-
able.A 2 x 2 ANOVA, for example,hastwo independentvariables,eachwith two lev-
els.A 3 x 2 x 2 ANOVA hasthreeindependentvariables.Onehasthreelevels,andthe
othertwo havetwo levels.FactorialANOVA is very powerfulbecauseit allowsus to as-
sesstheeffectsof eachindependentvariable,plustheeffectsof the interaction.
Assumptions
FactorialANOVA requiresall of the assumptions of one-wayANOVA (i.e.,the
dependentvariable mustbe at the interval or ratio levelsand normallydistributed).In
variablesshouldbe independent
addition,theindependent of eachother.
.SPSSData Format
SPSSrequiresonevariablefor the dependentvariable,andone variablefor each
independent variable.If we haveanyindependent asmulti-
variablethatis represented
ple variables(e.g., PRETESTand POSTTEST),we must use the repeated-measures
ANOVA.
Runningthe Command
This exampleuses the GRADES.savI Rnalyze Graphs USiUas llilindow Halp
data file from earlierin this chapter.Click Ana- Repsrts
lyze, then GeneralLinear Model. then Univari- DescrbtiwStatFHcs
ate.
1 .IN S T R U C T
Reading the Output
70
Chapter6 ParametricInferentialStatistics
Tests of Between-SubjectsEffects
Dependent
Variable:FINAL
635.8214 5
Intercept 1s1998.893 1 151998.893 1867.691 .000
INSTRUCT 536.357 2 268.179 3.295 .065
REQUIRED 34.32'l 1 34.321 .422 .526
INSTRUCT ' REQUIRED 22.071 2 11.036 .136 .874
Error 1220.750 15 81.383
Total 157689.000 21
CorrectedTotal 1856.571 20
a. R Squared= .342(Adjusted =
R Squared .123)
7l
Chapter6 ParametricInferentialStatistics
Tests of Between-SubjectsEffects
t Variable:FINAL
Typelll
Sumof Mean
Source Souares df Souare F siq.
uorrecleoMooel 635.8214 5 127.164 1.563 .230
Intercept 151998.893 1 151998.893 1867.691 .000
INSTRUCT 536.357 2 268.179 3.295 .065
REQUIRED 34.321 1 34.321 .422 .526
INSTRUCT - REQUIRED 22.071 2 11.036 .136 .874
Error 1220.750 1E
81.383
Total 157689.000 21
CorrectedTotal 't856.571 20
a. R Squared= .342(AdjustedR Squared= .123)
The source table above gives us these three answers (in the INSTRUCT,
REQUIRED, and INSTRUCT * REQUIRED rows). In the example,none of the main ef-
fects or interactions was significant.In the statementsof results,you must indicatef', two
degreesof freedom (effect and residual/error),the significance level, and a verbal state-
ment for each of the answers(three,in this case).Note that most statisticsbooks give a
much simpler versionof an ANOVA sourcetable where the CorrectedModel, Intercept,
and CorrectedTotal rows are not included.
71
Chapter6 Parametriclnferential Statistics
Practice Exercise
Using PracticeDataSet 2 in AppendixB, determineif salariesare influencedby
or an interactionbetweensexandjob classification.
sex,job classification, Write a state-
mentof results.
ANOVA
Section6.7 Repeated-Measures
Description
Repeated-measuresANOVA extendsthe basicANOVA procedureto a within-
subjectsindependentvariable (whenparticipants providedatafor morethanonelevelof
/ test when more thantwo
an independentvariable).It functionslike a paired-samples
levelsarebeingcompared.
Assumptions
The dependentvariableshouldbe normallydistributedandmeasured on an inter-
of the dependentvariable shouldbe from the
val or ratio scale.Multiple measurements
same(or related)participants.
Data Format
SP,S^S
At leastthreevariablesarerequired.Eachvariablein the SPSSdatafile shouldrep-
resenta singledependentvariableat a singlelevelof theindependent variable'Thus,an
analysisof a designwith four levelsof an independentvariable would requirefour vari-
ablesin the SPSSdatafile.
If any variablerepresents effect,usethe Mixed-Design
a between-subjects ANOVA
commandinstead.
72
Chapter6 ParametricInferentialStatistics
Runningthe Command
ThisexampleusestheGRADES.sav sample
data set. Recall that GRADES.sav includesthree
sets of grades-PRETEST, MIDTERM, and
FINAL-that represent threedifferenttimesduring '@
!8*vxido"'
the semester.This allowsus to analyzethe effects t{!idf-!de r $.dryrigp,,,
)
of time on the test performanceof our sample f,txrd*r
gp{|ssriori
population(hencethe within-groupscomparison). tSoar garlrr! Cffiporur&'..
h{lruc{ l'or I
-Ff? l
-9v1"I
-ssl
/J
Chapter6 ParametricInferentialStatistics
Readingthe Output
This procedureusesthe GLM command. fil sesso,.rtput
GLM standsfor "GeneralLinearModel." It is a H &[ GeneralLinearModel
Title
very powerful command,and many sectionsof
Notes
outputarebeyondthe scopeofthis text (seeout- Wlthin-Subject s Factors
put outlineat right). But for the basicrepeated- MultivariateTests
measures ANOVA, we are interested only in the Mauchly'sTes{ol Sphericity
Testsof l{ithin-SubjectsEffects. Note that the Testsol Wthin-SubjectsEflecls
SPSSoutputwill includemanyothersectionsof Tesisol \ /lthin-SubjectsContrasts
output,whichyou canignoreat thispoint. Tests ol Between-Subjects Ellec{s
The TPesls
of l{ithin-SubjectsEffectsoutput shouldlook very similar to the output
from the otherANOVA commands. In the aboveexample,the effect of TIME has an F
value of 121.90with 2 and 40 degreesof freedom(we use the line for SphericityAs-
sumed).It is significantat less than the .001 level. When describingtheseresults,we
shouldindicatethetypeof test,F value,degrees of freedom,andsignificancelevel.
74
Chapter6 ParametricInferentialStatistics
PairedDifferences
950/o Confidence
Intervalof the
std. Std.Error l)ifferance sig
Mean Dcvietior Mean Lower UoDer df {2-tailed)
Yatr 1 l,KE I E5 |
-15.2857 3 .9 6 41 .8650 -17.0902 - 1 3 . 4 8 1 3 -'t7.670 20 .000
M IDT ERM
Pai 2 PRETEST
. F INAL -22.8095 8.97s6 1 . 9 5 8 6 -26.8952 -18.7239 -11.646 20 .000
P a i r3 M IDT ERM
. F INAL -7.5238 6.5850 1 . 4 3 7 0 - 1 0 . 5 231 4.5264 -5.236 20 .000
Practice Exercise
Use PracticeData Set 3 in Appendix B. Determine if the anxiety level of partici-
pants changedover time (regardlessof which treatmentthey received) using a one-way
repeated-measuresANOVA and protecteddependentI tests.Write a statementof results.
Description
The mixed-designANOVA (sometimescalled a split-plot design) teststhe effects
of more than one independentvariable. At leastone of the independentvariables must
75
Chapter6 ParametricInferentialStatistics
be within-subjects(repeatedmeasures).
At leastone of the independentvariables must be
between-subjects.
Assumptions
The dependent variable shouldbe normally distributedand measuredon an inter-
val or ratio scale.
76
Chapter6 ParametricInferentialStatistics
Readingthe Output
As with the standardrepeated measures command,the GLM procedureprovidesa
lot of output we will not use.For a mixed-design ANOVA, we are interestedin two sec-
f Effects.
tions.The first is Testsof Within-Subjects
Tests ot Wrtlrilr-SuuectsEflects
M eas u re :
Typelll Sum
S our c e of Squares df MeanS ouare F sis
time sphericity Assumed 6
5673.74 -
2 2836.873 817.954 000
Gre e n h o u s e -Ge i s s el 5673.746 1.181 4802.586 817.954 000
Huynh-Feldt 6
5673.74 1.356 4' t83.583 817.954 000
Lower-bound 6
5673.74 1.000 5673.746 817.954 000
time' instruct Sphericily Assumed 806.063 4 201.516 58.103 .000
0 re e n h o u s e -Ge i s s e r 806.063 2.363 341.149 58.103 .000
Huynh-Feldt 3
806.06 2.f' t2 297.179 58.103 .000
Lower-bound 806.063 2.000 403.032 58.103 .000
Erro(time) Sphericity Assumed 124857 36 3.468
Gre e n h o u sGee- isser 124.857 21.265 5.871
Huynh-Feldt 124.857 24.41'.| 5.115
Lower-bound 124.857 18.000 6.S37
This sectiongives two of the threeanswerswe need (the main effect for TIME and
the interactionresult for TIME x INSTRUCTOR). The secondsectionof output is Testsof
Between-subjectsEffects (sample output is below). Here, we get the answersthat do not
contain any within-subjects effects. For our example, we get the main effect for
INSTRUCT. Both of thesesectionsmust be combinedto oroduce the full answer for our
analysis.
Testsot Betryeen-Sill)iects
Eltecls
Me a s u reM:EA SU R1E
ransformed Variable:
Typelll Sum
S o u rc e of Squares df MeanS ouare F Siq.
Intercepl 3 6 4 19 2 .0 6 3 I
364192.063 1500.595 .000
inslrucl 18 .6 9 8 2 9.349 .039 .962
En o r 43 6 8 .5 7 1 18 242.6S 8
77
Chapter6 ParametricInferentialStatistics
For the aboveexample,we could statethe following in the resultssection(note that this
assumesthat appropriatepost-hoctestshave beenconducted):
A 3 x 3 mixed-designANOVA was calculatedto examinethe effectsof the
instructor(Instructors1,2, and,3)and time (pretest,midterm, and final) on
scores.A significant time x instructor interactionwas present(F(4,36) =
58.10, p <.001). In addition,the main effect for time was significant
(F(2,36): 817.95,p < .001). The main effect for instructor was not
significant(F(2,18): .039,p > .05).Upon examinationof the data,it appears
that Instructor3 showedthe most improvementin scoresover time.
With significant interactions, it is often helpful to provide a graph with the de-
scriptive statistics.By selectingthe Plotsoption in the main dialog box, you can make
graphsof the interaction like the one below. Interactionsadd considerable complexityto
the interpretationof statisticalresults.Consult a researchmethodstext or ask your instruc-
tor for more help with interactions.
ilme
E@rdAs f Crtt"-l -l
-
I, I lffi---_ -z
- - cd.r_J o0
sw&t.tc -3
- x
-==- E
i
s.Cr&Plob: I
L:J r---r
- .
ia
E
, ,,,:;',,1.
"l ","11::::J I
!
-gr!I
E
t
UT
2(Il
inrt?uct
Practice Exercise
Use PracticeData Set 3 in AppendixB. Determineif anxiety levels changedover
time for each of the treatment(CONDITION) types. How did time changeanxiety levels
for eachtreatment?Write a statementof results.
78
Chapter6 ParametricInferentialStatistics
Description
Analysis of covariance(ANCOVA) allows you to remove the effect of a known
covariate. In this way, it becomesa statisticalmethod of control. With methodological
controls (e.g., random assignment),internal validity is gained.When such methodologi-
cal controlsare not possible,statisticalcontrolscan be used.
ANCOVA can be performed by using the GLM command if you have repeated-
measuresfactors. Becausethe GLM commandis not included in the Base Statisticsmod-
ule, it is not includedhere.
Assumptions
ANCOVA requiresthatthe covariatebe significantlycorrelatedwith the depend-
ent variable.The dependentvariableandthecovariateshouldbe at theinterval or ratio
levels.In addition.bothshouldbe normallydistributed.
SPS,SData Format
The SPSSdatafile mustcontainonevariablefor eachindependentvariable,one
variablerepresenting
thedependentvariable,andat leastonecovariate.
*l*:" I
till q&,, I
Crd{'r.t{r}
td
f-;-'l wlswc0't
l '|ffi
j.*.1,q*J*lgl
79
Chapter6 ParametricInferentialStatistics
Drawing Conclusions
This sampleanalysiswas performedto determineif malesand femalesdiffer in
height,afterweightis accounted for. We knowthatweightis relatedto height.Ratherthan
matchparticipants or usemethodological controls,we canstatisticallyremovethe effectof
weight.
When giving the resultsof ANCOVA, we must give F, degreesof freedom,and
significancelevelsfor all main effects,interactions,and covariates.If main effectsor
interactions are significant,post-hoctests must be conducted.Descriptive statistics
(meanandstandarddeviation)for eachlevelof theindependentvariable shouldalsobe
given.
Tests of Between-SubjectsEffects
Va ri a b l e :H EIGH T
Typelll
Sumof Mean
Source Souares df S orrare F Siq.
9orrecreoMooel 215.0274 2 1U /.D 'tJ 100.476 .000
lntercept 5.580 I 5.580 5.215 .040
WEIGHT 1 1 9.964 1 1't9.964 112.112 .000
SEX 66.367 1 66.367 62.023 .000
Error 1 3.911 13 1.070
Total 7 1 9 1 9.000 16
CorrectedTotal 228.938 15
a. R Squared= .939(AdjustedR Squared= .930)
PhrasingResultsThatAre Significant
Theaboveexample
obtained
a significant
result,sowecouldstatethefollowing:
A one-waybetween-subjects ANCOVA wascalculatedto examinethe effect
of sexon height,covaryingout theeffectof weight.Weightwassignificantly
relatedto height(F(1,13): ll2.ll,p <.001). The maineffectfor sexwas
significant(F(1,13): 62.02,p< .001),with malessignificantl!taler (rn:
69.38,sd:3.70) thanfemales (m = 64.50,sd = 2.33\.
80
Chapter6 ParametricInferentialStatistics
Practice Exercise
Using Practice Data Set 2 in Appendix B, determine if salariesare different for
males and females.Repeatthe analysis,statisticallycontrolling for years of service.Write
a statementof resultsfor each.Compareand contrastyour two answers.
Description
Multivariatetestsare thosethat involve more than one dependentvariable. While
it is possible to conduct severalunivariatetests (one for each dependent variable), this
causesType I error inflation.Multivariatetestslook at all dependentvariables at once,
in much the sameway that ANOVA looks at all levels of an independent variable at
once.
Assumptions
MANOVA assumesthat you havemultipledependentvariables that are relatedto
each other. Each dependent variable should be normally distributedand measuredon an
interval or ratio scale.
8l
Chapter6 Parametriclnferential Statistics
)
(onddc )
Bsrysrsn )
Iqkrcc cocporpntr'.,
Lgdrcs ,
Readingthe Output
in two primarysectionsof output.The first onegivesthe results
We are interested
of the multivariatetests.The sectionlabeledGROUPis the one we want. This tells us
whetherGROUPhadan effecton anyof our dependentvariables.Fourdifferenttypesof
multivariatetestresultsaregiven.The mostwidely usedis Wilks'Lambda.Thus,the an-
swerfor theMANOVA is a Lambdaof .828,with 4 and28 degrees of freedom.Thatvalue
is not significant.
82
Chapter6 ParametricInferentialStatistics
Tests ot BelweerFs[l)iectsEflects
Typelll Sum
S our c e Dependen Vta ri a b l e of Squares df Mean
Suuare F Sio.
ConecledModel sat 3077.778' 2 1538.889 .360 .703
gre 5200.000b 2 2600.000 .587 .568
Intercept sat 5 2 05688.889 1 5205688.889 1219.448 .000
gre 5 2 48800.000 1 5248800.000 1185.723 .000
group sat 307t.t78 2 1538.889 .360 .703
gre 5200.000 I 2600.000 .597 .568
Enor sal 64033.333 15 4268.889
gre 66400.000 15 4426.667
Total sat 5272800.000 18
g te 5320400.000 18
Corrected Total sat 67111.111 4'
ta
qre 71600.000 17
a. R S qu a re=d .0 4 6(A d j u s teRd S q u a re=d -.081)
b. R S qu a re d = .0 7(Ad
3 j u s teRdSq u a re d=051)
-
Drawing Conclusions
We interpretthe resultsof the univariatetestsonly if the group Wilks'Lambdais
significant.Our resultsare not significant,but we will first considerhow to interpretre-
sultsthataresignificant.
83
Chapter6 ParametricInferentialStatistics
we would have had a Inlercepl PillaIsTrace .989 613.592. 2.000 t.000 .000
Wilks'Lambda .0tI 6a3.592. 2.000 r.000 .000
significant MANOVA, and Holsllln0's
T€cs 91.912 6t3.592. 2 000 1.000 .000
RoYsLargsslRoot 91.912 613.592. 2.000 {.000 .000
we could statethe following: group piilai's Trec€ .579 1 n{o | 000 30.000 032
Wllks'Lambda .123 3.157. I 000 28.000 011
HolellingbTrace 1.t01 I 000 26.000 008
Roy's Largsst Rool t .350 't0.t25r 2.000 I 5.000 002
a Exatlslalistic
b. Thestausticis an uppsrboundon F halyisldsa lffisr boundonthesiqnitcancB
lml
c. Desi0n:
Intercepl+0roup
84
Chapter7
NonparametricInferentialStatistics
Nonparametric testsareusedwhenthecorresponding parametricprocedure is inap-
propriate.Normally,this is becausethe dependentvariable is not interval- or ratio-
scaled.It can alsobe becausethe dependentvariable is not normallydistributed.If the
statisticsmayalsobe appropriate.
dataof interestarefrequencycounts,nonparametric
Description
The chi-squaregoodness whetheror not sampleproportions
of fit testdetermines
values.For example,it couldbe usedto determineif a die is "loaded"
matchthe theoretical
or fair. couldalso usedto comparethe proportionof childrenborn with birth defects
It be
to the populationvalue (e.g.,to determineif a certainneighborhood has a statistically
higher{han-normal rateof birth defects).
Assumptions
We needto makevery few assumptions. Thereareno assumptionsaboutthe shape
for eachcategoryshouldbe at leastl, andno
of the distribution.The expectedfrequencies
morethan20%o shouldhaveexpectedfrequencies
of thecategories of lessthan5.
Data Format
SP,S,S
onlya singlevariable.
SPSSrequires
Runningthe Command
We will createthe following dataset and call it COINS.sav.The following data
theflippingof eachof two coins20 times(H is codedasheads,T astails).
represent
C o i NI: H T H H T H H T H HHTTTHTHTTH
C O i N 2:T T H H T H T H TTHHTH HTHTHH
Namethe two variablesCOINI andCOIN2,andcodeH as I andT as 2. The data
file thatyou createwill have20 rowsof dataandtwo columns,calledCOINI andCOIN2.
85
Chapter7 NonparametricInferentialStatistics
n3nI
iryd I
i E$cOrURutClc
' 3Ll
; 6 Garrna*c
r (. Usarp.dirdrarr0o
l
: : . i) iir ' l I
i
i r t , t , i, t I
86
Chapter7 NonparametricInferentialStatistics
Drawing Conclusions
A significantchi-square testindicatesthat the datavary from the expected
A testthatis not significaniindicatesthatthedata values.
areconsistent with theexpectedvalues.
Phrasing ResultsThat Are Significant
In describingtheresults,you shouldstatethe v.alue
of chi-square (whosesymbolis
the
1'1' degrees.of dt.aot, it.'rignin.ance level,anaa description
ample'with a significantchi-square of the results.For ex-
(for,a sample'different
from the exampleabove,such
asif we haduseda "loaded"die),we courdstati
the foilowing:
A chi-square goodness of fit testwascalculated comparingthe frequencyof
occurrence of eachvalueof a die.It washypothesized thateachvaluewould
occuran equalnumberof times.Significanideviation
nor trtr rtypothesized
valueswasfoundfXlSl :25.4g,p i.OS).Thedie
appearsto be.,loaded.,,
Notethatthisexampleuseshypothetical values.
Phrasing ResultsThatAre NotSignificant
If the analysisproducesno significantdifference,
as in the previousexample,we
couldstatethefollowing:
A chi-squaregoodnessof fit testwascalculated
comparingthe frequencyof
occulrence
of headsandtailson a coin.It washypothesized
would occur an equalnumberof times.No significant thateachvalue
deviationfrom the
hypothesized
valueswasfound(xre): .20,p rlOsl. The coin
fair. apfearsto be
Practice Exercise
Use Practice Data Set 2 in Appendix B. In the entire population from which the
sample was drawn , 20yo of employees are clerical, 50Yo are technical, and 30%oare profes-
sional.Determinewhetheror not thesampledrawnconformsto thesevalues.HINT: Enter
therelative proportionsof thethreesamplesin order(20, 50, 30) in the"ExpectedValues"
afea.
Description
The chi-squaretest of independencetests whether or not two variables are inde-
pendent of each other. For example, flips of a coin should be independent events, so
knowing the outcome of one coin toss should not tell us anything about the secondcoin
toss.The chi-squaretest of independenceis essentiallya nonparametricversion of the in-
teraction term in ANOVA.
87
Chapter7 NonparametricInferentialStatistics
Drawing Conclusions
A significant chi-squaretest indicatesthat the data vary from the expectedvalues.
A test that is not significantindicatesthat the dataare consistentwith the expectedvalues.
Practice Exercise
Use PracticeData Set 2 in Appendix B. In the entire populationfrom which the
samplewas drawn, 20o of employeesare clerical, 50Yoaretechnical, and30o/oare profes-
sional. Determinewhetheror not the sampledrawn conformsto thesevalues.HINT: Enter
the relativeproportionsof the threesamplesin order(20, 50,30) in the "ExpectedValues"
area.
Description
The chi-squaretest of independencetests whether or not two variables are inde-
pendentof each other. For example,flips of a coin should be independent events, so
knowing the outcome of one coin toss should not tell us anything about the secondcoin
toss.The chi-squaretest of independenceis essentiallya nonparametricversion of the in-
teraction term in ANOVA.
87
Chapter7 NonparametricInferentialStatistics
Assumptions
Very few assumptionsare needed.For example,we make no assumptionsaboutthe
shapeof the distribution. The expectedfrequenciesfor each categoryshould be at least l,
and no more than 20o/oof the categoriesshouldhave expectedfrequenciesof lessthan 5.
Data Format
.SP,SS
At leasttwo variablesare required.
*:51
.!rd I
siJ
H'b I
$,iii1ilr'tirir
':l
Click Statistics.then check the Chi- p&ffi |- Cqrddionc
square box. Click Continue. You may also , x\ire- Or*ut -*-, -
88
Chapter7 NonparametricInferentialStatistics
Readingthe Output
The output consistsof two COINI' GOIN2Crosstabulatlon
parts.The first part gives you the corN2
counts.In this example,the actual Head Tail Total
golNl Fleao uounl 4 11
and expectedfrequencies are shown Expected 11.0
becausethey were selectedwith the 6.1 EN
Count
Cel/soption. Tail Count 4 Y
Expected
5.0 4.1 9 .0
Count
I
?r{t$t,,,.l!{h I l?ry1!,,, Total Count
Expected
11 I 20
11.0 on 20.0
Count
!
Ccrr*c fq.rrt I
Note that you can also use the Cells option to
,--!
;l? 0b*cvcd : Cr"* |
display the percentagesof each variable that are each
1v rxpccico
fron I value. This is especiallyuseful when your groups are
N
P"cc*$or Rrdd'Alc different sizes.
i
il* npw i if uttlaltddtu I
The secondpart of the output gives the results
;l" Colrm I lf $.n*rtu I of the chi-squaretest.The most commonlyusedvalue
t|* ral I ll* n4,a"e*aru*Auco I
1.,..-...-..-..--,..----".*.J
i.-*-..--.--.--.'***--*--***-l is the Pearson chi-square, shown in the first row
: N o d r l c $a W{ t*t - ' - ""- "- ^ ' - " ' - "- l
(valueof .737).
i r no,"u.or"or*r l^ 8o.ndc..trradtr
i
I f Trurlccalcorntr l. Innc*cr*vralrf*
|
I r Xor4.r*rprr I
Chi-SquareTests
Asymp.Sig ExactSig, ExactSig.
Value df (2-sided) (2-sided) (1-sided)
PearsonL;nr-!'quare {3t" I .391
ContinuityCorrectiorf 165 1 .684
LikelihoodRatio 740 1
.390
Fisher'sExactTest .653 .342
Linear-by-Linear .403
700 I
Association
N of ValidCases 20
a' ComputedonlYfor a 2x2lable
b. 3 cells(75.0%)haveexpectedcountlessthan 5. The minimumexpectedcountis 4.
05.
Drawing Conclusions
A significantchi-squaretest resultindicatesthat the two variablesare not inde-
pendent.A valuethatis not significantindicatesthatthe variablesdo not vary significantly
from independence.
89
Chapter7 NonparametricInferentialStatistics
Practice Exercise
A researcherwantsto knowwhetheror not individualsaremorelikely to helpin an
emergencywhen they are indoorsthan when they are outdoors.Of 28 participantswho
wereoutdoors,19helpedand9 did not.Of 23 participants who wereindoors,8helpedand
15 did not. Enterthesedata,and find out if helpingbehavioris affectedby the environ-
ment.The key to this problemis in the dataentry.(Hint: How many participantswere
there,andwhatdo you knowabouteachparticipant?)
Description
The Mann-WhitneyU testis thenonparametric equivalentof the independentt test.
samplesarefrom thesamedistribution.The Mann-
It testswhetheror not two independent
WhitneyU testis weaker thanthe t test,and the / testshouldbe usedif you
independent
canmeetits assumptions.
Assumptions
The Mann-WhitneyU testusesthe rankingsof the data.Therefore,the datafor the
two samplesmustbe at leastordinal. Thereareno assumptionsaboutthe shapeof the dis-
tribution.
90
!il;tIFfFF".
Chapter7 NonparametricInferentialStatistics
,SPS,S
Data Format
This command requiresa singlevariablerepresenting
thedependentvariableand
a second
variableindicating
groupmembership.
I Andy* qaphr U*lx WMw lbb To run the command,click Analyze, then Non-
i
I
R.po.ti ,
[6 gl*l parametric Tests, then 2 Independent Samples. This
Doroht*o1raru.r ,
Cd||olaaMcrfE t willbring up the maindialog box.
| 6on ralthccl'lodd I Enter the dependentvariable (LONG, for this
j C*ru|*c )
r Rooa!33hn example) in the Test Variable List blank. Enter the in-
cta*iry )'
i
j DatrRoddton )
dependentvariable (EXPERIENCE)as the Grouping
' Sada ) Variable.Make surethat Mann-WhitneyU is checked.
:@e@
J llrsSarfr t
O$squr.,..
thd|*rl... Click Define Groupsto selectwhich two groups
I Q.s*y Cr*rd t Rur... you will compare.For this example,we will compare
I BOCCuv.,,, 1-sdTpbX-5,..
nl 3 those runners with no experience(0) to those runners
' --- rl@ttsstpht,.,
nl ei--'
e Rlnand5{ndnr,,,
with a lot of experience(2). Click OK to run the
n]' rf KRdetcdsrmplo*.,, analvsis.
bl 2:
liiiriil',,,
-5J
IKJ | &r*!r*l
C.,"* |
],TYTI ryl
GnuphgVadadr: fry* |
fffii0?r-* l*J
l ; ..' ,''r 1,
iTod Tiryc**-
il7 Mxn.\dWn6yu I_ Ko&nonsov'SrnirnovZ
I
l- Mosasa<tcnproactkmc l- W#.Wdrowfrznnrg
9l
Chapter7 NonparametricInferentialStatistics
sis.)l
a. Not corscled for ties.
b GrouprngVanabb: sxporience
92
Chapter7 NonparametricInferentialStatistics
Practice Exercise
Assumethat the mathematicsscoresin PracticeExercise I (Appendix B) are meas-
ured on an ordinal scale.Determineif youngerparticipants(< 26) have significantly lower
mathematicsscoresthan older participants.
Description
The Wilcoxontestis the nonparametric equivalentof the paired-samples (depend-
ent) t test.It testswhetheror not two relatedsamplesare from the samedistribution.The
Wilcoxontestis weakerthanthe independent I test,so the I testshouldbe usedif you can
meetits assumptions.
Assumptions
The Wilcoxontestis basedon thedifferencein rankings.The datafor the two sam-
plesmustbe at leastordinal. Thereareno assumptions
aboutthe shapeof thedistribution.
SP^SS
Data Format
The testrequirestwo variables.Onevariablerepresents
the dependentvariable at
onelevelof the independentvariable.The othervariablerepresentsthe dependentvari-
ableat thesecondlevelof theindependentvariable.
93
Chapter7 NonparametricInferentialStatistics
Readingthe Output
The outputconsistsof two parts.The first partgivessummarystatisticsfor thetwo
variables.The secondsectioncontainstheresultof theWilcoxontest(givenas4.
Ranks TestStatisticsb
Mean Sumof MEDIUM -
N Rank Ranks LONG
MEUTUM
- NegaUVe a z -1214
LONG Ranks
4 5.38 21.50
Asymp.
Positive D sig. .904
Ranks
5 4.70 23.50
(2-tailed)
Ties 3c a' Basedon
Total 12 negativeranks.
A . M ED IU M< L ON G b'wilcoxon
b. tr,lgOtUtr,it>
LONG SignedRanks
c . LON G = M ED IU M
Test
Practice Exercise
Use the RACE.savdata file to determinewhetheror not the outcomeof short-
races.Phraseyour results.
distanceracesis differentfrom thatof medium-distance
94
Chapter7 NonparametricInferentialStatistics
Description
The Kruskal-Wallis .F/ test is the nonparametric equivalent of the one-way
ANOVA. It testswhetheror not severalindependentsamplescome from the samepopula-
tion.
Assumptions
Becausethe test is nonparametric,there are very few assumptions.However, the
test doesassumean ordinal level of measurement for the dependentvariable. The inde-
pendentvariable shouldbe nominal or ordinal.
,SPS,S
Data Format
SPSSrequiresone variableto representthe dependentvariable and anotherto rep-
resentthe levelsof the independentvariable.
RangpforEm{*tEVrri*lt
t c"{h.I
Mi{run l0
+ry{ |
Mrrdlrrrn la Hdp I
Ranks
Reading the Output
exoerience N MeanRank
The output consists of two parts. rong .uu 4 1 0. 5 0
The first part gives summary statisticsfor 1.00 4 6.50
each of the groups defined by the group- 2.00 4 2.50
ing (independent) variable. Total 12
95
Chapter7 NonparametricInferentialStatistics
1.00 4 7.25 df 2
2.00 Asymp.Sig. .861
4 5.88
Total 12 a. KruskalWallisTest
b. GroupingVariable:
experience
96
Chapter7 NonparametricInfbrentialStatistics
Practice Exercise
Use PracticeData Set 2 in AppendixB. Job classificationis ordinal (clerical<
technical< professional).
Determineif malesandfemaleshavedifferinglevelsofjob clas-
sifications.Phraseyourresults.
Description
The Friedmantestis thenonparametric
equivalentof a one-wayrepeated-measures
ANOVA. It is usedwhenyou havemorethantwo measurements from relatedparticipants.
Assumptions
The testusesthe rankingsof thevariables,so the datamustbe at leastordinal. No
otherassumptionsarerequired.
SP,SSData Format
SPSSrequiresat leastthreevariablesin the SPSSdatafile. Eachvariablerepre-
sentsthedependentvariableat oneof thelevelsof theindependentvariable.
f r;'ld{iv f* Cdi#ln
97
Chapter7 NonparametricInferentialStatistics
Readingthe Output
The outputconsistsof two sections.The first sectiongivesyou summarystatistics
for eachof the variables.The secondsectionof the outputgivesyou the resultsof the test
as a chi-squarevalue. The examplehere has a value of 0.049 and is not significant
(Asymp.Sig.,otherwise knownasp,is .976,whichis greater than.05).
Mean 12
Rank Chi-Square| .049
LUNU 2.00 d f lz
MEDIUM 2.04 Asymp.Sis.
| .976
SHORT 1.96
a. Friedman
Test
Drawing Conclusions
The Friedmantest assumesthat the threevariablesare from the samepopulation.A
significantvalue indicatesthat the variablesare not equivalent.
PhrasingResultsThatAre Significant
If we obtained a significant result, we could state the following (these are hypo-
theticalresults):
A Friedman test was conductedcomparingthe averageplace in a race of
nrnnersfor short-distance, medium-distance, and long-distanceraces.A sig-
nificant differencewas found C@: 0.057,p < .05). The length of the race
significantlyaffectsthe resultsof the race.
Practice Exercise
Usethedatain PracticeDataSet3 in AppendixB. If anxietyis measured on an or-
dinal scale,determineif anxietylevelschangedovertime.Phraseyour results.
98
Chapter8
$
't,
s
.fi
i$
TestConstruction
j
Section 8.1 ltem-Total Analvsis
Description
Item-totalanalysisis a way to assessthe internal consistencyof a data set. As
such,it is one of many testsof reliability. Item-totalanalysiscomprisesa numberof items
that make up a scaleor testdesignedto measurea singleconstruct(e.g.,intelligence),and
determinesthe degreeto which all of the items measurethe sameconstruct.It doesnot tell
you if it is measuringthe correctconstruct(that is a questionof validity). Before a test can
be valid, however,it must first be reliable.
Assumptions
All the itemsin the scaleshouldbe measuredon an interval or ratio scale.In addi-
tion, eachitem shouldbe normallydistributed.If your itemsare ordinal in nature,you can
conduct the analysisusing the Spearmanrho correlationinsteadof the Pearsonr correla-
tion.
SP.SSData Forrnat
SPSSrequiresone variablefor each item (or I Andyle Grrphs Ulilities Window Help
question)in the scale.In addition,you must have a RPpo*s
variablerepresentingthe total scorefor the scale. , Ds$rripllvoStatirlicc )
Ccmpar*Msans F
GtneralLinearMadel I
iilrliililrr'
Vuiabla* WRiqression
.q ! l )
!:*l', ." :
I"1l Conducting the Test
J13l
. ry1 PearsonItem-total analysisuses the
Coruelation command.
, Condation
Coalfubr*c To conduct it, open the
l|7 P"as* l- Kcndafsta* f- Spoamm
file you cre-
l * * - * _ _ . -_ ^ _ _ QUESTIONS.sav data
,'. Tsd ot SlJnific&cs ated in Chapter 2. Click Analyze,
I rr r'oorxaa f ona.{atud I then Coruelate, then Bivariate.
|7 nag ris{icsitqorrcldimr Place all questionsand the
total in the right-hand window, and
click OK. (For more help on conductingconelations,seeChapter5.) The total can be cal-
culatedwith the techniquesdiscussedin Chapter2.
99
Chapter8 Test Construction
Readingthe Output
The output consistsof a Correlatlons
correlation matrix containingall o1 o2 Q3 TOTAL
questionsand the total. Use the Lll l'oarson \,orrelallon 1.000 cll t to o/J
Description
Cronbach's alphais a measure of internalconsistency. As such,it is oneof many
testsof reliability. Cronbach'salphacomprisesa numberof itemsthat makeup a scale
designedto measurea singleconstruct(e.g.,intelligence), and determines the degreeto
whichall the itemsaremeasuring the sameconstruct.It doesnot tell you if it is measuring
the correctconstruct(thatis a questionof validity). Beforea testcanbe valid,however,it
mustfirst be reliable.
Assumptions
on an interval or ratio scale.In addi-
All the itemsin thescaleshouldbe measured
tion, eachitem shouldbe normallydistributed.
100
Chapter8 Test Construction
Data Format
,SP,SS
SPSSrequiresonevariablefor eachitem(or question)in the scale.
Analysis. ;
This will bring up the maindialogbox l,
it.
for Reliability Analysis.Transferthe ques- )
tions from your scaleto the Itemsblank,and )
click OK. Do not transferany variablesrepre'
sentingtotal scores.
Readingthe Output
RelLrltilityStntistics
In this example,the reliability coefficientis 0.407.
Numberscloseto 1.00are very good,but numberscloseto Cronbach's
0.00representpoorinternal consistency. Aloha N of ltems
.407 3
Section8.3 Test-RetestReliability
Description
Test-retestreliabilityis a measureof temporal stability. As such,it is a measure
of reliability. Unlikemeasures of internal consistencythattell you theextentto whichall
of the questionsthat makeup a scalemeasurethe sameconstruct,measures of temporal
overtime and/orovermultiple
stability tell you whetheror not the instrumentis consistent
administrations.
Assumptions
The total scorefor the scaleshouldbe an interval or ratio scale.The scalescores
shouldbe normallydistributed.
l 0l
Chapter8 Test Construction
.SP,S,S
Data Format
SPSSrequiresa variablerepresenting thetotalscorefor the scaleat the timeof first
administration.A secondvariablerepresenting thetotal scorefor the sameparticipants at a
differenttime (normallytwo weekslater)is alsorequired.
Readingthe Output
The correlationbetweenthe two scoresis the test-retestreliabilitycoeffrcient.It
shouldbe positive.Strongreliability is indicatedby valuescloseto 1.00.Weakreliability
is indicated
by valuescloseto 0.00.
Description
Criterion-related validity determinesthe extentto which the scaleyou are testing
correlateswith a criterion.For example,ACT scoresshouldcorrelatehighly with GPA. If
theydo, thatis a measure of validity forACT scores.If theydo not,thatindicatesthatACT
scoresmaynot be valid for theintendedpurpose.
Assumptions
All of the sameassumptions for the Pearsoncorrelationcoefficientapplyto meas-
uresof criterion-related
validity(intervalor ratio scales,normal distribution,etc.).
.SP,S,S
Data Format
Two variablesarerequired.Onevariablerepresentsthetotal scorefor the scaleyou
aretesting.Theotherrepresents
thecriterionyou aretestingit against.
Readingthe Output
The correlationbetweenthe two scoresis the criterion-relatedvalidity coefficient.
It shouldbe positive.Strongvalidity is indicatedby valuescloseto 1.00.Weakvalidity is
indicatedby valuescloseto 0.00.
102
AppendixA
Effect Size
Many disciplines are placing increasedemphasison reporting effect size. While
statisticalhypothesistestingprovidesa way to tell the odds that differencesare real, effect
sizesprovide a way to judge the relative importanceof thosedifferences.That is, they tell
us the size of the differenceor relationship.They are alsocritical if you would like to esti-
matenecessarysamplesizes,conducta power analysis,or conducta meta-analysis. Many
professionalorganizations(e.g.,the AmericanPsychologicalAssociation)are now requir-
ing or strongly suggestingthat effect sizesbe reportedin addition to the resultsof hypothe-
sis tests.
Becausethere are at least 4l different types of effect sizes,reach with somewhat
different properties,the purposeof this Appendix is not to be a comprehensiveresourceon
effect size, but ratherto show you how to calculatesomeof the most common measuresof
effectsizeusingSPSS15.0.
Co h e n ' s d
One of the simplest and most popular measuresof effect size is Cohen's d.
Cohen's d is a memberof a classof measurements called"standardized meandifferences."
In essence, d is the differencebetweenthe two meansdivided by the overall standard de-
viation.
It is not only a popularmeasureof effect size,but Cohenhasalso suggested a sim-
ple basisto interpretthe value obtained.Cohen'suggested that effect sizesof .2 aresmall,
.5 are medium,and .8 are large.
We will discussCohen's d as the preferredmeasureof effect size for I tests.Unfor-
tunately,SPSSdoes not calculateCohen's d. However,this appendixwill cover how to
calculateit from the output that SPSSdoesproduce.
EffectSizefor Single-Sample
t Tests
Although SPSSdoesnot calculateeffect size for the single-sample
/ test,calculat-
ing Cohen's d is a simplematter.
' Kirk, R.E. (1996). Practicalsignificance:A conceptwhose time has come. Educational & Psychological
Measurement,56, 7 46-7 59.
' Cohen,J. (1992). A power primer. PsychologicalBulletin, t t 2, 155-159.
103
Appendix A Effect Size
In this example,using Cohen's guidelinesto judge effect size, we would have an effect
size betweenmedium and large.
Group Statlltlcr
Std.Error
morntno N Mean Std. Oeviation Mean
graoe No z UZ.SUUU J.JJIDJ Z,SUUUU
yes 2 78.0000 7.07107 s.00000
(n ,-1 )s , 2+ (n , -l)s r2
S pooled =
n r+ n r-2
- tlr.Sr552+ (2 -t)7.07| (
rp o o te d-\ /12
2+ 21
S pool"d =
Spooted= 5'59
, - 78.00
82.50
o =-
5.59
d =.80
104
Appendix A Effect Size
EffectSizefor Paired-Samples
t Tests
As you have probably learned in your statisticsclass, a paired-samplesI test is
really just a specialcaseof the single-sampleI test.Therefore,the procedurefor calculat-
ing Cohen's d is also the same.The SPSSoutput,however,looks a little different,so you
will be taking your valuesfrom different areas.
Pair€dSamplosTest
Paired Differences
95% Confidence
Intervalof the
f)iffcrcnne
std. Std.Error sig
Mean f)eviatior Mean Lower Upper df (2-tailed)
YAlt 1 |'KE ttss I
. FINAL -22.809s A OTqA 1.9586 -26.8952 -18.7239 11.646 20 .000
,D
u --
JD
- 22.8095
8.9756
d = 2.54
Notice that in this example,we representthe effect size (d; as a positive number
eventhoughit is negative.Effect sizesare alwayspositivenumbers.In this example,using
Cohen'sguidelinesfor the interpretationof d, we have found a very large effect size.
12(Coefficient of Determination)
While Cohen's d is the appropriatemeasureof effect size for / tests, correlation
and regressioneffect sizes should be determinedby squaringthe correlation coefficient.
This squaredcorrelationis called the coefficient of determination. Cohen' suggestedhere
that correlationsof .5, .3, and.l corresponded to large,moderate,and small relationships.
Thosevaluessquaredyield coefficientsof determination of .25,.09, and .01 respectively.
It would appear,therefore,that Cohen is suggestingthat accounting for 25o/oof the vari-
ability representsa large effbct, 9oha moderateeffect, and lo/oa small effect.
EffectSizefor Correlation
Nowhere is the effect of sample size on statisticalpower (and therefore signifi-
cance) more apparentthan with correlations.Given a large enoughsample,any correlation
can becomesignificant.Thus, effect size becomescritically important in the interpretation
of correlations.
'Cohen, J. (1988). Statisticalpower analysisfor the behavioral sciences(2"d ed). New Jersey:Lawrence
Erlbaum.
105
Appendix A Effect Size
EffectSizefor Regression
TheModelSummary sec- ModelSummary
tion of the output reports R2 for
you. The example output here Std. Error
shows a coefficient of determi- Adjusted of the
Model R R Souare R Souare Estimate
nation of .649, meaningthat al- .8064 .649 .624 16.1 480
most 65% (.649\ of the variabil- a. Predictors:
(Constant),
HEIGHT
ity in the dependentvariable is
accountedfor by the relationship
betweenthe dependent and in-
dependentvariables.
Eta SquaredQtz)
A third measureof effect size is Eta Squared (ry2).Eta Squared is usedfor Analy-
sis of Variancemodels.The GLM (GeneralLinear Model) functionin SPSS(the function
that runs the proceduresunderAnalyze-General Linear Model) will provide Eta Squared
(tt').
Eta Squared has an interpretationsimilar to a squaredcor-
relationcoefficient1l1.lt represents the proportionof the variance tt SS"ik",
n, --
accountedfor by the effect.-Unlike ,2, ho*euer, which represents Sqr," . SS*
onlylinearrelationships,,72canrepresentanytypeofrelationship.
106
Appendix A Effect Size
l-[qoral*
|- $FdYr|dg
f A!.|it?ld
rli*dr
f'9n drffat*l
Cr**l.|tniliivdrra5:
tl l c,,*| n* |
Testsot EetweerF$iltectsEftcct3
Dependenl Variable:
score
TypelllSum PadialEta
Source nf Sdila!es df tean Souare F si q, Souared
u0rrecleo M00el 1 0 .4 5 0 r 2 5.225 19.096 .000 761
Inlercepl 91.622 4
I s't.622 331.862 .000 965
gr0up 1 0 .4 5 0 1
5.225 1S .096 .000 761
Enor 3 .2 8 3 12 .274
Total 1 0 5 .0 0 0 15
CorrectedTotal 1 3 .7 3 3 't4
a. R Squared= .761(Adiusied R Squared= .721)
107
Appendix B PracticeExerciseData Sets
PracticeData Set2
A survey of employeesis conducted.Each employeeprovides the following infor-
mation: Salary (SALARY), Years of Service (YOS), Sex (SEX), Job Classification
(CLASSIFY), and EducationLevel (EDUC). Note that you will haveto code SEX (Male:
l, Female: 2) and CLASSIFY (Clerical: l, Technical: 2, Professional
: 3).
ll*t:g - Jrs$q
Numsric
lNumedc
PracticeData Set3
Participantswho havephobiasare given one of threetreatments(CONDITION).
at threeintervals-beforetreatment(ANXPRE),
Their anxietylevel (1 to l0) is measured
one hour after treatment(ANXIHR), and againfour hoursafter treatment(ANX4HR).
Notethatyou will haveto codethevariableCONDITION.
4lcondil ,Numeric
i'-
7l '* '"
ll0
Appendix B Practice Exercise Data Sets
lll
AppendixC
Glossary
everypossibleoutcome.
All Inclusive.A set of eventsthat encompasses
Alternative Hypothesis. The oppositeof the null hypothesis,normally showing that there
is a true difference. Generallv. this is the statementthat the researcherwould like to
support.
Cohen's d. A common and simple measureof effect size that standardizesthe difference
betweengroups.
$ nialog Box. A window that allows you to enter information that SPSS will use in a
$ command.
t
* Oichotomous with onlytwo levels(e.g.,gender).
Variables.Variables
il
I DiscreteVariable. A variablethat can have only certainvalues(i.e., valuesbetween
'f *hich thereis no score,like A, B, C, D, F).
I'
i Effect Size. A measurethat allows one to judge the relative importanceof a differenceor
. relationshipby reportingthe size of a difference.
I 13
AppendixC Glossary
Interval Scale. A measurement scale where items are placed in mutually exclusive
categories,with equal intervalsbetweenvalues.Appropriatetransformationsinclude
counting,sorting,andaddition/subtraction.
Mean.A measure
of centraltendencywherethesumof thedeviationscoresequalszero.
Mutually Exclusive. Two events are mutually exclusivewhen they cannot occur
simultaneously.
NormalDistribution.A symmetric,
unimodal,
bell-shaped
curve.
n4
Appendix
C Glossary
Outliers. Extreme scoresin a distribution. Scoresthat are very distant from the mean and
the rest of the scoresin the distribution.
Output Window. The SPSSwindow that containsthe resultsof an analysis.The left side
summarizesthe resultsin an outline. The right side containsthe actual results.
Protected Dependent / Tests. To preventthe inflation of a Type I error, the level needed
to be significantis reducedwhen multipletestsare conducted.
Quartiles. The points that define a distribution into four equal parts. The scoresat the
25th,50th,and 75th percentileranks.
Skew. The extent to which a distribution is not symmetrical.Positive skew has outliers on
the positive(right) sideof the distribution.Negativeskew hasoutlierson the negative(left)
sideof the distribution.
I 15
AppendixC Glossary
erroneouslyrejectsthe null
Type I Error. A Type I error occurswhen the researcher
hypothesis.
Variance.A measure
of dispersion deviation.
equalto thesquaredstandard
l16
AppendixD
SampleDataFilesUsedin Text
A varietyof smalldatafiles areusedin examplesthroughoutthis text. Hereis a list
of whereeachappears.
COINS.sav
Variables: COINI
COIN2
Enteredin Chapter7
GRADES.sav
Variables: PRETEST
MIDTERM
FINAL
INSTRUCT
REQUIRED
Enteredin Chapter6
HEIGHT.sav
Variables: HEIGHT
WEIGHT
SEX
Enteredin Chapter4
QUESTIONS.Sav
Variables: Ql
Q2 (recodedin Chapter2)
Q3
TOTAL (addedin Chapter2)
GROUP(addedin Chapter2)
Enteredin Chapter2
Modifiedin Chapter2
tt7
AppendixD SampleDataFilesUsedin Text
RACE.sav
Variables: SHORT
MEDIUM
LONG
EXPERIENCE
Enteredin Chapter7
SAMPLE.sav
Variables: ID
DAY
TIME
MORNING
GRADE
WORK
TRAINING (addedin Chapter1)
Enteredin ChapterI
Modifiedin ChapterI
SAT.sav
Variables: SAT
GRE
GROUP
Enteredin Chapter6
Other Files
For somepracticeexercises,
seeAppendixB for neededdatasetsthatarenot usedin any
otherexamplesin thetext.
l18
AppendixE
Informationfor Users
of EarlierVersionsof SPSS
Thereare a numberof differences betweenSPSS15.0and earlierversionsof the
software.Fortunately,mostof themhavevery little impacton usersof this text. In fact,
r{
mostusersof earlierversionswill be ableto successfully
usethis text withoutneedingto
ril reference
this appendix.
Graphing functions.
Prior to SPSS12.0,the graphingfunctionsof SPSSwerevery limited.If you are
using a version of SPSSolder than version 12.0, third-partysoftwarelike Excel or
SigmaPlotis recommended for theconstructionof graphs.If you areusingVersion14.0of
thesoftware,useAppendixF asan alternativeto Chapter4, whichdiscusses graphing.
l 19
Appendix E Information for Usersof Earlier Versionsof SPSS
Variableiconsindicatemeasurement
type.
In versions of SPSS earlier
than 14.0,variableswere represented
in dialog boxes with their variable
label and an icon that represented
whether the variable was string or
numeric (the examplehere shows all
variablesthat were numeric).
Starting with Version 14.0,
SPSS shows additional information
about each variable. Icons now rep-
resentnot only whether a variable is
numeric or not, but also what type of
measurementscale it is. Nominal
variables are representedby the & f Mlcrpafr&nlm
/srsinoPtrplecnft
icon. Ordinal variables are repre- / itsscpawei[hura
sentedby the dfl i.on. Interval and /v*tctaweir**&r
/TimanAccAolao
ratio variables(SPSSrefers to them
$ Cur*ryU Orlgin
1c I
as scale variables) are represented ClNrmlcnu clrnaaJ
bv the d i"on. dq$oc.t l"ylo.:J
It liqtq'ftc$/droyte
4*r.-r-l qql{.,I
sqFr*u.,|
md* Heb
t|sr
$$imlzeA[ Whdows
lCas,sav [Dda*z] - S55 DctoEdtry
t20
AppendixF
GraphingDatawithSPSS13.0and14.0
This appendixshouldbe usedas an alternativeto Chapter4 whenyou are us-
ing SPSS13.0or 14.0.Theseproceduresmay alsobe usedin SPSS15.0,if de-
sired,by selectingLegacyDialogsinsteadof ChortBuilder.
GraphingBasics
In addition to the frequency distributions,the measuresof central tendency and
measuresof dispersiondiscussedin Chapter3, graphing is a useful way to summarize,or-
ganize,and reduceyour data. It has been said that a picture is worth a thousandwords. In
the caseof complicateddatasets,that is certainlytrue.
With SPSSVersion 13.0 and later,it is now possibleto make publication-quality
graphsusing only SPSS.One importantadvantageof using SPSSinsteadof other software
to createyour graphs(e.9., Excel or SigmaPlot)is that the data have alreadybeen entered.
Thus, duplicationis eliminated,and the chanceof making a transcriptionerror is reduced.
Editing SP,S,SGraphs
Whatever command you use
tl 9:.1tl 1bl rl .l @l kl D l ol rl : j
to create your graph, you will
probably want to do some editing to
make it look exactly the way you 8L
want. In SPSS,you do this in much
the sameway that you edit graphsin l
,
other software programs (e.9., l
t21
AppendixF GraphingDatawith SPSS13.0and 14.0
I* -l ql ryl
Once you have selectedan element,you can tell that the correctelementis selected
becauseit will havehandlesaroundit.
If the item you have selectedis a text element(e.g.,the title of the graph),a cursor
will be presentand you can edit the text as you would in word processingprograms.If you
would like to changeanotherattributeof the element(e.g., the color or font size), use the
Propertiesbox (Text propertiesare shownabove).
t22
AppendixF GraphingDatawith SPSS13.0and 14.0
Data Set
For the graphingexamples,we will usea new set of
data.Enterthe databelowand savethe file as HEIGHT.sav.
The data representparticipants' HEIGHT (in inches),
WEIGHT(in pounds), andSEX(l : male,2= female).
Descrlptlve Statlstlcs
std.
N Minimum Maximum Mean Deviation
ntrt(,n I 'to OU.UU I1.U U oo.YJ/0 J .9U O/
WEIGHT 16 95.00 165.00 129.0625 26.3451
SEX 16 1.00 2.00 1.s000 .5164
Valid N
(listwise)
16
123
AppendixF GraphingDatawith SPSS13.0and 14.0
Description
Bar charts,pie charts,and histogramsrepresentthe number of times each scoreoc-
curs by varying the height of a bar or the size of a pie piece.They are graphicalrepresenta-
tions of the frequencydistributionsdiscussedin Chapter3.
Drawing Conclusions
The Frequenciescommandproducesoutput that indicatesboth the number of cases
in the sample with a particular value and the percentageof caseswith that value. Thus,
conclusionsdrawn should relate only to describing the numbers or percentagesfor the
sample.If the dataare at leastordinal in nature,conclusionsregardingthe cumulativeper-
centages and/orpercentilescan alsobe drawn.
124
AppendixF GraphingDatawith SPSS13.0and 14.0
Output
The bar chart consistsof a hrleht
Y axis, representing the
frequency, and an X axis, repre-
senting each score. Note that the
only values representedon the X 1.5
axis are those with nonzero
frequencies(61, 62, and 7l are c
a
not represented). f
s
a
1.0
Practice Exercise
Use PracticeData Set I in Appendix B. After you have enteredthe data,constructa
histogramthat representsthe mathematicsskills scoresand displaysa normal curve, and a
bar chart that representsthe frequenciesfor the variableAGE.
t25
AppendixF GraphingDatawith SPSS13.0and 14.0
Scatterplots
Description
Scatterplots(also called scattergramsor scatterdiagrams)display two values for
eachcasewith a mark on the graph.The Xaxis representsthe value for one variable.The /
axis representsthe value for the secondvariable.
Assumptions
Both variablesshouldbe interval or ratio scales.If nominal or ordinal data are
used,be cautiousaboutyour interpretationof the scattergram.
.SP,t^tData Format
You needtwo variablesto perform this command.
xl
*t
m
m
Simpla
Scattal
0veday
Scattel
ffi
ffi
Mahix
Scattel
3,0
Scatter
ffi sittpt"
lelLl Oot
li Definell
-
q"rylJ
,
Helo I
ffil;|rlil',r
m
s.fd
3*J
,- 9dMrk6by
qej
| ,J l-- *fl
,- L*dqs.bx
IJ T--
P*lb - This will give you the main Scatterplot
8off,
dialog box. Enter one of your variablesas the I
rl[- axis and the secondas the X axis. For example,
I ti ...:
I'u;i**
L":# .
126
AppendixF GraphingDatawith SPSS13.0and 14.0
Output
X andI levels.
Theoutputwill consistof a markfor eachsubjectat theappropriate
74.00
r20.m
t27
AppendixF GraphingDatawith SPSS13.0and 14.0
Graph
sox
aLm
o 2.6
72.00
70.00
58.00
c
.9
I
s
56.00
o
64.00 oo
o
52.00
Practice Exercise
to examinethe rela-
UsePracticeDataSet2 in AppendixB. Constructa scatterplot
tionshipbetweenSALARY andEDUCATION.
Description
You canproducebar chartswith theFrequencies command(seeChapter4, Section
4.3). Sometimes,
however,we are interestedin a bar chartwherethe I axis is not a fre-
quency.To producesucha chart,we needto usetheBar Chartscommand.
SPS,SData Format
At leasttwo variablesare neededto performthis command.Thereare two basic
kinds of bar charts-thosefor between-subjects
designsand thosefor repeated-measures
designs.Usethe between-subjects methodif onevariableis the independentvariableand
the otheris the dependentvariable.Use the repeated-measures
methodif you havea dependentvariable for eachvalueof the G"dr tJt$ths Mdsns
independentvariable(e.g.,you wouldhavethreevariablesfor a €pkrv
designwith three valuesof the independentvariable). This IrfCr$We
Map )
normallyoccurswhenyou takemultipleobservations overtime.
128
]1
lr#
I t*ls
o rhd
Practice Exercise
Use PracticeDataSet I in AppendixB. Constructa bar graphexaminingthe rela-
tionshipbetweenmathematics skills scoresandmaritalstatus.Hint: In the BarsRepresent
area.enterSKILL asthevariable.
t29