Professional Documents
Culture Documents
AnIntroductionto PartI:TheQuantitativeTradition
PartII:AnIntroductiontoSPSS
QuantitativeDataAnalysis&SPSS PartIII:UnivariateAnalysis
PartIV:Recoding
Jared Wesley, PhD PartV:CompositeMeasures
Assistant Professor
PartVI:BivariateAnalysis
home.cc.umanitoba.ca/~wesley
PartVII:InferentialStatistics
PartVIII:ControlVariables
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
Paradigm?Revolution?Imperialism?
PARTI:TheQuantitativeTradition
JustHowRelevantisPoliticalScience?
PatriciaCohen,NewYorkTimes (09/10/20)
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
QuantitativeAnalysis EthicsofSurveyResearch
tradeoff:reliabilityforvalidity
1
SPSSasaTool
SPSSisthepackageofchoicefornovices.
StatisticsPackagefortheSocialSciences
PARTII:AnIntroductiontoSPSS recentlypurchasedbyIBM(PASW)
developedbyNormanNieinthelate1960s
sophistication(esp.withsyntax)
easeofuse(esp.withWindows)
foradvancedanalysis:Stata,Gauss,Nero
AccessingSPSS GettingStarted
FreeTrialVersion:spss.com UniversityofManitobacomputerlabs
Windows:21days;Mac:30days Programs MathStats SPSSforWindows
ComputersonCampus(studentversion)
OpeningtheDataFile(.sav)
GraduatePack:$239.00(PASW)
File Open Data
Manualsavailableinmostlibraries.
GettingStarted GettingStarted
RemembertosetyourJournal
Journalssaveyoursyntax
futureuse,replication,verification
Edit Options
Generaltab
RecordsyntaxinJournal
Append
Browse
2
GettingStarted ColumnsinVariableView
Name:variableid(max.7characters)
DataView
Type:Numeric(quant) orString(qual)
columns:variables(surveyquestions)
Width:charactersinvariablename(default=8)
rows:cases(surveyrespondents)
Decimals:numberofdecimalplaces
VariableView Label:moredescriptivename
columns:cases(respondents) Values:responsesets
rows:variables(questions) Missing:valuesthatareexcluded
Columns:columnwidthfordisplaypurposes
UsetheCodebookasaguide.
Align:justification
Measure:nominal,ordinal,scale(interval)
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
BasicUnivariateAnalysis
Frequencies
measuresofcentraltendency
PARTIII:UnivariateAnalysis mean,median,mode
Frequencies Frequencies
Analyze DescriptiveStatistics
Frequencies
3
Yourturn YourSyntaxJournal
MeasuresofCentralTendency MeasuresofCentralTendency
Calculatingthemean(entiresample)
Analyze DescriptiveStatistics
Descriptive Statistics
Descriptives
N Minimum Maximum Mean Std. Deviation
Public vs. Private Health
77 1 7 2.84 1.770
Care: Personal Stance
Valid N (listwise) 77
Yourturn ComparingMeans
Determine
Analyze CompareMeans Means
themeanplacementoftheNDPonthehealth
carespectrum DependentList:thedependentvariable(s)
forwhichyouwanttocalculatemeanscores
themeanplacementoftheLiberalParty
e.g.,NDPposition;Liberalposition;PCposition
themeanplacementofthePCParty
IndependentList:theindependent
Whatifwewantedtoknowhow variable(s)uponwhichyouwanttocompare
respondentsfromdifferentpartiesviewed e.g.,partyaffiliationoftherespondent
theiropponentsplacements?
requiresbivariate analysis
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
4
ComparingMeans ComparingMeans
Report
(Wesley,2009:225)
NonCommittalResponses NonCommittalResponses
Runafrequencytableforthefollowing WhatifnotascertainedwereNOTcoded
variable:OverallSpectrum:Personal asmissing?
Stance[xallmy][Q88]. ReturntoVariableView.
Overall Spectrum: Personal Stance Locatexallmy[line#118].
Frequency Percent Valid Percent
Cumulative
Percent ClickonValues.
Valid Far Left 4 4.9 5.4 5.4
Left
Centre-Left
16
10
19.8
12.3
21.6
13.5
27.0
40.5
Notethatnotascertainediscodedas9.
Centre 18 22.2 24.3 64.9
Centre-Right 17 21.0 23.0 87.8 Imagineif9swereincludedinthecalculation.
Right 9 11.1 12.2 100.0
Missing
Total
not ascertained
74
7
91.4
8.6
100.0
WhatiftheywerecodedasCentre?
Total 81 100.0
Recoding
Sometimesvariablesarenotcodedina
usefulway.
E.g.,regionintheCanadianElectionStudy
PartIV:Recoding theCEScontainsonlyaprovincevariable
NL=10,PE=11,NS=12,NB=13 Atlantic:1000
Atlantic:100
QC=24 CentralCanada:2000
Quebec:200
ON=35 Ontario:300
MB=46,SK=47,AB=48 Prairies:400
BC=59 Western&NorthernCanada:3000
BritishColumbia:500
YT=60,NT=61 Territories:600
5
Recoding Recoding
Almosteveryvariableweuseneedstobe RecodingvariablesinSPSS
recodedinsomeway,atsomepoint. Transform Recode IntoDifferent Variables
E.g.,YearofBirthintoAge[Q93]
HowdowewanttomeasureAge?
intervallevel(xyearsold)
ordinallevel(cohorts)
e.g.,Seniors,Boomers,GenX,GenY
nominallevel
e.g.,peoplewhocameofageinanelectionyear
Recoding Recoding
RecodingvariablesinSPSS
InputVariable=thevariableyouwishtochange
Recode
OutputVariable=thechangedvariable
00through44 4
namesandlabelsareuptoyou
45through66 3
RemembertoclickChange 67through79 2
Then,Old&NewValues 80through98 1
Allothervalues Systemmissing
ClickContinueandOK.
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
Recoding Recoding
Thenewvariable[age101]appearsatthe
bottomofthedataset(inVariableView). Verifyyourrecoding runafrequencytable
Analyze DescriptiveStatistics Frequencies
Wecannowformatthevariable.
clickonValues Age
6
Yourturn OutlineofDiscussion
Recodespndwel[Q40]asameasure PartI:TheQuantitativeTradition
ofsupport forcutstowelfare. PartII:AnIntroductiontoSPSS
Recodeindivid3[Q17]asa PartIII:UnivariateAnalysis
dichotomousvariable. PartIV:Recoding
PartV:CompositeMeasures
PartVI:BivariateAnalysis
PartVII:InferentialStatistics
PartVIII:ControlVariables
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
CompositeMeasures
Often,wecombineaseriesofvariablesto
createamorecomprehensive(valid,
PARTV:CompositeMeasures reliable)indicatorofaparticularconcept.
e.g.,theConsumerPriceIndex(CPI)
Therearetwotypesofcomposite
measures:
indexes
scales
Indexes Indexes
Stepsincreatinganindex:
definetheconcept HypotheticalExample
searchforindicators(surveyquestions) concept:PoliticalKnowledge
testforconvergentvalidity componentindicators:testquestions
Dothevariouscomponentindicatorscorrelatewithoneanother? 1point
Whoisthepremierofyourprovince?
Dotheycorrelatetoo perfectly?(Aresomeredundant?)
WhoisthePrimeMinister? 1point
0points
recode thecomponentvariables
WhocampaignedontheGreenShift? 1point
0points
e.g.,sothattheyareintheappropriatedirection
compute yourindex Whattypeofelectoralsystemdoweuse? 1point
0points
createsanewvariable
Note:Ascaleisacompositemeasurewhosecomponentsarelogicallyrelated.
E.g.,theBogardussocialdistancescale
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
7
Indexes Indexes
CreatingaPopulismIndex[Q12Q14] CreatingaPopulismIndex[Q12Q14]contd
recodingnecessary? addvaluelabels
combiningthenewvariables recodingnecessary?
Transform Compute e.g.,transformtheindexintoatrichotomous variable
TargetVariable:thenew(composite)variable
Type&Labeloptional,atthisstage Yourturn.
usetheonscreencalculator
e.g.,pop1+pop2+pop3
Createazerotothreepointindextomeasure
moralconservatism.
verifyyourindex(runfrequencytable)
BivariateAnalysis
Bivariate analysisexaminesthe
relationshiporassociationbetweentwo
variables.
correlationisthefirststeptoward
PARTVI:BivariateAnalysis establishingcausation
Bivariateanalysismayproduce
crosstabs(aka contingencytables)
correlationcoefficients
(gamma),(lambda),CramersV,taub
Crosstabs Crosstabs
Whatistherelationshipbetweenparty(IV)and
Contingencytables moralconservatism(DV)?
dependentvariable(DV)ontherows
Moralism Index Score B (0 or 1, out of 3) * Party Crosstabulation
independentvariable(IV)onthecolumns Party
NDP Liberal Party PC Party Total
Moralism Index 0 Count 17 7 5 29
Presentingdataincrosstabularform Score B (0 or 1,
out of 3)
% within Party 63.0% 33.3% 15.2% 35.8%
1 Count 4 8 7 19
requiresasmallnumberofIV&DV % within Party 14.8% 38.1% 21.2% 23.5%
2 Count 4 4 9 17
categories. % within Party 14.8% 19.0% 27.3% 21.0%
3 Count 2 2 12 16
% within Party 7.4% 9.5% 36.4% 19.8%
Total Count 27 21 33 81
% within Party 100.0% 100.0% 100.0% 100.0%
8
Crosstabs Yourturn
Locate/prepareyourDVandIV. Createacrosstabtodemonstratethe
Analyze DescriptiveStatistics Crosstabs relationshipbetween
dependentvariableintoRow(s)pane genderandmoralconservatism
independentvariableintoColumn(s)pane supportforprivatehealthcareandmoral
conservatism
Cells
Counts: Observed
Percentages: Column
Continue.OK.
CorrelationCoefficients CorrelationCoefficients
atleastonevariableatthe bothvariablesatthe
Sometimesitisdifficulttointerpretthe NOMINAL ORDINAL
relationshipdepictedinacrosstab. level level
e.g.,tabletoomassive
PRE* Lambda Gamma
measures
Summarystatistics(correlation
coefficients)existtosummarizethe nonPRE
CramersV Taub
relationship. measures
*ProportionalReductioninError
*ProportionalReductioninError I.e.,Howmuchbettercanwepredictthevalue
ofthedependentvariablebyknowingthevalueoftheindependentvariable?
ofthedependentvariablebyknowingthevalueoftheindependentvariable?
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
NominalLevelCoefficients OrdinalLevelCoefficients
GammaandTaubrangefrom1to+1
LambdaandCramersVrangefrom0to1 0indicatesnorelationship(perfectindependence)
0indicatesnorelationship(perfectindependence) 1indicatesaperfectnegativerelationship
1indicatesaperfectrelationship(orcorrelation) +1indicatesaperfectpositiverelationship
scoresalmostalwaysfallbetweenthesepoles scoresalmostalwaysfallbetweenthesepoles
Forexample Forexample
genderandvotechoice:=.17 incomeandconservatism:=.11
religionandvotechoice:=.22 numberofcatsandconservatism:=.24
Whichrelationshipisstrongest? Whichrelationshipisstrongest?
Whatdoeseachrelationshipmean? Whatdoeseachrelationshipmean?
9
ChoosingCoefficients InterpretingtheCoefficients
Thetradeoffs GeneralStandardsinPoliticalScience
Gammaandlambdaaremoreintuitive,but under.15 generallyinsubstantial
morepronetoerror. .10to.20 weak
.20to.25 moderate
CramersVandTaubaremoreconservative,
.25to.30 moderatetostrong
butgenerallymoreaccurate.
.30to.35 strong
.35to.40 verystrong
.40to.45 abnormallystrong
over.45 redundant(variablesarecongruent)
CalculatingCoefficients
SPSScalculatescorrelationcoefficientsalong
withcrosstabs.
Analyze DescriptiveStatistics Crosstabs
Statistics PARTVII:InferentialStatistics
Yourturn
rerunthefollowingcrosstabs,withtheappropriate
correlationcoefficients:
genderandmoralconservatism
supportforprivatehealthcareandmoralconservatism
InferentialStatistics Formoreinformation
Sampling in Political Science
Descriptivestatisticsdescribe thenature http://home.cc.umanitoba.ca/~wesley/3950/sampling
oftherelationship. Probability Theory
http://home.cc.umanitoba.ca/~wesley/3950/probability
Inferentialstatisticsdefinehowconfident
wecanbeinmakinginferences fromour MinimumSampleSizesata95%ConfidenceLevel
findings(aboutasample)tothelarger PercentageSamplingError MinimumSampleSize
+/1% 10,000
population. +/ 2% 2,500
+/3% 1,111
+/4% 625
+/5% 400
+/10% 100
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
10
Whatyouneedtoknow ErrorsinInference
Statisticalsignificance Twotypesoferrors:
Arelationshipisstatisticallysignificantifwe TypeIerror:afalsepositive
canlegitimatelyinferthatitexistsbeyondthe
sampleunderexamination. TypeIIerror:afalsenegative
Justbecausearelationshipisstatistically
Testsofstatisticalsignificanceare
significantdoesnotmeanitissubstantive,nor
viceversa. designedtokeepusfrommakingTypeI
Statisticalsignificancedependsonmanythings errors,buttheymakeusmorelikelyto
(esp.thesizeandqualityofoursample). commitaTypeIIerror.
DeterminingStatisticalSignificance InterpretingStatisticalSignificance
predetermineminimumlevel .000=significantatthe99.9%level
typically,95%confidence
I.e.,ifwesampledfromthispopulation100times, .01=significantatthe99%level
wewouldfindthisrelationship95times(19times .01to.05=significantatthe95%level
outof20).
otherbenchmarks:99%or99.9% over.05=notstatisticallysignificant
Recall:higherstandardsreducethe I.e.,wemustconfirmthenullhypothesis
likelihoodofTypeIerrors,butincrease
thelikelihoodofTypeIIerrors.
CalculatingStatisticalSignificance
InSPSS,inferentialstatistics(statistical
significance)arecalculatedalongsidedescriptive
statistics(correlationcoefficients).
PARTVIII:ControlVariables
Returntoyourrecentcrosstabsandstatistics
genderandmoralconservatism
supportforprivatehealthcareandmoral
conservatism
Aretheserelationshipsstatisticallysignificant?
11
ControlVariables ControlVariables
Controlvariablestesttheaccuracyofthe Controlvariablesareusedto
original(zeroorder)bivariaterelationship. testforspuriousness
e.g.,PR fringeparties(politicalculture)
E.g.,Doestherelationshipbetweengender
andsupportforpublicchildcareholdtrue, testforinterveningeffects
whenwecontrol foroccupation? e.g.,education participation(efficacy)
testforreinforcingeffects
e.g.,gender prochoice(feminism)
testforspecifyingeffects
e.g.,region conservatism(religion)
Table1:OriginalRelationship Table2:PartialRelationship
Independent Variable
Independent Variable IV IV IV Total
category 1 category 2 category 3 Total category 1 category 2 category 3
category A
Dependent
category B Control DV category A
Variable
Total Variable DV category B
category (i) Total (CV cat i)
Control DV category A
Variable DV category B
category (ii) Total (CV cat ii)
Table3:ParticipationbyReligiositybyGender Table4a:CivicDutybyRegion
Engagement
Region
low medium high Total Atlantic Quebec Ontario West Total
It is every citizen's strongly agree Count 345 773 1022 1101 3241
low participation duty to vote in % within 2004 Region 75.0% 74.5% 78.6% 74.9% 76.0%
federal elections. somewhat agree Count 81 216 187 250 734
% within 2004 Region 17.6% 20.8% 14.4% 17.0% 17.2%
female high participation somewhat disagree Count 21 32 51 73 177
% within 2004 Region 4.6% 3.1% 3.9% 5.0% 4.1%
strongly disagree Count 13 16 40 46 115
Total (female) % within 2004 Region 2.8% 1.5% 3.1% 3.1% 2.7%
Total Count
low participation 460 1037 1300 1470 4267
% within 2004 Region 100.0% 100.0% 100.0% 100.0% 100.0%
Source: 2004 Canadian Election Study
male high participation
Total (male)
12
CorrelationCoefficients:CivicDutybyRegion Table4b:CivicDutybyRegionbyGender
Asymp. Region
Value Std. Error Approx. T Approx. Sig. Gender Atlantic Quebec Ontario West Total
Nominal by Lambda Symmetric .000 .000 . . female It is every citizen's strongly agree Count 199 402 527 618 1746
duty to vote in % within 2004 Region 73.7% 73.1% 78.9% 75.4% 75.6%
Nominal It is every citizen's duty to federal elections. somewhat agree Count 51 126 97 145 419
vote in federal elections. .000 .000 . .
% within 2004 Region 18.9% 22.9% 14.5% 17.7% 18.2%
Dependent
somewhat disagree Count 12 16 26 37 91
2004 Region Dependent .000 .000 . . % within 2004 Region 4.4% 2.9% 3.9% 4.5% 3.9%
Goodman and It is every citizen's duty to strongly disagree Count 8 6 18 20 52
Kruskal tau vote in federal elections. .002 .001 .000 % within 2004 Region 3.0% 1.1% 2.7% 2.4% 2.3%
Dependent Total Count 270 550 668 820 2308
2004 Region Dependent .003 .001 .000 % within 2004 Region 100.0% 100.0% 100.0% 100.0% 100.0%
male It is every citizen's strongly agree Count 146 371 495 483 1495
duty to vote in % within 2004 Region 76.8% 76.2% 78.3% 74.3% 76.3%
federal elections. somewhat agree Count 30 90 90 105 315
Value Approx. Sig. % within 2004 Region 15.8% 18.5% 14.2% 16.2% 16.1%
somewhat disagree Count 9 16 25 36 86
Nominal by Phi .081 .001
% within 2004 Region 4.7% 3.3% 4.0% 5.5% 4.4%
Nominal Cramer's V .047 .001 strongly disagree Count 5 10 22 26 63
N of Valid Cases 4267 % within 2004 Region 2.6% 2.1% 3.5% 4.0% 3.2%
Total Count 190 487 632 650 1959
% within 2004 Region 100.0% 100.0% 100.0% 100.0% 100.0%
Cor.Coef.:CivicDutybyRegionbyGender Cor.Coef.:CivicDutybyRegionbyGender
Asymp.
2004 Gender Value Std. Error Approx. T Approx. Sig.
female Nominal by Lambda Symmetric .000 .000 . . 2004 Gender Value Approx. Sig.
Nominal It is every citizen's duty to female Nominal by Phi .094 .016
vote in federal elections. .000 .000 . . Nominal Cramer's V .054 .016
Dependent
2004 Region Dependent
N of Valid Cases 2308
.000 .000 . .
Goodman and It is every citizen's duty to male Nominal by Phi .075 .274
Kruskal tau vote in federal elections. .004 .002 .001 Nominal Cramer's V .043 .274
Dependent N of Valid Cases 1959
2004 Region Dependent .003 .001 .010
male Nominal by Lambda Symmetric .007 .018 .384 .701
Nominal It is every citizen's duty to
vote in federal elections. .000 .000 . .
Dependent
2004 Region Dependent .009 .024 .384 .701
Goodman and It is every citizen's duty to
Kruskal tau vote in federal elections. .002 .001 .351
Dependent
2004 Region Dependent .002 .001 .153
ComparingCoefficients Replication
Duty*Region
Lambda:.000(p>.05)
CramersV:.047(p=.001)
Iftheoriginalrelationshipisremainsidenticalin
eachcategoryofthecontrolvariable,thecontrol
Duty*Region*Gender variableissaidtohaveareplicating effect.
Female I.e.,theoriginalrelationshipisreplicatedineach
Lambda:.000(p>.05) categoryofthecontrolvariable
CramersV:.054(p<.05)
Male
Lambda:.007(p>.05)
CramersV:.043(p>.05)
13
Specification,suppression,distortion Explanation,interpretation
Iftheoriginalrelationshipdisappears (orissignificantly
Iftheoriginalrelationshipisstrengthened/weakenedin
diminished)inallcategoriesofthecontrolvariable,the
somecategoriesofthecontrolvariable,wesaywehave
specified therelationship. originalrelationshipmaybespurious.Inthiscase,we
I.e.,thecontrolvariablehasaspecifyingeffect haveexplainedawaytheoriginalrelationship.
Thismayrevealthatthe(absenceofthe)controlvariableis I.e.,thecontrolvariablemaybecausingvariationinboththe
suppressing thetruenatureoftherelationship. independentanddependentvariables
I.e.,thepartialrelationshipsarestrongerthantheoriginal Thenextstep:testforeachoftheserelationships.
I.e.,thec.v.isasuppressorvariable Iftheyeachexist:spuriousness
Oritmayrevealthatthe(absenceofthe)controlvariableis Iftheydont:thec.v.maybeeitherantecedentorintervening
distortingthetruenatureoftherelationship. Inthiscase,thec.v.ishavinganinterpretationeffect.
I.e.,thepartialrelationshipsarethereverseoftheoriginal
I.e.,thec.v.isadistortervariable
ConductingControlVariableAnalysis OutlineofDiscussion
Addalayertothecrosstab. PartI:TheQuantitativeTradition
Analyze DescriptiveStatistics Crosstabs PartII:AnIntroductiontoSPSS
DependentVariable:rows
IndependentVariable:columns
PartIII:UnivariateAnalysis
ControlVariable:layers PartIV:Recoding
Returntoyourrecent crosstabs andstatistics PartV:CompositeMeasures
genderandmoralconservatism
supportforprivatehealthcareandmoral PartVI:BivariateAnalysis
conservatism PartVII:InferentialStatistics
Controlforagecohort.
PartVIII:ControlVariables
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
AnIntroductionto
QuantitativeDataAnalysis&SPSS
home.cc.umanitoba.ca/~wesley
AnIntroductiontoQuantitativeDataAnalysisandSPSS Dr.JaredWesley,PoliticalStudies,UniversityofManitoba
14