Professional Documents
Culture Documents
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 1 / 394
Introduction
Mentimeter
Go to
www.menti.com
and use the code
7803 8552
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 2 / 394
Introduction: The book
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 3 / 394
Introduction: F.A.Q.
Recordings?
Yes, they will be available on Microsoft Teams. Please, download them all
before they will expire
Slides?
Yes, they will be available on Microsoft Teams (after the Lecture)
Homeworks?
Yes, up to 3 individual homeworks, but they will not be mandatory
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 4 / 394
Introduction: F.A.Q.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 5 / 394
Written exam
Both theoretical questions and practical exercises (R). However, it will
not be asked to produce R Code
While approaching the exam day, and old exam text will be given as
material. Furthermore, a simulation will be scheduled on the last days
of lectures
50% of the final score
Minimum score for access the Oral exam is 16/30
Oral exam
Mainly theoretical questions, with some clarifications on the written
exam. It might be required to solve an exercise.
50% of the final score
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 6 / 394
Introduction: The exam
Final Score
Composed by the arithmetical mean of Written exam and Oral exam
scores
Minimum final score for pass the Exam is 18/30
Disclaimer
Both the Written and the Oral exams will be held in presence
There might be exceptions according to the University policies (for
instance, if you have Covid)
However, w.r.t. the pandemic evolution, there might be unexpected
variations (i.e. only virtual mode exams)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 7 / 394
Chapter 1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 8 / 394
Why do we need statistics?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 9 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 10 / 394
Initial Observation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 11 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 12 / 394
Generating and Testing Theories
Theory
A hypothesized general principle or set of principles that explains
known findings about a topic and from which new hypotheses can be
generated
Hypothesis
A prediction from a theory
E.g. the number of people turning up for a Big Brother audition that
have narcissistic personality disorder will be higher than the general
level (1%) in the population
Falsification
The act of disproving a theory or hypothesis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 13 / 394
Table 1.1
A table of the number of people at the Big Brother audition split by
weather they had narcissistic personality disorder and whether they were
selected as contestants by the producers
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 14 / 394
The Research Process
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 15 / 394
Hypothesis
Chocolate kills dogs
Independent Variable
The proposed cause
A predictor variable
A manipulated variable (in experiments)
Chocolate in the above hypothesis
Dependent Variable
The proposed effect
An outcome variable
Measured not manipulated (in experiments)
Dogs in the above hypothesis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 16 / 394
Some important terms
Independent variable
A variable thought to be the cause of some effect. This term in usually used in
experimental research to denote a variable that the experimenter has manipulated
Dependent variable
A variable thought to be affected by changes in an independent variable. You can
think of this variable as an outcome
Predictor variable
A variable thought to predict an outcome variable. This is basically another term
for independent variable
outcome variable
A variable thought to change as a function of changes in a predictor variable. This
term could be synonymous with “dependent variable” for the sake of an easy life.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 17 / 394
Levels of Measurement
Measurement error
The discrepancy between the actual value we are trying to measure,
and the number we use to represent that value
Example:
You (in reality) weigh 80 kg
You stand on your bathroom scales and they say 83 kg
The measurement error is 3 kg
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 19 / 394
Validity
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 20 / 394
Reliability
Reliability
The ability of the measure to produce the same results under the same
conditions
Test-Retest Reliability
The ability of a measure to produce consistent results when the same
entities are tested at two different points in time
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 21 / 394
Correlational research:
Observing what naturally goes on in the world without directly
interfering with it
Cross-sectional research:
This term implies that data come from people at different age points,
with different people representing each age point
Experimental research:
One or more variable is systematically manipulated to see their effect
(alone or in combination) on an outcome variable
Statements can be made about cause and effect
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 22 / 394
Experimental Research Methods
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 23 / 394
Between-group/between-subject/independent
Different entities in experimental conditions
Repeated-measures (within-subject)
The same entities take part in all experimental conditions
Economical
Practice effects
Fatigue
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 24 / 394
Types of Variation
Systematic Variation
Differences in performance created by a specific experimental
manipulation
Unsystematic Variation
Differences in performance created by unknown factors
e.g. Age, gender, IQ, time of a day, measurement error, etc.
Randomization
Minimizes unsystematic variation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 25 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 26 / 394
Analysing Data: Histograms
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 27 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 28 / 394
Properties of Frequency Distributions
Skew
The symmetry of the distribution
Positive skew (scores bunched at low values with the tail pointing to
high values)
Negative skew (scores bunched at high values with the tail pointing to
low values)
Kurtosis
The “heaviness” of the tails
Leptokurtic = heavy tails
Platykurtic = light tails
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 29 / 394
Skew
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 30 / 394
Kurtosis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 31 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 32 / 394
A Bimodal Distribution
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 33 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 34 / 394
Central tendency: The Mean
Mean
The sum of scores divided by the number of scores
Number of friends of 11 Facebook.com users
Example
Pn
i=1 xi
X̄ = n
Pn
i=1 xi = 22+40+53+57+93+98+103+108+116+121+252 = 1063
Pn
i=1 xi 1063
X̄ = n = 11 = 96.64
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 35 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 36 / 394
The Dispersion: The interquartile range
Quartiles
The three values that split the sorted data into four equal parts
Second quartile = median
Lower quartile = median of lower half of the data
Upper quartile = median of upper half of the data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 37 / 394
z-scores
Standardising a score with respect to the other scores in the group
Expresses a score in terms of how many standard deviations it is away
from the mean
The distribution of z-scores has a mean of 0 and SD = 1
X −X̄
z= S
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 38 / 394
Properties of z-scores
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 39 / 394
Types of Hypotheses
Null hypothesis, H0
There is no effect
e.g. Big Brother contestants and members of the public will not
differ in their scores on personality disorder questionnaires
Alternative hypothesis, H1
Aka the experimental hypothesis
e.g. Big Brother contestants will score higher on personality disorder
questionnaires than members of the public
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 40 / 394
Chapter 2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 41 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 42 / 394
The Research Process
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 43 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 44 / 394
Populations and Samples
Population:
The collection of units (be they people, plankton, plants, cities, suicidal
authors, etc.) to which we want to generalize a set of findings or a
statistical model
Sample:
A smaller (but hopefully representative) collection of units from a
population used to determine truths about that population
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 45 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 46 / 394
A simple statistical model
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 47 / 394
The Mean
The mean is the sum of all scores divided by the number of scores
The mean is also the value from which the (squared) scores deviate
least (it has the least error)
Example
Pn
i=1 xi
mean(X̄ ) = n
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 48 / 394
The Mean: Example
1, 3, 4, 3, 2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 49 / 394
1 = 2.6 + errorlecturer 1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 50 / 394
Measuring the “Fit” of the model
The mean is a model of what happens in the real world: the typical
score
It is not a perfect representation of the data
How can we assess how well the mean represents reality?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 51 / 394
A perfect fit
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 52 / 394
Calculating the “Error”
deviation = xi − x̄
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 53 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 54 / 394
Use the Total Error?
We could just take the error between the mean and the data and add
them
Score Mean Deviation
1 2.6 -1.6
2 2.6 -0.6
3 2.6 0.4
3 2.6 0.4
4 2.6 1.4
Total = 0
P
(X − X̄ ) = 0
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 55 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 56 / 394
Sum of Squared Errors
(X − X̄ )2 = 5.20
P
SS =
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 57 / 394
Variance
(xi − x̄)2
P
SS 5.20
variance(s 2 ) = = = = 1.3
N −1 N −1 4
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 58 / 394
Degrees of Freedom
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 59 / 394
Standard Deviation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 60 / 394
Important Things to Remember
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 61 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 62 / 394
The SD and the Shape of a Distribution
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 63 / 394
Sample
Mean and SD describe only the sample from which they were calculated
Population
Mean and SD are intended to describe the entire population (very rare in
psychology)
Sample to Population
Mean and SD are obtained from a sample, but are used to estimate the
mean and SD of the population (very common in psychology)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 64 / 394
Samples vs. Populations
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 65 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 66 / 394
Confidence Intervals
True mean
15 million toothpicks
Sample mean
17 million toothpicks
Interval estimate
12 to 22 million (contains true value)
16 to 18 million (misses true value)
CIs constructed such that 95% contain the true value
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 67 / 394
Confidence Intervals
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 68 / 394
Confidence Intervals
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 69 / 394
Test Statistics
test statistic =
variance explained by the model effect
=
variance not explained by the model error
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 70 / 394
One- and Two-Tailed Tests
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 71 / 394
Type I error
occurs when we believe that there is a genuine effect in our
population when, in fact, there isn’t
The probability is the α-level (usually .05)
Type II error
occurs when we believe that there is no effect in the population when,
in reality, there is
The probability is the β-level (often .2)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 72 / 394
What does Statistical Significance tell us?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 73 / 394
Effect Sizes
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 74 / 394
Effect Size Measures
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 75 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 76 / 394
Chapter 5
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 77 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 78 / 394
Assumptions
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 79 / 394
Assessing Normality
Graphical displays
Q-Q plot (or P-P plot)
Histogram
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 80 / 394
Assessing Normality
Values of skew/kurtosis
0 in a normal distribution
Convert to z (by dividing value by SE)
Kolmogorov-Smirnov test
Tests if data differ from a normal distribution
Significant = non-normal data
NON-significant = normal data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 81 / 394
Normality Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 82 / 394
The Q-Q plot
To draw a Q-Q plot of the hygiene scores for day 1 of the music
festival
1 qqplot . day1 <- qplot ( sample = dlf $ day1 , stat = " qq " )
2 qqplot . day1
To draw a Q-Q plot of the hygiene scores for day 2 of the music
festival
1 qqplot . day2 <- qplot ( sample = dlf $ day2 , stat = " qq " )
2 qqplot . day2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 83 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 84 / 394
Assessing Skew and Kurtosis
Using by()
1 by ( data = rexam $ exam , INDICES = rexam $ uni , FUN = describe )
Using stat.desc()
1 by ( data = rexam $ exam , INDICES = rexam $ uni , FUN = stat . desc )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 85 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 86 / 394
Assessing Skew and Kurtosis
We can also use describe() and stat.desc() with more than one variable at
the same time using cbind():
1 describe ( cbind ( dlf $ day1 , dlf $ day2 , dlf $ day3 ) )
2
3 stat . desc ( cbind ( dlf $ day1 , dlf $ day2 , dlf $ day3 ) ,
4 basic = FALSE , norm = TRUE )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 87 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 88 / 394
Another Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 89 / 394
Assessing Normality
Output:
1 > Shapiro - Wilk normality test
2 data : rexam $ exam
3 W = 0.9613 , p - value = 0.004991
4 > Shapiro - Wilk normality test
5 data : rexam $ numeracy
6 W = 0.9244 , p - value = 2.424 e -05
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 90 / 394
Assessing Normality
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 91 / 394
Assessing Normality
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 92 / 394
Assessing Normality
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 93 / 394
Q-Q Plots
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 94 / 394
Assessing Homogeneity of Variance
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 95 / 394
Homogeneity of Variance
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 96 / 394
Assessing Homogeneity of Variance with R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 97 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 98 / 394
Fixing Data Problems
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 99 / 394
log transformation
1 dlf $ logday1 <- log ( dlf $ day1 )
2 dlf $ logday1 <- log ( dlf $ day1 + 1)
Reciprocal transformation
1 dlf $ recday1 <- 1 / ( dlf $ day1 + 1)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 100 / 394
The Effect of Transformations
Distributions of the hygiene data on da1 and day2 after various
transformations
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 101 / 394
To Transform . . . or not
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 102 / 394
Robust Methods: Examples
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 103 / 394
Chapter 6
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 104 / 394
Aims
Measuring relationships
Scatterplots
Covariance
Pearson’s correlation coefficient
Nonparametric measures
Spearman’s rho
Kendall’s tau
Interpreting correlations
Causality
Partial correlations
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 105 / 394
What is a Correlation?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 106 / 394
Very small relationship
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 107 / 394
Positive relationship
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 108 / 394
Negative relationship
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 109 / 394
Measuring relationships
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 110 / 394
Measuring relationships
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 111 / 394
Measuring relationships
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 112 / 394
Revision of variance
The variance tells us by how much scores deviate from the mean for a
single variable
It is closely linked to the sum of squares
Covariance is similar: it tells how much, scores on two variables, differ
from their respective means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 113 / 394
Variance
The variance tells us by how much scores deviate from the mean for a
single variable
It is closely linked to the sum of squares
(xi − x̄)2
P
variance =
PN − 1
(xi − x̄)(xi − x̄)
=
N −1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 114 / 394
Covariance
Calculate the error between the mean and each subject’s score for the
first variable (x)
Calculate the error between the mean and their score for the second
variable (y)
Multiply these error values
Add these values and you get the cross product deviations
The covariance is the average cross-product deviations:
P
(xi − x̄)(yi − ȳ )
cov (x, y ) =
N −1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 115 / 394
Covariance
P
(xi − x̄)(yi − ȳ )
cov (x, y ) =
N −1
(−0.4)(−3) + (−1.4)(−2) + (−1.4)(−1) + (0.6)(2) + (2.6)(4)
=
4
1.2 + 2.8 + 1.4 + 1.2 + 10.4
=
4
17
=
4
= 4.25
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 116 / 394
Problems with covariance
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 117 / 394
covxy
r =
s s
Px y
(xi − x̄)(yi − ȳ )
=
(N − 1)sx sy
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 118 / 394
The Correlation Coefficient
covxy
r =
sx sy
4.25
=
1.67 · 2.92
= .87
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 119 / 394
Correlation: Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 120 / 394
General Procedure for Correlations using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 121 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 122 / 394
Reporting the Results
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 123 / 394
It is an effect size
±0.1 small effect
±0.3 medium effect
±0.5 large effect
Coefficient of determination, r 2
By squaring the value of r you get the proportion of variance in one
variable shared by the other
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 124 / 394
Correlation and Causality
Direction of causality
Correlation coefficients say nothing about which variable causes the other
to change
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 125 / 394
Non-parametric Correlation
Spearman’s rho
Pearson’s correlation on the ranked data
Kendall’s tau
Better than Spearman’s for small samples
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 126 / 394
Spearman’s Rho Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 127 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 128 / 394
Partial and Semi-partial Correlations
Partial correlation
Measures the relationship between two variables, controlling for the effect
that a third variable has on them both
Semi-partial correlation
Measures the relationship between two variables controlling for the effect
that a third variable has on only one of the others
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 129 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 130 / 394
Partial and Semi-partial Correlations
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 131 / 394
We can then see the partial correlation and the value of R 2 in the
console by executing:
1 pc
2 pc ^2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 132 / 394
Doing Partial Correlation using R
Basically, you enter an object that you have created with pcor() (or
you can put the pcor() command directly into the function):
1 pcor . test ( pc , 1 , 103)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 133 / 394
Chapter 7
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 134 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 135 / 394
What is Regression?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 136 / 394
Describing a Straight Line
Yi = bo + bi Xi + i
b0
Intercept (value of Y when X = 0)
Point at which the regression line crosses the Y-axis (ordinate)
bi
Regression coefficient for the predictor
Gradient (slope) of the regression line
Direction/strength of relationship
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 137 / 394
This graph shows a scatterplot of some data with a line representing the general trend.
The vertical lines (dotted) represent the differences (or residuals)
between the line and the actual data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 138 / 394
How good is the model?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 139 / 394
SST
Total variability (variability between scores and the mean)
SSR
Residual/error variability
(variability between the regression model and the actual data)
SSM
Model variability
(difference in variability between the model and the mean)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 141 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 142 / 394
Testing the Model: ANOVA
MSM
F =
MSR
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 143 / 394
R2
The proportion of variance accounted for by the regression model
The Pearson Correlation Coefficient Squared
SSM
R2 =
SST
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 144 / 394
Regression: An Example
Data
200 Different album releases
Outcome variable
Sales (CDs and downloads) in the week after release
Predictor variable
The amount (in units of 1000$) spent promoting the record before release
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 145 / 394
Regression in R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 146 / 394
Regression in R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 147 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 148 / 394
Using the Model
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 149 / 394
Multiple Regression
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 150 / 394
Multiple regression fit
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 151 / 394
Adjusted R 2
n−1 n−2
R 2 = 1 − [( n−k−1 )( n−k−2 )( n+1 2
n )](1 − R )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 152 / 394
T-test on regression coefficients
bobserved − bexpected
t =
SEb
bobserved
=
SEb
t ∼ T (N − p − 1)
Confidence intervals for b
Hypothesis testing:
H0 : b = 0vs.H1 : b > 0, H1 : b < 0, H1 : b 6= 0
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 153 / 394
Methods of regression
Hierarchical
Known predictors (previous research) at first, new predictors afterwards
(forced entry, stepwise)
Forced entry
Model suggested from a theory
Stepwise
Forward (from null or intercept-only model)
Backward (from full or complete model)
Stepwise (like forward but admitting subsequent elimination)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 154 / 394
Checking assumptions
Variables types: All Xs (predictors) must be quantitative or
categorical, and Y (response variable) must be quantitative,
continuous and unbounded
Non-zero variance: The predictors should have some variation in value
No perfect multicollinearity
Predictors are uncorrelated with ’external variables’: there should be
no external variables that correlate with any of the Xs included in the
regression model. Obviously, if external variables do correlate with
some Xs, then the model become unreliable (because other variables
can predict Y just as well).
Homoscedasticity
Independent errors (No autocorrelation)
Normally distributed errors
Independence (among different values of Y)
Linearity
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 155 / 394
Checking assumptions
When the assumptions of regression are met, the model that we get
for a sample can be accurately applied to the population of interest
(the coefficients and parameters of the regression equation are said to
be unbiased )
What an unbiased model does tell us is that on average the
regression model from the sample is the same as the population model
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 156 / 394
Multicollinearity
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 157 / 394
(yi − yˆi )2
P
Mean Square Error: MSE =
n
Training set / Test set (Training set is often called also “Validation”
set)
(K-fold, Leave-One-Out) Cross Validation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 158 / 394
K-fold Cross Validation
Since data are often scarce, there might not be enough to set aside
for a validation sample
To work around this issue k-fold CV works as follows:
1 Split the sample into k subsets of equal size
2 For each fold estimate a model on all the subsets except one
3 Use the left out subset to test the model, by calculating a CV metric of
choice
4 Average the CV metric across subsets to get the CV error
This has the advantage of using all data for estimating the model.
Common used K values are K=1, 5, 10. K=1 is a particular case
called LOOCV (Leave-One-Out CV)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 159 / 394
Chapter 8
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 160 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 161 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 162 / 394
Logistic with one predictor
1
P(Y ) =
1 + exp−(b0 +b1 X1 +)
Outcome
We predict the probability of the outcome occurring
b0 and b1
Can be thought of in much the same way as multiple regression
Note the normal regression equation forms part of the logis,c regression
equation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 163 / 394
1
P(Y ) =
1 + exp−(b0 +b1 X1 +b2 X2 +···+bn Xn +)
Outcome
We still predict the probability of the outcome occurring
Differences
Note the multiple regression equation forms part of the logistic
regression equation
This part of the equation expands to accommodate additional
predictors
expresses the multiple linear regression equation in logarithmic terms
(called the logit) and thus overcomes the problem of violating the
assumption of linearity
The resulting value from the equation varies between 0 and 1. A value
close to 0 means that Y is very unlikely to have occurred, and a value
close to 1 means that Y is very likely to have occurred.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 164 / 394
Assessing the Model
N
X
log −likelihood = [Yi ·ln(P(Yi ))+(1−Yi )·ln(1−P(Yi ))]
i=1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 165 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 166 / 394
Assessing predictors: the Wald Statistic
b
Wald =
SEb
Similar to t-statistic in regression
The Wald statistic follows a normal distribution (a.k.a. z-statistic)
Test the null hypothesis that b=0
Is biased when b is large
Better to look at likelihood ration statistics (change in LL)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 167 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 168 / 394
Assessing predictors: the Odds Ratio
P(event)
odds =
P(no event)
1
P(event Y ) =
1 + exp−(b0 +b1 X1 )
P(no eventY ) = 1 − P(event Y )
odds after a unit change in the predictor
∆odds =
original odds
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 169 / 394
Methods of Regression
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 170 / 394
Things that can go wrong
Unique problems
Incomplete information
Complete separation
Overdispersion
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 171 / 394
Incomplete information
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 172 / 394
Incomplete information
Complete separation
the outcome variable can be perfectly predicted by one variable or a
combination of variables:
this problem often arises when too many variables are fitted to too
few cases
Often the only satisfactory solution is to collect more data
sometimes a neat answer is found by using a simpler model
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 173 / 394
Overdispersion
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 174 / 394
An example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 175 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 176 / 394
Logistic Regression Analysis using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 177 / 394
1 Call :
2 glm ( formula = Cured ~ Intervention , family = binomial () , data = eelData )
3
4 Deviance Residuals :
5 Min 1Q Median 3Q Max
6 -1.5940 -1.0579 0.8118 0.8118 1.3018
7
8 Coefficients :
9 Estimate Std . Error z value Pr ( >| z |)
10 ( Intercept ) -0.2877 0.2700 -1.065 0.28671
11 I n t e r v e n t i o n I n t e r v e n t i o n 1.2287 0.3998 3.074 0.00212 * *
12
13 ( Dispersion parameter for binomial family taken to be 1)
14
15 Null deviance : 154.08 on 112 degrees of freedom
16 Residual deviance : 144.16 on 111 degrees of freedom
17 AIC : 148.16
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 178 / 394
Assessing the model: R
s
z 2 − 2df
R=
−2LL(baseline)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 179 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 180 / 394
Assessing the model: R 2
AIC = −2LL + 2k
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 182 / 394
Writing a function to compute R 2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 183 / 394
To use the function on our model, we simply place the name of the
logistic regression model (in this case eelModel.1) in the function and
execute:
1 l o g is t ic P s eu d o R 2 s ( eelModel .1)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 184 / 394
Calculating the Odds Ratio
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 185 / 394
1 Call :
2 glm ( formula = Cured ~ Intervention + Duration , family = binomial () , data = eelData )
3
4 Deviance Residuals :
5 Min 1Q Median 3Q Max
6 -1.6025 -1.0572 0.8107 0.8161 1.3095
7
8 Coefficients :
9 Estimate Std . Error z value Pr ( >| z |)
10 ( Intercept ) -0.234660 1.220563 -0.192 0.84754
11 I n t e r v e n t i o n I n t e r v e n t i o n 1.233532 0.414565 2.975 0.00293 * *
12 Duration -0.007835 0.175913 -0.045 0.96447
13
14 ( Dispersion parameter for binomial family taken to be 1)
15
16 Null deviance : 154.08 on 112 degrees of freedom
17 Residual deviance : 144.16 on 110 degrees of freedom
18 AIC : 150.16
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 186 / 394
Comparing the models
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 187 / 394
Casewise diagnostics
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 188 / 394
Summary
The overall fit of the final model is shown by the deviance statistic
and its associated chi-square statistic.
If the significance of the chi-square statistic is less than .05, then the
model is a significant fit to the data.
Check the table labelled coefficients to see which variables
significantly predict the outcome.
For each variable in the model, look at the z-statistic and its
significance (which again should be below .05).
Use the Odds Ratio for interpretation. You can obtain this using
exp(model$coefficients), where model is the name of your model.
If the value is greater than 1 then as the predictor increases, the odds
of the outcome occurring increase.
A value less than 1 indicates that as the predictor increases, the odds
of the outcome occurring decrease.
For the aforementioned interpretation to be reliable the confidence
interval of the Odds Ratio should not cross 1!
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 189 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 190 / 394
Chapter 9: Comparing Two Means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 191 / 394
Aims
t-tests:
Independent
Dependent (aka paired, matched)
Rationale for the tests
Assumptions
t-tests as a GLM
Interpretation
Calculating an effect size
Reporting results
Robust methods
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 192 / 394
Experiments
The simplest form of experiment that can be done is one with only
one independent variable that is manipulated in only two ways and
only one outcome is measured.
More often than not, the manipulation of the independent variable
involves having an experimental condition and a control
E.g., Is the movie Scream 2 scarier than the original Scream? We could
measure heart rates (which indicate anxiety) during both films and
compare them
This situation can be analysed with a t-test
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 193 / 394
Experiments
Independent t-test
Compares two means based on independent data.
E.g., data from different groups of people.
Dependent t-test
Compares two means based on related data.
E.g., Data from the same people measured at different times.
Data from “matched” samples.
Significance testing
Testing the significance of Pearson’s correlation coefficient
Testing the significance of b in regression.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 194 / 394
Rationale for the t-test
Two samples of data are collected and the sample means calculated.
These means might differ by either a little or a lot
If the samples come from the same population, then we expect their
means to be roughly equal. Although it is possible for their means to
differ by chance alone, we would expect large differences between
sample means to occur very infrequently
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 195 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 197 / 394
Ai = b0 + b1 Gi + i
anxietyi = b0 + b1 groupi + i
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 198 / 394
Picture Group
X̄Picture = b0 + (b1 × 0)
b0 = X̄Picture
b0 = 40
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 199 / 394
X̄Real = b0 + (b1 × 1)
X̄Real = X̄Picture + b1
b1 = X̄Real − X̄Picture
= 47 − 40 = 7
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 200 / 394
Output from a Regression
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 201 / 394
Both the independent t-test and the dependent t-test are parametric tests based
on the normal distribution. Therefore, they assume:
The sampling distribution is normally distributed. In the dependent
t-test this means that the sampling distribution of the differences
between scores should be normal, not the scores themselves.
Data are measured at least at the interval level.
The independent t-test, because it is used to test different groups of people, also
assumes:
Variances in these populations are roughly equal (homogeneity of
variance).
Scores in different treatment conditions are independent (because they
come from different people).
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 202 / 394
The Independent t-test
X̄1 − X̄2
t=s
sp2 sp2
+
n1 n2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 203 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 204 / 394
The Independent t-test Using R
If you have the data for different groups stored in two columns:
1 newModel <- t . test ( scores group 1 , scores group 2 , paired = FALSE / TRUE )
2
3 ind . t . test <- t . test ( spiderWide $ real , spiderWide $ picture )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 205 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 206 / 394
Calculating an Effect Size
s
t2
r =
t 2 + df
s
(−1.681)2
r =
(−1.681)2 + 22
r
2.826
=
24.826
= 0.34
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 207 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 208 / 394
The Dependent t-test
D̄ − µD
t= s √
D/ N
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 209 / 394
Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 210 / 394
The Dependent t-test Using R
If we had our data stored in long format so that our group scores are
in a single column and group membership is expressed in a second
column:
1 dep . t . test <- t . test ( Anxiety ~ Group , data = spiderLong , paired = TRUE )
2 dep . t . test
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 211 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 212 / 394
Calculating the Effect Size
We can compute this value in the same way that we did for the
independent t-test by executing:
1 t <- dep . t . test $ statistic [[1]]
2
3 df <- dep . t . test $ parameter [[1]]
4
5 r <- sqrt ( t ^2 / ( t ^2+ df ) )
6
7 round (r , 3)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 213 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 214 / 394
When Assumptions are Broken
Dependent t-test
Mann–Whitney test
Wilcoxon rank-sum test
Independent t-test
Wilcoxon signed-rank test
Robust tests
Bootstrapping
Trimmed means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 215 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 216 / 394
Robust Methods to Compare Independent Means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 217 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 218 / 394
Robust Methods to Compare Dependent Means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 219 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 220 / 394
Robust Methods to Compare Dependent Means
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 221 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 222 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 223 / 394
When we want to compare means we can use a t-test. This test has
limitations:
You can compare only 2 means: often we would like to compare means
from 3 or more groups
It can be used only with one predictor/independent variable
ANOVA
Compares several means
Can be used when you have manipulated more than one independent
variable
It is an extension of regression (the general linear model)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 224 / 394
Why don’t use a lots of t-Tests?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 225 / 394
k!
C=
2(k − 2)!
5! 120
C= = = 10
2(5 − 2)! 2 · (3 · 2 · 1)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 226 / 394
Example: Viagra dataset
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 227 / 394
ANOVA as Regression
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 228 / 394
Placebo group
libidoi = b0
b0 = X̄placebo
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 229 / 394
libidoi = b0 + b2
X̄high = X̄placebo + b2
b2 = X̄high − X̄placebo
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 230 / 394
Low dose group
libidoi = b0 + b1
X̄low = X̄placebo + b1
b1 = X̄low − X̄placebo
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 231 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 232 / 394
Experiments vs. Correlation
ANOVA in regression:
Used to assess whether the regression model is good at predicting an
outcome.
ANOVA in experiments:
Used to see whether experimental manipulations lead to differences in
performance on an outcome (DV)
By manipulating a predictor variable can we cause (and therefore
predict) a change in behaviour?
Asking the same question, but in experiments we systematically
manipulate the predictor, in regression we don’t.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 233 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 234 / 394
Rationale to Experiments
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 235 / 394
Rationale to Experiments
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 236 / 394
Theory behind ANOVA
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 237 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 238 / 394
ANOVA by hand
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 239 / 394
The Data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 240 / 394
The Data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 241 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 242 / 394
Step 1: calculate SST
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 243 / 394
Degrees of Freedom
Degrees of freedom (df) are the number of values that are free to
vary.
Think about rugby teams!
In general, the df are one less than the number of values used to
calculate the SS.
dfT = N − 1 = 15 − 1 = 14
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 244 / 394
Model Sum of Squares SSM
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 245 / 394
X
SSM = ni (x̄i − x̄grand )2
= 20.135
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 246 / 394
Model Degrees of Freedom
dfM = k − 1 = 3 − 1 = 2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 247 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 248 / 394
Step 3: calculate SSR
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 249 / 394
2 2 2
SSR = sgroup1 (n1 − 1) + sgroup2 (n2 − 1) + sgroup3 (n3 − 1)
= 6.8 + 6.8 + 10
= 23.60
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 250 / 394
Residual Degrees of Freedom
= (5 − 1) + (5 − 1) + (5 − 1)
= 12
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 251 / 394
Double Check
43.74 = 43.74
14 = 2 + 12
14 = 14
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 252 / 394
Step 4: calculate the Mean Squared Error
SSR 23.60
MSR = = = 1.967
dfR 12
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 253 / 394
MSM
F =
MSR
MSM 10.067
F = = = 5.12
MSR 1.967
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 254 / 394
Step 6: Construct a Summary Table
Source SS df MS F
Model 20.14 2 10.067 5.12*
Residual 23.60 12 1.967
Total 43.74 14
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 255 / 394
ANOVA Assumptions
Homogeneity of variance:
The variances of the groups are supposed to be equal. This assumption
can be tested using Levene’s test (see section 5.7.1). If Levene’s test is
significant then we can say that the variances are significantly different.
This would mean that we had violated one of the assumptions of ANOVA
and we would have to take steps to rectify this matter. However, when
sample sizes are unequal, ANOVA is not robust to violations of
homogeneity of variance
Non-normality
When group sizes are equal the F-statistic can be quite robust to
violations of normality
Independence
when this assumption is broken (i.e., observations across groups are
correlated) then the Type I error rate is substantially inflated.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 256 / 394
Why Use Follow-Up Tests?
The F-ratio tells us only that the experiment was successful (i.e.
group means were different)
It does not tell us specifically which group means differ from which.
We need additional tests to find out where the group differences lie.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 257 / 394
How?
Multiple t-tests
We saw earlier that this is a bad idea
Orthogonal contrasts/comparisons
Hypothesis driven
Planned a priori
Post hoc tests
Not planned (no hypothesis)
Compare all pairs of means
Trend analysis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 258 / 394
Planned Contrasts
Basic idea:
The variability explained by the model (experimental manipulation,
SSM ) is due to participants being assigned to different groups.
This variability can be broken down further to test specific hypotheses
about which groups might differ.
We break down the variance according to hypotheses made a priori
(before the experiment).
It’s like cutting up a cake (yum yum!)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 259 / 394
Independent
Contrasts must not interfere with each other (they must test unique
hypotheses).
K-1
You should always end up with one less contrast than the number of
groups.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 260 / 394
Generating Hypotheses
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 261 / 394
Generating Hypotheses
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 262 / 394
How do I Choose Contrasts?
Big hint:
In most experiments we usually have one or more control groups.
The logic of control groups dictates that we expect them to be
different from groups that we’ve manipulated.
The first contrast will always be to compare any control groups (chunk
1) with any experimental conditions (chunk 2).
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 263 / 394
Hypotheses
Hypothesis 1:
People who take Viagra will have a higher libido than those who don’t.
placebo 6= (low, high)
Hypothesis 2:
People taking a high dose of Viagra will have a greater libido than
those taking a low dose.
low 6= high
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 264 / 394
Planned Comparisons
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 265 / 394
Another Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 266 / 394
Another Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 267 / 394
Rule 1
Groups coded with positive weights compared to groups coded with
negative weights.
Rule 2
The sum of weights for a comparison should be zero.
Rule 3
If a group is not involved in a comparison, assign it a weight of zero.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 268 / 394
Defining contrasts using weights Coding Planned
Contrasts: Rules
Rule 4
For a given contrast, the weights assigned to the group(s) in one chunk of
variation should be equal to the number of groups in the opposite chunk
of variation.
Rule 5
If a group is singled out in a comparison, then that group should not be
used in any subsequent contrasts.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 269 / 394
Defining contrasts
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 270 / 394
Defining contrasts
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 271 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 272 / 394
Orthogonal contrasts for the Viagra data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 273 / 394
X high + X low
3b1 = − X placebo
2
1 X high + X low
b1 = − X placebo
3 2
X high + X low
3b1 = − X placebo
2
5 + 3.2
= − 2.2
2
= 1.9
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 274 / 394
Orthogonal contrasts for the Viagra data
b2 = X high − b1 − b0
1 X high + X low X high + X low + X placebo
b2 = X high − − X placebo −
3 2 3
X high + X low
3b2 = 3X high − − X placebo − X high + X low + X placebo
2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 275 / 394
1
b2 = (X high − X low ) (1)
2
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 276 / 394
Orthogonal contrasts for the Viagra data: output
F-statistic unchanged
The intercept is the “grand mean”
The regression coefficient for contrast1 is one-third of the difference
between the average of the experimental conditions and the control
condition
The regression coefficient for contrast2 is half of the difference
between the experimental groups experimental groups were
significantly different from the control (p < .05) but that the
experimental groups were not significantly different (p > .05)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 277 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 278 / 394
Standard contrasts
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 279 / 394
Using lm():
1 viagraModel <- lm ( libido ~ dose , data = viagraData )
Using aov():
1 viagraModel <- aov ( libido ~ dose , data = viagraData )
2
3 summary ( viagraModel )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 280 / 394
Output from aov()
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 281 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 282 / 394
When variances are not equal across groups
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 283 / 394
Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 284 / 394
Robust ANOVA
Require the data to be in wide format rather than the long format
We can reformat the data using unstack():
1 viagraWide <- unstack ( viagraData , libido ~ dose )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 285 / 394
viagraWide
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 286 / 394
Robust ANOVA
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 287 / 394
Robust outputs
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 288 / 394
Planned Contrasts using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 289 / 394
Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 290 / 394
Polynomial Contrasts: Trend Analysis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 291 / 394
Trend Analysis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 292 / 394
Trend Analysis: Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 293 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 294 / 394
Post Hoc Tests
α
Bonferroniα =
number of tests
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 295 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 296 / 394
Post Hoc Tests Recommendations
How you conduct post hoc tests in R depends on which test you’d
like to do.
Bonferroni and related methods such as the Holm and
Benjamini–Hochberg (BH) variants are done using the
pairwise.t.test() function, which is part of the R base system.
However, Tukey and Dunnett’s test can be done using the glht()
function in the multcomp package.
Finally, Wilcox (2005) has some robust methods implemented in his
functions lincon() and mcpp20().
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 297 / 394
1 pairwise . t . test ( viagraData $ libido , viagraData $ dose , p . adjust . method = " bonferroni " )
2
3 pairwise . t . test ( viagraData $ libido , viagraData $ dose , p . adjust . method = " BH " )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 298 / 394
Tukey
For the Viagra data, we can obtain Tukey post hoc tests by executing:
1 postHocs <- glht ( viagraModel , linfct = mcp ( dose = " Tukey " ) )
2 summary ( postHocs )
3 confint ( postHocs )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 299 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 300 / 394
Robust post hoc tests
1 lincon ( viagraWide )
2 mcppb20 ( viagraWide )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 301 / 394
√
r
SSM
A simple measure of effect size is: r = R2 =
SST
r is slightly biased because it is based purely on sums of squares from
the sample and no adjustment is made for the fact that we’re trying
to estimate the effect size in the population. Therefore,
omega-squared is often used instead:
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 302 / 394
Chapter 15: Non-parametric Tests
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 303 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 304 / 394
When to use Non-parametric Tests
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 305 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 306 / 394
Ranking Data
The test works on the principle of ranking the data for each group:
Lowest score = a rank of 1
Next highest score = a rank of 2, and so on.
Tied ranks are given the same rank: the average of the potential ranks
For an unequal group size
The test statistic (Ws) = sum of ranks in the group that contains the
least people
For an equal group size
Ws = the value of the smaller summed rank
Add up the ranks for the two groups and take the lowest of these
sums to be our test statistic
The analysis is carried out on the ranks rather than the actual data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 307 / 394
An Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 308 / 394
Ranking the Depression Scores for Wednesday and Sunday
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 309 / 394
Provisional Analysis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 310 / 394
Running the Analysis Using R Commander
The Nonparametric Tests menu in R Commander and the dialog box for the Wilcoxon
test for independent samples
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 311 / 394
If you have the data for different groups stored in a single column:
1 newModel <- wilcox . test ( outcome ~ predictor , data = dataFrame , paired = FALSE / TRUE )
However, if you have the data for different groups stored in two
columns:
1 newModel <- wilcox . test ( scores group 1 , scores group 2 , paired = FALSE / TRUE )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 312 / 394
Running the Analysis Using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 313 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 314 / 394
Reporting the results
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 315 / 394
Uses:
To compare two sets of scores, when these scores come from the same
participants.
Imagine the experimenter in the previous example was interested in
the change in depression levels for each of the two drugs.
We still have to use a non-parametric test because the distributions of
scores for both drugs were non-normal on one of the two days
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 316 / 394
Ranking Data in the Wilcoxon
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 317 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 318 / 394
Running the Analysis Using R Commander
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 319 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 320 / 394
Running the Analysis Using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 321 / 394
Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 322 / 394
Reporting the results
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 323 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 324 / 394
Kruskal–Wallis Theory
Once the sum of ranks has been calculated for each group, the test
statistic, H, is calculated as:
X R2 k
12 i
H= − 3(N + 1)
N(N − 1) ni
i=1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 325 / 394
Does eating soya cause fertility problems? Does that affect the sperm
count?
Variables:
Outcome: sperm (millions)
IV: Number of soya meals per week:
No Soya meals
1 Soya meal
4 soya meals
7 soya meals
Participants: 80 males (20 in each group)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 326 / 394
Data for the Soya Example with Ranks
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 327 / 394
Provisional Analysis
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 328 / 394
Doing the Kruskal–Wallis Test using R Commander
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 329 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 330 / 394
Output from the Kruskal–Wallis test
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 331 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 332 / 394
Post Hoc Tests for the Kruskal–Wallis Test
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 333 / 394
One of the problems with comparing every group against all others is
that have to be quite strict about accepting a difference as significant
otherwise we will inflate the Type I error rate. To reduce this problem
we could use more focussed comparisons.
In this example, we have a control group that had no soya meals. As
such, a nice succinct set of comparisons would be to compare each
group against the control:
Test 1: one soya meal per week compared to no soya meals
Test 2: four soya meal per week compared to no soya meals
Test 3: seven soya meal per week compared to no soya meals
Yo compare each group to the no-soya group (using a two-tailed test)
we simply execute:
1 kruskalmc ( Sperm ~ Soya , data = soyaData , cont = ’two - tailed ’)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 334 / 394
Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 335 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 336 / 394
Jonckheere–Terpstra Test Using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 337 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 338 / 394
Theory of Friedman’s ANOVA
The theory for Friedman’s ANOVA is much the same as the other
tests: it is based on ranked data.
Once the sum of ranks has been calculated for each group, the test
statistic, Fr , is calculated as:
k
" #
12 X
Fr = Ri2 − 3N(k + 1)
Nk(k + 1)
i=1
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 339 / 394
Example
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 340 / 394
Diet Data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 341 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 342 / 394
Friedman’s ANOVA Using R
To run the Friedman test we simply input the name of our dataframe,
but within the as.matrix() function, which converts it to a matrix.
In this example, we would execute:
1 friedman . test ( as . matrix ( dietData ) )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 343 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 344 / 394
Post Hoc Tests for Friedman’s ANOVA
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 345 / 394
To Sum Up. . .
When data violate the assumptions of parametric tests we can
sometimes find a non-parametric equivalent
Usually based on analysing the ranked data
Kruskal–Wallis test
Compares more than two independent groups of scores
Friedman’s test
Compares more than two dependent groups of scores
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 346 / 394
Chapter 17: Principal Components Analysis and Reliability
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 347 / 394
Aims
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 348 / 394
When and Why?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 349 / 394
R-Matrix
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 350 / 394
What is a Factor?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 351 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 352 / 394
Mathematical Representation
Y = b1 X1 + b2 X2 + · · · + bn Xn
Factori = b1 Variable1 + b2 Variable2 + · · · + bn Variablen
Y = b1 X1 + b2 X2 + · · · + bn Xn
Sociability = b1 Talk1 + b2 Social Skills + b3 Interest +
+b4 Talk2 + b5 Selfishb6 Liar
Consideration = b1 Talk1 + b2 Social Skills + b3 Interest +
+b4 Talk2 + b5 Selfishb6 Liar
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 353 / 394
Factor Loadings
0.87 0.01
0.96 −0.03
0.92 0.04
A=
0.00
0.82
−0.10 0.75
0.09 0.70
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 354 / 394
The R anxiety questionnaire (RAQ)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 355 / 394
Initial Considerations
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 356 / 394
Further Considerations
Determinant:
indicator of multicollinearity
should be greater than 0.00001
Kaiser–Meyer–Olkin (KMO):
measures sampling adequacy
should be greater than .5
Bartlett’s test of sphericity:
yeasts whether the R-matrix is an identity matrix
should be significant at p < .05
Anti-image matrix:
measures of sampling adequacy on diagonal,
off-diagonal elements should be small
Reproduced:
correlation matrix after rotation
most residuals should be < |0.05|
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 357 / 394
Common variance:
Variance that a variable shares with other variables
Unique variance:
Variance that is unique to a particular variable
The proportion of common variance in a variable is called the
communality
communality = 1, all variance shared
communality = 0, no variance shared
0 < communality < 1 = some variance shared
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 358 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 359 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 360 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 361 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 362 / 394
Graphical Representation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 363 / 394
Finding Factors
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 364 / 394
Initial Preparation and Analysis
We want to include all of the variables in our data set in our factor
analysis.
We can calculate the correlation matrix:
1 raqMatrix <- cor ( raqData )
2 round ( raqMatrix , 2)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 365 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 366 / 394
Factors Extraction
Kaiser’s criterion
Kaiser (1960): retain factors with eigenvalues > 1
Scree plot
Cattell (1966): use “point of inflexion” of the scree plot
Which rule?
Use Kaiser’s criterion when
less than 30 variables, communalities after extraction > .7
sample size > 250 and mean communality ≥ .6
Scree plot is good if sample size is > 200.
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 367 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 368 / 394
Principal Components Model
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 369 / 394
Examples of scree plots for data that probably have two underlying
factors
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 370 / 394
The Scree Plot for the RAQ Data
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 372 / 394
Principal Components Model: Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 373 / 394
Residuals
Check the residuals and make sure that fewer than 50% have absolute
values greater than 0.05, and that the model fit is greater than 0.90.
Execute the function below:
1 residual . stats <- function ( matrix ) {
2 residuals <- as . matrix ( matrix [ upper . tri ( matrix ) ])
3 large . resid <- abs ( residuals ) > 0.05
4 num b er La r g eR esi d <- sum ( large . resid )
5 propL argeResid <- n um b er L ar g e R e si d s / nrow ( residuals )
6 rmsr <- sqrt ( mean ( residuals ^2) )
7
8 cat ( " Root means squared residual = " , rmsr , " \ n " )
9 cat ( " Number of absolute residuals > 0.05 = " , numberLargeResids , " \ n " )
10 cat ( " Proportion of absolute residuals > 0.05 = " , propLargeResid , " \ n " )
11 hist ( residuals )
12 }
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 374 / 394
Residuals
Or:
1 residual . stats ( factor . residuals ( raqMatrix , pc2 $ loadings ) )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 375 / 394
Residuals
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 376 / 394
Rotation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 377 / 394
Rotation
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 378 / 394
Orthogonal Rotation (varimax)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 379 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 380 / 394
Orthogonal Rotation (varimax)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 381 / 394
As with the previous model, we can look at the factor loadings from
this model in a nice easy-to-digest format by executing:
1 print . psych ( pc4 , cut = 0.3 , sort = TRUE )
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 382 / 394
Oblique Rotation (oblimin)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 383 / 394
Important!
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 384 / 394
Reliability
Test–retest method
What about practice effects/mood states?
Alternate form method
Expensive and impractical
Split-half method
Splits the questionnaire into two random halves, calculates scores and
correlates them
Cronbach’s alpha
Splits the questionnaire into all possible halves, calculates the scores,
correlates them and averages the correlation for all splits (well, sort of
...)
Ranges from 0 (no reliability) to 1 (complete reliability)
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 385 / 394
Cronbach’s Alpha
var1 cov12 cov13
variance–covariance matrix = cov12 var2 cov23
cov13 cov23 var3
N 2X covariance
α = Pn 2
Pn
item=1 sitem + item=1 covitem
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 386 / 394
Interpreting Cronbach’s Alpha
Kline (1999)
Reliable if α > .7
Depends on the number of items
More questions = bigger α
Treat subscales separately
Remember to reverse score reverse phrased items!
If not, α is reduced and can even be negative
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 387 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 388 / 394
Reliability Analysis Using R
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 389 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 390 / 394
Reliability Analysis Using R: Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 391 / 394
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 392 / 394
Reliability Analysis Using R: Output
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 393 / 394
The End?
Dr. M. Romano (University of Cagliari) Quantitative Methods for Management A.A. 2021/2022 394 / 394