Professional Documents
Culture Documents
Beyond PHD Coaching - Stats Short Course - 29mar2022h
Beyond PHD Coaching - Stats Short Course - 29mar2022h
Stats for
Graduate Research
Beyond the College Stats Textbook
and Course
Beyond PhD Coaching
Welcome & Overview https://www.beyondphdcoaching.com
• Fail to detect an effect when it is true. The alternate hypothesis is The alternate hypothesis is
true or an effect exists not true or no effect exists.
Real-World Issue
Research Problem
Quantitative
Qualitative Research?
Research? Purpose
Research Question
Beyond PhD Coaching
Stats Concepts: Which Tool to Choose?
Methodology is chosen that bests aligns with . . .
numerical
Real-World phenomenon, numerical
(linear output
input process, (except for
regression) input output
system, output binary
input
input
mechanism, treatment, output logistic
nominal
transformation, influence, regression)
(ANOVA,
correlation, or relationship
t-tests)
Any questions?
Beyond PhD Coaching
Data: Collection
Instruments, Sampling, and
Sample Size
Beyond PhD Coaching
Data Collection: Instruments
• Designed Experiments
• Control the inputs to measure their influence on the output (response).
• Example: use of a mechanical/computer device to dispense patient meds.
• Observation
• Human or inanimate behavior
(static behavior or reactions to stimuli).
• Example: consumer-customer service interactions.
• Minimal control over inputs.
• Documents & Secondary Data
• Existing data from reliable, scholarly sources (e.g., company performance).
• Cannot control the variables, but collect what has already been measured.
• Surveys vs. Questionnaires
• What’s the difference?
Beyond PhD Coaching
The Nature of Data
• Likert Scales (questionnaires)
• 5-level (1-2-3-4-5 with midpoint = 3);
or, 7-level (1 through 7 with midpoint = 4).
• Ordinal variable.
• Often considered to be numerical.
• Aggregated Indices
• Mean response of several items on a Likert scale.
• Continuous numerical variable [higher statistical power & confidence].
• Operationalization of Variables
• Every variable must be defined two ways:
• How is it informed by the instrument (e.g., Likert scale, index, arithmetic operation)?
• What do the variable and its values mean in the real-world?
• Example: what does each number on the Likert scale mean operationally?
Beyond PhD Coaching
Reliability and Validity of Data
• For instruments (i.e., questionnaires), using a multi-item scale
to inform numerical variables, there is a need to measure and/or report
on instrument internal consistency reliability.
• Cronbach’s alpha: Do the items measure the same underlying construct?
• If so, correlations among all items should be positive.
• can be increased by deleting poor items or increasing number of items (k).
• SPSS provides correlation matrix and statistics for identifying problem items.
• For off-the-shelf instruments (with permission), should be provided.
• For self-developed, can be calculated with formula 2,4 or using SPSS.
• Normally done with pilot test, of sufficient sample size.
• With a sample different from the main study, but that mirrors main study attributes.
• Pilot test also provides test of the length and understandability of the instrument.
• Sample size to determine Cronbach’s alpha depends on parameters.2,4 Example:
= .05, = .80, k = 5 items, CA0 = 0, CA1 = .7 => n = 16
Beyond PhD Coaching
Sampling
• Remember why we sample!
• Population: The entire set of entities you are interested in.
• Target population: A subset of the population delimited for the purpose of your
research (scope & deliminations, feasibility, etc.).
• Sample: A subset of the target population selected for the purpose of inferring
some attributes about the target population.
• Sampling Methods & Considerations
• Random sample: every element of the target population equally likely to be chosen.
• Stratified sample: reflects proportions of the target population (+ random sampling).
• Purposive sampling: non-probability sampling; researcher uses judgement when choosing
members of the target population to participate.
• Convience sampling.
• Snowball sampling.
• Survey options: conduct manually or use a service (e.g., SurveyMonkey).
Beyond PhD Coaching
Drivers of Sample Size
• Probability of Type I error (false positive) =
= level of significance
• 1 − = Confidence = inverse of level of significance
• Probability of Type II error (false negative) =
• 1 − = Power*
• How precisely we measure or compare = effect size = e (e.g., means of d)
• Variation (standard deviation) of the variable in the population =
• May not be known, but can be estimated:
• In a normal dist, 99.7% of items are within a range of 6 ; range 6 (e.g., range of 48, 8).1
• Another driver: your resources (time & money to perform the sampling)*
• Tradeoff among all five considerations
• *Many sample size “experts,” calculators, calculations fail to consider power.
• *Reality: money & time may prevent highly precise test with high power & confidence.
Beyond PhD Coaching
Sample Size Calculations
• Common errors with “experts,” calculators, tables: failure to consider statistical power, unequal group size,
population size.
• Sample size calculations for t and Z tests of means performed with an online calculator such as G*Power5
2 2 2
2 𝜎 2 𝑍𝛽 +𝑍𝛼ൗ
(𝑟+1) 𝜎 𝑍𝛽 +𝑍𝛼ൗ2 𝑟+1 2 (𝑟+1) 𝑍𝛽 +𝑍𝛼ൗ
2
𝑛= ∙ = ∙ = ∙
𝑟 𝑒2 𝑟 𝑑𝜎 2 𝑟 𝑑2
where, n = sample size per group (assuming equal sized groups) or sample size of larger group
r = ratio of larger group to smaller group (note: if two groups are of equal size, the expression = 2)
d = percentage of population
requires knowledge of population
or, an estimate of population
Statistical Tools
&
Hypothesis Testing
Beyond PhD Coaching
Data Preparation: Missing Data 1 44.83 114.65 1.18 115.84
2 46.09 117.18 0.03 117.22
or corrupted) 6
7
52.31
50.81
129.62
126.61
-0.13
0.26
129.48
967.20
Response, Y
55
50
• Continuous, numerical dependent variable. 45
40
• Each observation of the dependent variable is 35
30
independent of the observations. 0 10 20 30
Sequence of Observation
• Measurements for each sample not influenced by
or related to the measurements of other subjects. 80
Frequency
50
• Dependent variable has a normal distribution. 40
30
135.00
Y
IVs and the RV. 120.00
115.00
(Durbin Watson test). 40.00 42.00 44.00 46.00 48.00 50.00 52.00 54.00 56.00 58.00
X
Any questions?
Beyond PhD Coaching
ANOVA
&
Regression
Beyond PhD Coaching
Advanced Tools
• Comparison of two groups: t or Z test, correlation, simple hypothesis tests
• When question is, “which factors are influential?” or multiple group comparison . . .
• . . . different, more advanced, related set of tools
Assumptions ANOVA Multiple Linear Regression Binary Logistic Regression
DV numerical categorical: binary
residuals normally distributed
homogeneity of variance homoscedasticity (constant variation of residuals
among groups across all values of IVs)
IVs categorical numerical
2 groups, values, levels convert categorical to numerical
independent observations independent residuals
no multicollinearity among IVs
no outliers
linear relationship no DVs to IVs logit to IVs
model-guilding yes: to find best predictive model (significant predictors)
Beyond PhD Coaching
Multiple Regression
• Example: 12
13
e
f
i
g
p
j
• round up to 10 19
20
f
f
h
h
m
p
• total sample size = N = 240 21
22
f
f
i
i
j
k
*SPSS can handle other designs. 23
24
f
f
i
i
m
p
Beyond PhD Coaching
ANOVA Execution
• Complex; requires statistical app such as SPSS
• Steps:
• Prep data in Excel
• Copy and paste to SPSS (paste with variables names)
• Analyze / General Linear Model / Univariate
• Choose dependent variable
& fixed factors
• Choose model
• Build terms:
• Main effects [primary factors]
• All 2-way [interactions]
Beyond PhD Coaching
ANOVA Execution (part 2)
• Choose EM [Marginal] Means
• For interaction graphs
• Choose options:
• Descriptive statistics
• Homogeneity tests
• Levene’s test
• Homogeneity of variance
• OK: run the analysis
• Check assumptions
• Perform model-building
Beyond PhD Coaching
ANOVA Output (SPSS)
Beyond PhD Coaching
Factor Interactions (Regression)
• Provides additional insights; qualifies the effect of B on Y (“it depends!”).
• “The influence of B (relationship between B & Y) depends on the value of D.”
• Interaction indicated by non-parallel lines (tested statistically).
Always test significance of at least the two-factor interactions.
Depiction 2FI: 𝑌-predicted vs. B when D = min, mean, max Y = hardness of steel
is a 30.0
B = temperature the alloy is heated to
Y-pred
regression D = min 28.0 D = percentage of element D
example; D = mean 26.0
24.0
same D = max Hardness increases with increased temperature . . .
22.0
concept 20.0
But, that depends on the percentage content of D.
for ANOVA. 18.0 When D is at its lowest level, hardness increases
16.0 steeply with temperature.
14.0
12.0
Less so with an average % of D.
B 10.0 And, hardness decreases with increased temperature
35.0 45.0 55.0 65.0 75.0 85.0 95.0
at highest % of D.
Beyond PhD Coaching
Factor Interactions (ANOVA)
• “Influence (relationship) between X and Y depends on the value of C.”
• Provides additional insights; qualifies the effect of X on Y (“it depends!”).
• Again, interaction indicated by non-parallel lines (tested statistically).
Y = socioeconomic index
X = marital status (5 levels or groups)
C = American citizen (2 levels: yes or no)
Any questions?
Beyond PhD Coaching
Regression Model-Building
&
Interpretation
Beyond PhD Coaching
Model-Building
• For multi-attribute research & analysis techniques (regression, ANOVA)
• Interrelated questions which generate research:
• Which independent variables or factors are significant predictors of a response?
• Sensitivity of response to various predictors?
• Issues with real-world phenomena :
• Incompletely understood and complex.
• Unknown causes; lurking influences; interactions.
• Sampling yields less than 100% certainty.
• The influence of a predictor depends
on the presence of other predictors.
• Better questions:
• What model is the best predictor of a response? (and, is it a significant predictor?)
• What set of predictors comprise that model? (factors & interactions)
Beyond PhD Coaching
Defenses Against Uncertainty
• Define the problem (including the variables).
• Plan the approach.
• Choose an appropriate instrument.
• Adequate sample size.*
• Rigorous sampling.
• Careful data collection.
• Correct analysis tool.
• Proper use of the tool.
• Model-building: recognize one run of SPSS will not answer the question.
• Model validation & stability investigations, using bootstrapping.8
• Thorough explanation & interpretation, including subject matter expertise.
*Some recommend sample events-per-variable (EPV) > 25 with some stability investigation; never < 10.8,9
Earlier: 5 predictors, med effect size, = .05, power = .90 => n = 120; EVP of 25 => n’ = 5 25 = 125.
Beyond PhD Coaching
Model-Building Strategy
• Factor screening (theory + SMEs + initial statistical investigation).
• Appropriate variable selection criteria (inclusion criteria, or 𝐵).
• Understand Type I and II errors related to models.
• Focus more on model correctness, completeness, and goodness-of-fit
than on variable significance.10
• Avoid reliance on automated stepwise methods*; but use as evidence.
• Use multiple model-building techniques.11
• Develop collaborative evidence to develop and choose the best model.
• Try different combinations of predictors. *Documented flaws: included
• Investigate factor interactions. predictors highly dependent on the
order of selection or elimination.
Beyond PhD Coaching
Four Stages of Model-Building
• Stage 1: Theory, previous research, empirical results, subject matter
Final Model
expertise to identify candidate independent variables.
• Stage 2: Screening: perform multiple analyses to identify Stageand
4 eliminate
independent variables highly unlikely to be significant predictors.
• Stage 3: Analyze remaining independentStage 3
variables + 2FIs.
• Stage 4: Compare results Stage
from2 all stages and all model-building
techniques to decide on a final predictive model.
• Stages 2 & 3Stage
employ
1 four regression techniques collaboratively . . .
• . . . to generate evidence to select a final predictive model.
• Model goodness-of-fit 5
6
23.9
24.7
22.1
22.5
18.7
18.9
9.9
9.6 X X
X
X
X
X
X
X X
X X
What Happens
After the Analysis?
Beyond PhD Coaching
What Next?
• Statistics are not just numbers; they tell a story.13
• Someone is expecting both the numbers and the story.
• Several levels of rigorous statistical analysis:
data • Using tools provide graphical analysis, descriptive stats,
hypothesis testing, model-building. Some stop here . . . .
information • Presentation of outputs suggest relationships, patterns,
observations about mean & variance, results of hypothesis tests,
what adj R2 says, factor interactions. . . . . or, here.
knowledge • Explanation of stats:
• Subject matter expertise, prior research, theory to explain and interpret
statistical outputs.
• Corroboration or refutation of prior thinking.
wisdom • Not what was found, but why?
• Correlation and causation. This is where you become a scholar.
action • What real-world decisions are recommended?
Beyond PhD Coaching
What Might You Be Asked (Orals)?
• What’s the real-world issue at the heart of (the motivation for) your research?
• Why quantitative? ref14
• Why your choice of quantitative methodology?
• Where did your variables come from? Operational definitions?
• Rationale for statistical parameters (, power, effect size)?
• Rationale for sampling technique?
• Rationale for instrument?
• How do you know your instrument is valid and reliable?
• Did you consider factor interactions?
• What do factor interactions tell you?
• What did you learn that was not previously known or understood?
• How can someone use your research, analysis, and findings?
• What are your plans for disseminating your research, analysis, and findings?
Beyond PhD Coaching
So, What’s It All Mean?
• Facts are stubborn, but statistics are more pliable–Mark Twain.15
• It is the mark of a truly intelligent person to be moved by statistics– George
Bernard Shaw.15
• In ancient times they had no statistics so they had to fall back on lies– Stephen
Leacock.15
• There is a good reason why quantitative research is used.
• There is an opportunity for valid, reliable, believable analysis.
• But, it is not guaranteed just because we choose a well respected tool.
• Only one person responsible here—the researcher/analyst/candidate.
• Burden of proof!!!
• You are now armed and dangerous; make the most of it; don’t stop researching
and asking the tough questions!
• I would say, “Good Luck,” but you don’t need luck. So, “All the best!”
Beyond PhD Coaching