Professional Documents
Culture Documents
Date: 19/03/2020
Independent Variable (IV): n nonmetric variable, Dependent Variable (DV): 1 metric variable,
Control Variable (CV): metric variable
Note:
- Error value will become smaller as you put more factors/independent variables into
ANOVA
- Only use 2-way ANOVA at most; for 3+ factors, use ANCOVA
- When all Sigs are good (<0.05), compare Sum of Squares (SS) to tell which factor
has stronger effect
Date: 26/03/2021
1. Discriminant analysis:
- IV: n, metric; DV: 1, nonmetric (2-3 levels)
- Analyze => Classify => Discriminant
- Grouping Variable: DV (define range based on DV level)
- Independents: IV(s)
- Classify: Tick Leave-one-out classification (Display); if 3 levels, then tick
Combined-groups (Plots)
- SPSS table: (for 3 levels)
2
o Eigenvalues: read correlation (r) => square them up (𝑅 )
o Wilks’ Lamda: read Sig
o Standardized Canonical Discriminant: check and compare Beta (for 3 levels,
check both functions)
o Classification result: Check cross-validated for correctly classified data
- For 3 levels, examine the figure Canonical Discriminant Functions to determine
which function best serve Group Centroid 3 (normally function 1)
Example.:
-
2. Logit analysis:
- Analyze => Regression => Binary Logistic
- Dependent: DV (2-3 levels)
- Covariate: IV(s)
- SPSS table: (Block 1)
o Model summary: use Nagelkerke R square
o Variables in the equation: check Sigs & B (ignore constant)
Date: 09/04/2021
1. Cluster number:
- Analyze => Classify => Hierarchical cluster
- Variables: IV(s)
- Statistics: Tick Agglomeration schedule
- Plots: Tick Dendrogram
- Method: Ward’s method (Cluster method), Squared Euclidean distance (Interval)
- SPSS table:
o Agglomeration Schedule: look at column Coefficients, compare changes in
coefficients as No. observations (Stage) changes, choose the largest change to
determine No. cluster. E.g.: change “stage 164-165” is the largest (166 obs)
=> 2 clusters
o Dendrogram using Ward Linkage: same purpose with Agglomeration
Schedule. Zoom to better observe (height & width: 500) and draw straight
vertical (cut) lines (can be assumed) along with existed vertical lines. No.
clusters = No. longest horizontal lines after drawing cut lines
2. Cluster characteristics:
- Analyze => Classify => K-means cluster
- Variables: IV(s) + DV
- No. clusters: 2 (normally). For differentiation purpose, test No. clusters in ascending
order until able to clearly differentiate cluster (E.g.: 5 clusters in this case)
- SPSS table: Final Cluster Centers (5 clusters only)
o Compare results for DV (Consumer satisfaction) and rank 1-5 with 5 is lowest
and 1 is highest, normally there are 2 distinct clusters 1 & 5 with extreme
results => determine the others
o Similar process applied for IVs (factors), spot out clusters that have high rank
in at least 1 factor (rank 1-2) and name it based on that characteristic, for more
than 1 factor => name cluster by general characteristics
o There are times when the results between DV and IVs are contrary (E.g.: high
DV but low for all IVs) => imply that this cluster need further investigation
(more surveys and research needed)
Date: 16/04/2021
Review: 13/13 techniques studied