Professional Documents
Culture Documents
Background
Raw data
Variable modelling
Pre-processing
Training
Testing
Results and conclusions
Background
Survey
Obtain raw data
Variable Modelling
Removed outliers
Possible future clustering app
Split multiple variables (sports question)
Conversion of numerical variables to proper form (age)
Conversion of two-class labels to binary
Conversion of multi-class labels
Ordinal
Dummy coding
𝑥𝑖 −𝜇
Normalisation 𝑥𝑖′ = 𝜎
Training