Professional Documents
Culture Documents
Asc 399 Exercise
Asc 399 Exercise
b)SEMMA is an acronym for Sample, Explore, Modify, Model, and Assess. The first
step
of SEMMA process is to sample the data. A representative subset of the data will be
selected
for analysis. The next step is to explore the data. This would be done by examining
the data
to understand its characteristics such as distribution, missing values, and
outliers.
The Modify step involves preparing the data for modeling.
This may involve transforming variables, creating new variables, and handling
missing values.
The Model step involves applying data mining techniques to the data to create a
model.
The final step is to assess the model. It includes evaluating the model's
performance to know
how well it predicts the desired outcome.