Professional Documents
Culture Documents
KDD: knowledge discovery in database: extraction useful knowledge from large databases
-Predictive tasks: predict values of a particular atribute based on the values of other attributes:
Classification, regression
- Descriptive tasks: inducing patterns that summarize the underlying relationships in data
OVERFITTING:
- a hypothesis that exactly fits the training data may be wrong and have bad generation capabilities
- Overfitting occurs when the odel is too tailored over the training data -> reflect its contingent
properties rather than its structural properties
ENTROPY:
INFORMATION GAIN
the loss: difference between actual and predicted values -> distance measure
SUPERVISED LEARNING: learning algorithm from a training dataset, infering a model from labeled
training data
UNSUPERVISED LEARNING: modeling the underlying or hidden structure in the unlabled data, only have
input data no corresponding output variables - clustering(ex: self-organizing maps)