Professional Documents
Culture Documents
Lecture 5
Lecture 5
CHE F315
Recap
Multivariate data
Euclidean and Mahalanobis distance
Multivariate outlier detection
Data transformation
Dimensionality reduction
Feature selection
Feature extraction (transformation)
26 January 2024 4
BITS Pilani, Pilani Campus
ET ZC362 Environmental Pollution Control
Feature selection
26 January 2024 5
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Feature selection
Measure of relevant feature
– Mutual information
– Correlation based similarity
– Distance-based similarity
A typical feature selection process consists of four steps:
– Generation of possible subsets
– Subset evaluation
– Stop searching based on some stopping criterion
– Validation of the result
26 January 2024 6
BITS Pilani, Pilani Campus
Probability
BITS Pilani
Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Why Probability in ML
Designing machines that learn from observed data
Uncertainty in learning from data
Observed data can be consistent with many models and
therefore which model is appropriate, given the data, is
uncertain
Predictions about future data and the future consequences
of actions are uncertain
Many aspects of learning and intelligence crucially depend
on the careful probabilistic representation of uncertainty.
Probabilistic framework describes how to represent and
manipulate uncertainty about models and predictions
Bayesian interpretation use of probability to quantify
uncertainty
26 January 2024 8
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Review of basics
26 January 2024 9
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Example
26 January 2024 10
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Probability distribution
26 January 2024 11
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Probability distribution
26 January 2024 12
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Example
26 January 2024 13
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
Probability distribution
26 January 2024 14
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
References
26 January 2024 15
BITS Pilani, Pilani Campus
CHE F315 Machine Learning for Chemical Engineers
26 January 2024
16 BITS Pilani, Pilani Campus