Hyper-parameter Optimization
If we were to count all the possible classification algorithms and their parameters available just
within the sklearn API, we would end up with something like 1.2 duodecillion combinations. Each
combination is something we might want to try in our effort to find the best-performing model
for our problem. There is no free lunch, after all.
Data Scientists call this search hyperparameter tuning or hyperparameter optimization.
Many of us have employed GridSearchCV and RandomizedSearchCV to help narrow the
search space. Let's take a quick review of these two methods:
Grid Search Cross Validation (GridSearchCV)
Grid search works by exhaustively evaluating every combination of the parameters you specify for
your model. Each combination is scored in a series of cross-validation passes. This technique has
been in vogue for the past several years as a way to tune your models.
Let's take a quick look at the process in Python with an SVM:
from sklearn.model_selection import GridSearchCV
from sklearn import datasets, svm

# Load the iris dataset
iris = datasets.load_iris()

# Parameter grid to search over
parameters = {
    'kernel': ('linear', 'rbf'),
    'C': [1, 10]
}

# Instantiate the SVM classifier
svc = svm.SVC(gamma="scale")

# Fit a model for each combination of parameters - GridSearchCV keeps the best one
clf = GridSearchCV(svc, parameters, cv=5)
clf.fit(iris.data, iris.target)
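Once the fit completes, the winning combination and its cross-validated score are exposed on the
fitted search object. A quick check, assuming the clf fitted above:

# Best parameter combination found by the grid search
print(clf.best_params_)

# Mean cross-validated accuracy of that combination
print(clf.best_score_)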
We are running only twenty fits with the grid above: four parameter combinations, each evaluated
across five cross-validation folds. Given the size of our dataset and the number of models, the
runtime for this grid will be trivial. Imagine, though, that our dataset is orders of magnitude larger
and we decide to tweak many more parameters. The runtime would then be considerably longer:
days or weeks if you are tuning neural networks.
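To see how quickly the work grows, we can count the fits a grid implies. The larger grid below is a
hypothetical expansion of the one above:

from sklearn.model_selection import ParameterGrid

# The grid above: 2 kernels x 2 values of C = 4 combinations, each fit 5 times (cv=5)
small_grid = {'kernel': ('linear', 'rbf'), 'C': [1, 10]}
print(len(ParameterGrid(small_grid)) * 5)   # 20 fits

# A modestly larger (hypothetical) grid already multiplies the cost
big_grid = {
    'kernel': ('linear', 'rbf', 'poly', 'sigmoid'),
    'C': [0.1, 1, 10, 100],
    'gamma': [0.001, 0.01, 0.1, 1]
}
print(len(ParameterGrid(big_grid)) * 5)     # 320 fits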
Random Search Cross Validation (RandomizedSearchCV)
Trying every possible combination takes a lot of brute-force computation, so RandomizedSearchCV
adopts a faster technique: randomly sample from a range of parameters.
The idea is that you will land on a near-optimal set of parameters faster than grid search. This
technique, however, is naive: it doesn't know or remember anything from its previous runs.
from scipy.stats import randint as sp_randint
from sklearn.model_selection import RandomizedSearchCV
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier

# Data
digits = load_digits()
X, y = digits.data, digits.target

# Instantiate a classifier
clf = RandomForestClassifier(n_estimators=20)

# Specify parameters and distributions to sample from
param_dist = {
    "max_depth": [3, None],
    "max_features": sp_randint(1, 11),
    "min_samples_split": sp_randint(2, 11),
    "bootstrap": [True, False],
    "criterion": ["gini", "entropy"]
}

# Random search
n_iter_search = 20
random_search = RandomizedSearchCV(
    clf,
    param_distributions=param_dist,
    n_iter=n_iter_search,
    cv=5
)
random_search.fit(X, y)
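As with grid search, the best sampled combination is available on the fitted object; assuming the
random_search fitted above:

# Best parameter combination sampled and its mean cross-validated score
print(random_search.best_params_)
print(random_search.best_score_)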
Bayesian Hyperparameter Optimization
GridSearchCV and RandomizedSearchCV are both naïve approaches: each model run
is uninformed by the previous runs.
The idea behind Bayesian optimization is to build a probability model of the objective function and
use it to select the most promising hyperparameters to evaluate in the true objective function.
Bayesian approaches, in contrast to random or grid search, keep track of past evaluation results,
which they use to form a probabilistic model mapping hyperparameters to a probability of a score
on the objective function.
This model, p(y | x), is called a surrogate for the objective function. The surrogate is much cheaper
to evaluate than the objective itself, so the next hyperparameters are chosen by optimizing the
surrogate, which limits the number of calls to the true objective function. The loop looks like this:
1. Build a surrogate probability model of the objective function (our algorithm)
2. Find the hyperparameters that perform best on the surrogate
3. Apply these hyperparameters to the true objective function
4. Update the surrogate model incorporating the new results
5. Repeat steps 2–4 until max iterations or time is reached
At a high level, Bayesian optimization methods are efficient because they choose the next
hyperparameters in an informed manner.
The basic idea: spend a little more time selecting the next hyperparameters in order to make fewer
calls to the objective function. By evaluating hyperparameters that appear more promising from past
results, Bayesian methods can find better model settings than random search in fewer iterations.
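To make the loop concrete, here is a minimal sketch using the Hyperopt library (which we introduce
below), tuning the same SVM from the grid-search example with a TPE surrogate. The search-space
bounds and max_evals are illustrative assumptions:

from hyperopt import fmin, tpe, hp, Trials, STATUS_OK
from sklearn import datasets, svm
from sklearn.model_selection import cross_val_score

iris = datasets.load_iris()

# Objective function: hyperparameters in, score out.
# fmin minimizes, so return the negative mean cross-validated accuracy.
def objective(params):
    clf = svm.SVC(C=params['C'], gamma=params['gamma'])
    score = cross_val_score(clf, iris.data, iris.target, cv=5).mean()
    return {'loss': -score, 'status': STATUS_OK}

# Search space: log-uniform priors over C and gamma (illustrative ranges)
space = {
    'C': hp.loguniform('C', -3, 3),
    'gamma': hp.loguniform('gamma', -3, 3),
}

# Trials records every evaluation - the "memory" that grid and random search lack
trials = Trials()
best = fmin(
    fn=objective,
    space=space,
    algo=tpe.suggest,   # Tree-structured Parzen Estimator surrogate
    max_evals=50,
    trials=trials
)
print(best)

This is essentially the loop that hyperopt-sklearn, covered next, wraps for you.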
Luckily for us, we do not have to implement these procedures by hand. The Python ecosystem has
several popular implementations: Spearmint, MOE (developed by Yelp), SMAC, and Hyperopt. We
will focus on Hyperopt, which seems to be the most popular implementation and also has a nice
wrapper for sklearn aptly called hyperopt-sklearn.
Installing hyperopt-sklearn:
git clone https://github.com/hyperopt/hyperopt-sklearn.git
cd hyperopt-sklearn
pip install -e .
Below is a sample search for a classification algorithm using the hyperopt-sklearn package, which
wraps sklearn classification models in its searches. The package is still in its early stages.
from hpsklearn import HyperoptEstimator, any_sparse_classifier, tfidf
from sklearn.datasets import fetch_20newsgroups
from sklearn import metrics
from hyperopt import tpe
import numpy as np

# Download the data and split into training and test sets
train = fetch_20newsgroups(subset='train')
test = fetch_20newsgroups(subset='test')
X_train = train.data
y_train = train.target
X_test = test.data
y_test = test.target

# Search over sparse classifiers with a TF-IDF preprocessor,
# using TPE to propose each candidate configuration
estim = HyperoptEstimator(
    classifier=any_sparse_classifier('clf'),
    preprocessing=[tfidf('tfidf')],
    algo=tpe.suggest,
    trial_timeout=300
)

estim.fit(X_train, y_train)
print(estim.score(X_test, y_test))
# <<show score here>>
print(estim.best_model())
The data science community is quickly adopting Bayesian hyperparameter optimization for deep
learning. The long runtime of each model evaluation makes these methods preferable to manual or
grid-based methods. There is a Hyperopt wrapper for Keras called hyperas, which simplifies
Bayesian optimization for Keras models.