Objective: Classify examples into 123 classes using only their first and last names, with 1000 examples per class. Run each task
for 10 epochs.
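Since the classifier sees nothing but a first and last name, each example must first be turned into a fixed-size feature vector. The sketch below is an illustrative assumption rather than the setup actually used here: it hashes character trigrams of the full name into a count vector (the function name, trigram choice, and dimension are all hypothetical).

```python
def name_to_features(first, last, dim=256):
    """Hash character trigrams of "first last" into a dim-sized count vector.
    Illustrative featurization only; the report does not specify the model input.
    """
    text = f"{first} {last}".lower()
    vec = [0.0] * dim
    for i in range(len(text) - 2):
        trigram = text[i:i + 3]          # e.g. "ada", "da ", "a l", ...
        vec[hash(trigram) % dim] += 1.0  # hashed bag-of-trigrams
    return vec

features = name_to_features("Ada", "Lovelace")
```

The resulting vector has one count per hashed trigram; collisions are tolerated, as is usual for hashed bag-of-features representations.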
Task 1: Get the best learning rate from [0.1, 0.01, 0.001, 0.0001, 0.00001]. Set momentum = 0, L2 = 0
● For learning rate 0.1:
Not run: since learning rate 0.01 already performed poorly, we assume 0.1 would perform even
worse.
● For learning rate 0.01:
Terminated early because accuracy was decreasing:
● From this we can conclude that momentum 0.98 is worse than 0.9 or 0.5, since it overshot the minima; however,
the difference in performance between 0.9 and 0.5 is small.
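The early-termination rule used for learning rate 0.01 can be sketched as a training loop that stops as soon as accuracy drops. `step_fn` here is a hypothetical per-epoch callback, not part of the original setup:

```python
def train_with_early_stop(step_fn, epochs=10):
    """Run up to `epochs` epochs; terminate early as soon as accuracy
    decreases, mirroring the early termination used for lr = 0.01."""
    accs = []
    for _ in range(epochs):
        acc = step_fn()  # trains one epoch, returns validation accuracy
        if accs and acc < accs[-1]:
            break        # accuracy is decreasing: stop this run
        accs.append(acc)
    return accs

# Usage with a fake per-epoch accuracy sequence that starts to decrease:
fake = iter([0.1, 0.2, 0.3, 0.25, 0.4])
history = train_with_early_stop(lambda: next(fake))
```

The run above stops at the first decrease (0.25 after 0.3), so only the non-decreasing prefix of accuracies is kept.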
Task 3: Since the curves for learning rates 0.001 and 0.0001 are very similar, we repeat Task 2
with learning rate 0.001. Get the best momentum value from [0.9, 0.98, 0.5]. Set the learning rate to 0.001, and
run it for 20 epochs:
● For 0:
● For 0.98:
● For 0.90:
● For 0.50:
● As one can see, using momentum 0.5 is better than 0.9 or 0.98, simply because 0.9 and 0.98 caused the
algorithm to overshoot its minima. Moreover, one can also see that a learning rate of 0.01 is in general
worse than a learning rate of 0.0001. Thus, using 0.0001 as the learning rate seems to be
the most sensible choice.
Task 4: Get the best L2 regularizer from [0.1, 0.01, 0.001, 0.0001]. Set the learning rate to the best one from Tasks 2-3
(0.0001), and set the momentum to the best one from Tasks 2-3 (0.9):
● For L2 regularizer 0:
● For L2 regularizer 0.1:
● For L2 regularizer 0.01:
● For L2 regularizer 0.001:
● For L2 regularizer 0.0001:
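With plain SGD, an L2 regularizer enters the update as an extra `l2 * w` term added to the loss gradient (equivalent to weight decay in this setting). A minimal sketch under that assumption, using the Task 4 defaults of learning rate 0.0001 and momentum 0.9:

```python
def sgd_step_l2(w, v, grad, lr=0.0001, momentum=0.9, l2=0.001):
    """Momentum SGD with L2 regularization: the penalty gradient l2*w is
    added to the loss gradient before the velocity update."""
    g = grad + l2 * w      # L2 penalty pulls the weight toward zero
    v = momentum * v + g
    return w - lr * v, v

# With a zero loss gradient, the weight slowly decays under the penalty alone.
w, v = 1.0, 0.0
for _ in range(1000):
    w, v = sgd_step_l2(w, v, grad=0.0)
```

Larger `l2` values shrink weights faster, which is why the sweep spans four orders of magnitude: too large a penalty dominates the loss gradient, too small a penalty has no regularizing effect.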