
Optimization

Mark A. Magumba
Linear Regression
• Equation of the line takes the form:
  $y = w_0 + w_1 x$, where $w_0$ is the intercept and $w_1$ is the slope (the coefficient of the independent variable $x$)
Multiple regression: 2 independent variables
• Equation takes the form:
  $y = w_0 + w_1 x_1 + w_2 x_2$, where $w_0$ is the intercept and $w_1$, $w_2$ are the coefficients of the independent variables $x_1$ and $x_2$ (see the sketch below)
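As a quick illustration of the two equations above, here is a minimal Python sketch; the function names, coefficient values, and data are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def predict_simple(x, w0, w1):
    """Simple linear regression: y = w0 + w1*x."""
    return w0 + w1 * x

def predict_multiple(x1, x2, w0, w1, w2):
    """Multiple regression with two independent variables: y = w0 + w1*x1 + w2*x2."""
    return w0 + w1 * x1 + w2 * x2

# Example usage with made-up coefficients and feature values
x1 = np.array([1.0, 2.0, 3.0])
x2 = np.array([0.5, 1.5, 2.5])
print(predict_multiple(x1, x2, w0=0.1, w1=0.8, w2=-0.3))
```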
Optimization Techniques
• Ways of determining the best parameters for parametric models
• Analytical solutions may not exist
• For big data it isn’t computationally feasible to obtain exact solutions
• Common technique is gradient descent
• Requires defining a cost/loss function
• It is convenient to express the loss function in a differentiable form, e.g. using
the mean squared error, since gradient descent involves computing partial derivatives
with respect to the different parameters (see the sketch below)
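As a concrete example of such a differentiable loss, here is a minimal sketch of the mean squared error; the names and numbers are illustrative, not from the slides.

```python
import numpy as np

def mse_loss(y_true, y_pred):
    """Mean squared error: the average of the squared residuals."""
    return np.mean((y_true - y_pred) ** 2)

# Example usage with made-up targets and predictions
y_true = np.array([1.0, 0.0, 1.0, 0.0])
y_pred = np.array([0.9, 0.2, 0.7, 0.1])
print(mse_loss(y_true, y_pred))  # 0.0375
```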
Gradient Descent
[Figure: loss surface plotted over two parameters X1 and X2, with the global minimum of the loss marked]
Gradient descent
• Gradient descent is about obtaining the optimal coefficients for your terms; the output
is assumed to be a linear combination of terms, as with linear/multiple regression
• Assume this is our data, where x1, x2, …, xn are the features and y is the target
column to be predicted:
[Table: example data with feature columns x1, x2, x3, x4, …, xn and a binary target column Y (values 1, 0, 1, 0)]
Gradient descent
• The following equations may be formulated: for each example, the prediction is
  $\hat{y} = w_0 + w_1 x_1 + w_2 x_2 + \dots + w_n x_n$, and the loss over all $m$ examples is
  $L = \frac{1}{m}\sum_{i=1}^{m} (y_i - \hat{y}_i)^2$
Partial gradients
• To compute the optimal values of the coefficients/weights, an analytical solution
may be found, but where you have a large number of terms, imperfect data, and a
large number of examples, this is often computationally intractable
• However, if we know the influence of each term on the loss we can obtain the
optimal coefficients by minimizing the loss
• The influence of each term on the loss/cost function is the partial derivative of the
loss/cost with respect to that term's coefficient/weight
• From our equations on the previous slides, the following are the partial derivatives of
the loss with respect to the coefficients of x1 and x2 for multiple linear regression (see the sketch below):
  $\frac{\partial L}{\partial w_1} = -\frac{2}{m}\sum_{i=1}^{m} x_{1i}\,(y_i - \hat{y}_i)$
  $\frac{\partial L}{\partial w_2} = -\frac{2}{m}\sum_{i=1}^{m} x_{2i}\,(y_i - \hat{y}_i)$
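A minimal NumPy sketch of these partial gradients, assuming the squared loss and a two-column feature matrix; the names (X, y, w, b) and the toy data are illustrative assumptions, not from the slides.

```python
import numpy as np

def partial_gradients(X, y, w, b):
    """Gradients of the mean squared loss for a linear model y_hat = X @ w + b."""
    m = len(y)
    y_hat = X @ w + b                  # predictions for every example
    error = y_hat - y                  # residuals
    dw = (2.0 / m) * (X.T @ error)     # dL/dw1, dL/dw2
    db = (2.0 / m) * np.sum(error)     # dL/db (intercept term)
    return dw, db

# Example usage with toy data and zero-initialised parameters
X = np.array([[1.0, 0.5], [2.0, 1.5], [3.0, 2.5]])
y = np.array([1.0, 2.0, 3.0])
dw, db = partial_gradients(X, y, w=np.zeros(2), b=0.0)
print(dw, db)
```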
Update rule
• With each iteration, we can adjust the coefficients/weights of each
term using this simple weight update rule:
  $w \leftarrow w - \eta\,\frac{\partial L}{\partial w}$
• Where $w$ is the weight, $\eta$ is the learning rate (a small number used to
regulate the speed of learning), and $L$ is the loss/cost function. It is
common to use the squared loss, which is a mathematical convenience since
we want the loss/cost function to be differentiable; a full sketch of this update loop is given below
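Putting the partial gradients and the update rule together, here is a minimal sketch of vanilla (full-batch) gradient descent for linear regression; the learning rate, iteration count, and toy data are illustrative assumptions.

```python
import numpy as np

def gradient_descent(X, y, eta=0.01, n_iters=1000):
    """Vanilla gradient descent on the mean squared loss of a linear model."""
    m, n = X.shape
    w, b = np.zeros(n), 0.0
    for _ in range(n_iters):
        error = X @ w + b - y               # residuals over all examples
        dw = (2.0 / m) * (X.T @ error)      # dL/dw
        db = (2.0 / m) * np.sum(error)      # dL/db
        w -= eta * dw                        # weight update rule: w <- w - eta * dL/dw
        b -= eta * db
    return w, b

# Example usage: fitting toy data generated as y = 2*x1 + 1*x2
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 3.0], [4.0, 0.5]])
y = 2 * X[:, 0] + 1 * X[:, 1]
print(gradient_descent(X, y, eta=0.05, n_iters=2000))
```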
Weaknesses
• Local minima for certain data shapes
• Computational inefficiency for big data
• Solutions:
• Online gradient descent / stochastic gradient descent: with each
iteration, weights are updated based on the loss of a single randomly
selected example instead of the mean loss over all examples
• Mini-batch gradient descent: instead of updating the weights based on a
single example (stochastic gradient descent) or all examples (vanilla gradient
descent), we use the mean loss over a small number of examples
(a mini-batch). Determining the ideal batch size is a matter of trial and error; a sketch of this variant is given below
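A minimal sketch of the mini-batch variant described above (stochastic gradient descent is the special case of batch size 1); the batch size, learning rate, epoch count, and toy data are illustrative assumptions.

```python
import numpy as np

def minibatch_gradient_descent(X, y, eta=0.01, n_epochs=100, batch_size=2):
    """Mini-batch gradient descent: update weights from the mean loss of small random batches."""
    rng = np.random.default_rng(0)
    m, n = X.shape
    w, b = np.zeros(n), 0.0
    for _ in range(n_epochs):
        order = rng.permutation(m)                  # shuffle examples each epoch
        for start in range(0, m, batch_size):
            idx = order[start:start + batch_size]   # indices of one mini-batch
            error = X[idx] @ w + b - y[idx]
            dw = (2.0 / len(idx)) * (X[idx].T @ error)
            db = (2.0 / len(idx)) * np.sum(error)
            w -= eta * dw                            # same update rule, batch-sized gradient
            b -= eta * db
    return w, b

# Example usage on the same toy data as the previous sketch
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 3.0], [4.0, 0.5]])
y = 2 * X[:, 0] + 1 * X[:, 1]
print(minibatch_gradient_descent(X, y, eta=0.02, n_epochs=2000, batch_size=2))
```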
