Normal Equation: θ = (X^T X)^{-1} X^T y

The normal equation is a method for finding the optimal theta values without iteration. It is given by the formula θ = (X^T X)^{-1} X^T y. There is no need for feature scaling. While it does not require iterations like gradient descent, computing the inverse has a complexity of O(n^3), where n is the number of features, so it can be slow when n is very large. The X^T X matrix may be non-invertible if features are redundant or there are more features than samples, in which case a regularization approach or removing features may help.


Uploaded by Laith Bounenni


28/09/2020 Machine Learning - Home | Coursera

Normal Equation
The "Normal Equation" is a method of finding the optimum theta without iteration.

θ = (X^T X)^{-1} X^T y

There is no need to do feature scaling with the normal equation.
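
As a minimal sketch of the formula above, the following Python/NumPy snippet computes θ on a small data set. The toy data, the function name `normal_equation`, and the use of NumPy are assumptions for illustration and are not part of the original note. The sketch solves the linear system (X^T X) θ = X^T y instead of forming the inverse explicitly, which is mathematically equivalent when X^T X is invertible. It also rescales a feature to illustrate that the fitted predictions do not depend on feature scaling.

```python
import numpy as np

def normal_equation(X, y):
    """Return theta solving (X^T X) theta = X^T y.

    X is assumed to already contain a leading column of ones for the intercept.
    """
    # np.linalg.solve avoids forming (X^T X)^{-1} explicitly.
    return np.linalg.solve(X.T @ X, X.T @ y)

# Hypothetical toy data: y = 1 + 2*x exactly.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 1.0 + 2.0 * x
X = np.column_stack([np.ones_like(x), x])

theta = normal_equation(X, y)
print(theta)  # approximately [1. 2.]

# Rescaling the feature changes theta but not the fitted predictions.
X_scaled = np.column_stack([np.ones_like(x), 1000.0 * x])
theta_scaled = normal_equation(X_scaled, y)
print(np.allclose(X_scaled @ theta_scaled, X @ theta))
```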

Mathematical proof of the Normal equation requires knowledge of linear algebra and is fairly involved, so you do not need to worry about the details.

Proofs are available at these links for those who are interested:

https://en.wikipedia.org/wiki/Linear_least_squares_(mathematics)

http://eli.thegreenplace.net/2014/derivation-of-the-normal-equation-for-linear-regression
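
For readers who only want the outline, the derivation can be sketched in a few lines. This sketch assumes the usual least-squares cost function J(θ) with m training examples, which this note does not state explicitly; the full argument is in the links above.

```latex
% Assumption: least-squares cost with m training examples (not stated in this note).
J(\theta) = \frac{1}{2m} (X\theta - y)^T (X\theta - y)

% Gradient with respect to theta, set to zero at the minimum.
\nabla_\theta J(\theta) = \frac{1}{m} X^T (X\theta - y) = 0

% Rearranging gives the normal equations.
X^T X \theta = X^T y

% If X^T X is invertible, solve for theta.
\theta = (X^T X)^{-1} X^T y
```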

The following is a comparison of gradient descent and the normal equation:

Gradient Descent              Normal Equation

Need to choose alpha          No need to choose alpha
Needs many iterations         No need to iterate
O(kn^2)                       O(n^3), need to calculate inverse of X^T X
Works well when n is large    Slow if n is very large

With the normal equation, computing the inversion has complexity O(n^3). So if we have a very large number of features, the normal equation will be slow. In practice, when n exceeds 10,000 it might be a good time to go from a normal solution to an iterative process.
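
The comparison above can be illustrated with a short Python/NumPy sketch that fits the same data both ways. The random toy data, the learning rate `alpha = 0.1`, and the iteration count are assumed values chosen for this example only; gradient descent on other data would generally need different settings.

```python
import numpy as np

# Hypothetical toy data: m = 50 examples, 2 features plus an intercept column.
rng = np.random.default_rng(0)
m = 50
features = rng.normal(size=(m, 2))
X = np.column_stack([np.ones(m), features])
true_theta = np.array([4.0, -2.0, 0.5])
y = X @ true_theta  # noise-free targets, so both methods should recover true_theta

# Normal equation: one linear solve, no alpha, no iterations.
theta_ne = np.linalg.solve(X.T @ X, X.T @ y)

# Batch gradient descent: needs a learning rate and many iterations.
alpha = 0.1        # assumed value, must be tuned for other data
iterations = 2000  # assumed value
theta_gd = np.zeros(X.shape[1])
for _ in range(iterations):
    gradient = X.T @ (X @ theta_gd - y) / m
    theta_gd -= alpha * gradient

# Both approaches should agree closely on this small problem.
print(np.allclose(theta_ne, theta_gd, atol=1e-6))
```

On a problem this small the normal equation is the simpler choice; the iterative route only pays off once the number of features makes the O(n^3) solve too expensive.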

Normal Equation Noninvertibility

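When implementing the normal equation in Octave, we want to use the 'pinv' function rather than 'inv'. The 'pinv' function computes the pseudoinverse, so it returns a value of θ even if X^T X is not invertible.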

X^T X may be noninvertible. The common causes are:

Redundant features, where two features are very closely related (i.e. they are linearly dependent)
Too many features (e.g. m ≤ n, where m is the number of training examples and n is the number of features). In this case, delete some features or use "regularization" (to be explained in a later lesson).

Solutions to the above problems include deleting a feature that is linearly dependent with another or deleting one or more features when there are too many features.
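
The redundant-feature case can be sketched in Python with NumPy, whose `np.linalg.pinv` plays the same role as Octave's 'pinv'. The toy data below is hypothetical, and the explicit tolerances (`tol`, `rcond`) are assumptions added so the example behaves the same regardless of floating-point rounding.

```python
import numpy as np

# Hypothetical data with a redundant feature: the third column is exactly 2x the second.
x = np.array([1.0, 2.0, 3.0, 4.0])
X = np.column_stack([np.ones_like(x), x, 2.0 * x])
y = 3.0 + 5.0 * x

A = X.T @ X
rank = np.linalg.matrix_rank(A, tol=1e-10)
print(rank)  # 2, fewer than the 3 columns, so X^T X is singular

# The pseudoinverse exists even though A has no ordinary inverse.
theta = np.linalg.pinv(A, rcond=1e-10) @ X.T @ y
print(np.allclose(X @ theta, y))  # the fit still reproduces y
```

Because the two dependent columns carry the same information, many values of θ fit equally well; the pseudoinverse picks the one with the smallest norm. Deleting the redundant column, as suggested above, removes the ambiguity altogether.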

https://www.coursera.org/learn/machine-learning/resources/QQx8l
