MLP and Backpropagation
We will introduce the MLP and the backpropagation
algorithm which is used to train it
The term MLP is used to describe any general feedforward network (one with no
recurrent connections)
However, we will concentrate on networks with units
arranged in layers
[Figure: layered feedforward network with inputs x1, …, xn]
Different books refer to the above as either a 4-layer network (counting layers
of neurons) or a 3-layer network (counting layers of adaptive weights). We will
follow the latter convention.
1st question:
what do the extra layers gain you? Start by looking at
what a single layer can’t do
Perceptron Learning Theorem
• Recap: A perceptron (threshold unit) can
learn anything that it can represent (i.e.
anything separable with a hyperplane)
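As a reminder of what a single threshold unit computes, here is a minimal sketch in Python (NumPy assumed; the AND weights below are an illustrative choice, not taken from the slides): the unit fires when w·x + b crosses zero, so its decision boundary is a hyperplane.

import numpy as np

def perceptron(x, w, b):
    # Threshold unit: output 1 if the weighted sum crosses 0, else 0.
    # Points with np.dot(w, x) + b == 0 form a hyperplane, so the unit
    # can only separate classes that some hyperplane can separate.
    return 1 if np.dot(w, x) + b > 0 else 0

# AND of two binary inputs is linearly separable, so a perceptron can represent it.
w, b = np.array([1.0, 1.0]), -1.5
print([perceptron(np.array(x), w, b) for x in [(0, 0), (0, 1), (1, 0), (1, 1)]])
# -> [0, 0, 0, 1]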
The Exclusive OR problem
A perceptron cannot represent Exclusive OR,
since XOR is not linearly separable.
Minsky & Papert (1969) offered solution to XOR problem by
combining perceptron unit responses using a second layer of
Units. Piecewise linear classification using an MLP with
threshold (perceptron) units
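A hedged sketch of that construction in Python (NumPy assumed; the weight and threshold values are one illustrative choice, not the slides' values): two first-layer threshold units compute x1 OR x2 and NOT (x1 AND x2), and a second-layer threshold unit ANDs their outputs, which gives XOR.

import numpy as np

def step(z):
    # Threshold (perceptron) activation: 1 if z > 0, else 0.
    return int(z > 0)

def xor_mlp(x1, x2):
    x = np.array([x1, x2])
    # First layer: each unit draws one linear boundary.
    h1 = step(np.dot([1.0, 1.0], x) - 0.5)    # x1 OR x2
    h2 = step(np.dot([-1.0, -1.0], x) + 1.5)  # NOT (x1 AND x2)
    # Second layer: AND of the two half-planes -> piecewise-linear region.
    return step(h1 + h2 - 1.5)

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, xor_mlp(a, b))   # last column prints 0, 1, 1, 0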
Three-layer networks
[Figure: three-layer network with inputs x1, x2, …, xn, hidden layers, and an output layer]
Properties of architecture
• No connections within a layer
• No direct connections between input and output layers
• Fully connected between layers
• Often more than 3 layers
• Number of output units need not equal number of input units
• Number of hidden units per layer can be more or less than
input or output units
Each unit is a perceptron:
y_j = f\left( \sum_{i=1}^{m} w_{ij} x_i + b_j \right)
The bias is often included as an extra weight connected to a constant +1 input
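A minimal sketch of one layer's forward pass in Python (NumPy assumed; the sizes and values are illustrative):

import numpy as np

def layer_forward(x, W, b, f):
    # Implements y_j = f( sum_i w_ij * x_i + b_j ) for every unit j in the layer.
    return f(W @ x + b)

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
x = np.array([0.5, -1.0, 2.0])        # m = 3 inputs
W = np.array([[0.2, -0.4, 0.1],       # one row of weights per unit
              [0.7, 0.3, -0.6]])
b = np.zeros(2)                       # one bias per unit
print(layer_forward(x, W, b, sigmoid))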
What does each layer do?
The 1st layer draws linear boundaries, the 2nd layer combines those
boundaries, and the 3rd layer can generate arbitrarily complex boundaries.
Backpropagation
note: in this example, the activation function is omitted to ease the calculation
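Since the slide's worked example did not survive extraction, here is a hedged sketch of the same idea in Python (the two-weight network and the numbers are illustrative assumptions, not the slide's values): with the activation function omitted the units are linear, so the chain rule gives the gradients directly.

# Tiny linear network: h = w1*x, y = w2*h, squared error E = 0.5*(y - t)^2.
x, t = 1.5, 1.0            # illustrative input and target
w1, w2 = 0.8, -0.5         # illustrative initial weights
lr = 0.1                   # learning rate

h = w1 * x                 # forward pass
y = w2 * h                 # output (no activation function)
dE_dy = y - t              # backward pass starts at the error
dE_dw2 = dE_dy * h         # chain rule: dE/dw2 = dE/dy * dy/dw2
dE_dw1 = dE_dy * w2 * x    # chain rule: dE/dw1 = dE/dy * dy/dh * dh/dw1
w1 -= lr * dE_dw1          # gradient-descent weight updates
w2 -= lr * dE_dw2
print(w1, w2)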
• Backpropagation, short for “backward propagation of errors”, is a
mechanism used to update the weights using gradient descent. It
calculates the gradient of the error function with respect to the neural
network’s weights. The calculation proceeds backwards through the
network.
• Gradient descent is an iterative optimization algorithm for finding the
minimum of a function; in our case we want to minimize the error
function. To find a local minimum of a function using gradient descent,
one takes steps proportional to the negative of the gradient of the
function at the current point.
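As a small illustration of that update rule (the function and step size below are arbitrary choices for the sketch, not tied to any network):

# Gradient descent on E(w) = (w - 3)^2, whose gradient is dE/dw = 2*(w - 3).
w = 0.0                  # arbitrary starting point
lr = 0.1                 # learning rate (step size)
for _ in range(50):
    grad = 2 * (w - 3)   # gradient at the current point
    w -= lr * grad       # step proportional to the negative gradient
print(w)                 # converges towards the minimum at w = 3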
Activation Function
Left: The sigmoid non-linearity squashes real numbers to the range [0, 1].
Right: The tanh non-linearity squashes real numbers to the range [-1, 1].
Left: The Rectified Linear Unit (ReLU) activation function, which is zero when x < 0 and linear with slope 1
when x > 0.
Right: A plot from the Krizhevsky et al. paper indicating the 6x improvement in convergence with the ReLU
unit compared to the tanh unit.
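A minimal sketch of those three activation functions in Python (NumPy assumed):

import numpy as np

def sigmoid(z):
    # Squashes real numbers to values between 0 and 1.
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    # Squashes real numbers to values between -1 and 1.
    return np.tanh(z)

def relu(z):
    # Zero for z < 0, linear with slope 1 for z > 0.
    return np.maximum(0.0, z)

z = np.array([-2.0, 0.0, 2.0])
print(sigmoid(z), tanh(z), relu(z))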