
Introduction to Neural Networks
Computer Vision Applications and Training Techniques

By
T. Sarweswara Reddy
M.Tech Student
VIT University
India
What is a Neural Network?
The question: 'What is a neural network?'
-- Pinkus (1999)

A method of computing based on the interaction of multiple connected processing elements
What can a Neural Network do?
 Compute a known function
 Approximate an unknown function
 Pattern Recognition
 Signal Processing
 Learn to do any of the above

Basic Concepts
A Neural Network generally maps a set of inputs to a set of outputs

The number of inputs/outputs is variable

The network itself is composed of an arbitrary number of nodes with an arbitrary topology
Basic Concepts
Definition of a node:

• A node is an element which performs the function
  y = fH(∑(wi·xi) + Wb)

[Diagram: inputs arrive over weighted connections; the node sums them, adds the bias Wb, and applies fH]
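The node function above can be sketched directly; `node_output` and the step function `u` are illustrative names, not from the slides.

```python
# A single node: weighted sum of inputs plus a bias, passed through
# an activation function fH, following the slide's y = fH(sum(wi*xi) + Wb).

def node_output(fH, w, x, wb):
    return fH(sum(wi * xi for wi, xi in zip(w, x)) + wb)

# Example with a linear threshold (unit step) activation:
u = lambda s: 1 if s >= 0 else 0
y = node_output(u, [0.5, -0.3], [1, 1], -0.1)   # 0.5 - 0.3 - 0.1 = 0.1 >= 0, so y = 1
```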
Simple Perceptron
 Binary logic application
 fH(x) = u(x) [linear threshold]
 Wi = random(-1,1)
 Y = u(W0X0 + W1X1 + Wb)
 Now how do we train it?

[Diagram: Input 0 and Input 1 are weighted by W0 and W1, summed with the bias Wb, and passed through fH(x) to produce the Output]

Basic Training
 Perceptron learning rule
ΔWi = η · (D – Y) · Xi
 η = Learning Rate
 D = Desired Output
 Adjust weights based on how well the current weights match the objective
Logic Training
 Expose the network to the logical OR operation:

X0  X1  D
0   0   0
0   1   1
1   0   1
1   1   1

 Update the weights after each epoch
 As the output approaches the desired output for all cases, ΔWi will approach 0
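A minimal sketch of this training procedure, applying the rule ΔWi = η(D − Y)Xi to logical OR with batch updates once per epoch. The initial weights are fixed here for reproducibility; the slides draw them from random(-1, 1), and η is an illustrative choice.

```python
# Perceptron trained on logical OR with a linear threshold activation.
u = lambda s: 1 if s >= 0 else 0
patterns = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]  # OR truth table

w0, w1, wb = 0.2, -0.4, -0.3    # fixed init for reproducibility
eta = 0.1

for epoch in range(100):
    dw0 = dw1 = dwb = 0.0
    for (x0, x1), d in patterns:
        e = eta * (d - u(w0 * x0 + w1 * x1 + wb))
        dw0 += e * x0
        dw1 += e * x1
        dwb += e                 # bias input is fixed at 1
    w0, w1, wb = w0 + dw0, w1 + dw1, wb + dwb
    if dw0 == dw1 == dwb == 0.0:  # ΔWi reached 0 for all cases
        break

print([u(w0 * x0 + w1 * x1 + wb) for (x0, x1), _ in patterns])  # → [0, 1, 1, 1]
```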
Results
[Plot: the weights W0, W1 and Wb over the course of training]
Details
 Network converges on a hyper-plane decision surface
 X1 = –(W0/W1)X0 – (Wb/W1)

[Plot: the decision line in the (X0, X1) plane]
Typical Activation Functions
 F(x) = 1 / (1 + e^(–k·∑(wi·xi)))

 Shown for k = 0.5, 1 and 10

 Using a nonlinear function which approximates a linear threshold allows a network to approximate nonlinear functions
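The sigmoid above can be evaluated for the three k values the slide plots; the function name is an illustrative choice.

```python
import math

# Sigmoid activation F(x) = 1 / (1 + e^(-k*x)), where x stands in for the
# weighted input sum; larger k makes the curve approach the unit step.

def sigmoid(x, k=1.0):
    return 1.0 / (1.0 + math.exp(-k * x))

for k in (0.5, 1.0, 10.0):
    print(k, sigmoid(-2.0, k), sigmoid(0.0, k), sigmoid(2.0, k))
# sigmoid(0, k) is always 0.5; with k = 10 the curve is nearly a threshold
```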
Back-Propagated Delta Rule Networks (BP)
 Inputs are put through a 'Hidden Layer' before the output layer
 All nodes connected between layers
BP Network Details
 Forward Pass:
• Error is calculated from outputs
• Used to update output weights
 Backward Pass:
• Error at hidden nodes is calculated by
back propagating the error at the
outputs through the new weights
• Hidden weights updated
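The forward and backward passes can be sketched on a tiny network. This assumes sigmoid activations throughout and per-pattern gradient-descent updates; the network sizes, learning rate, and initial weights are illustrative choices, not from the slides.

```python
import math

# 2 inputs -> 2 hidden nodes -> 1 output, trained on logical OR.
def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

A = [[0.3, -0.2, 0.1], [-0.4, 0.25, -0.1]]   # hidden weights (last entry = bias)
B = [0.2, -0.3, 0.05]                         # output weights (last entry = bias)
eta = 0.5
patterns = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]

def forward(x0, x1):
    v = [sigmoid(a[0] * x0 + a[1] * x1 + a[2]) for a in A]   # hidden outputs
    o = sigmoid(B[0] * v[0] + B[1] * v[1] + B[2])            # network output
    return v, o

def total_error():
    return sum((d - forward(x0, x1)[1]) ** 2 for (x0, x1), d in patterns)

e_before = total_error()
for epoch in range(2000):
    for (x0, x1), d in patterns:
        v, o = forward(x0, x1)
        # Forward pass: error at the output, used to update output weights.
        delta_o = (d - o) * o * (1 - o)
        # Backward pass: propagate the output error through B to the hidden nodes.
        delta_h = [delta_o * B[j] * v[j] * (1 - v[j]) for j in range(2)]
        for j in range(2):
            B[j] += eta * delta_o * v[j]
            A[j][0] += eta * delta_h[j] * x0
            A[j][1] += eta * delta_h[j] * x1
            A[j][2] += eta * delta_h[j]
        B[2] += eta * delta_o
print(e_before, "->", total_error())   # squared error shrinks as training proceeds
```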
In Matrix Form
For: n inputs, m hidden nodes and q outputs

 olk is the output of the lth neuron for the kth of p patterns
 vk is the output of the hidden layer
 ok is the network's output vector
Matrix Tricks
 E(A, B) = Σk=1..p (tk – ok)T(tk – ok)
 tk denotes the true output vectors

 The optimal output weight matrix B can be computed directly if fH-1(t) is known
 B′ = fH-1(t)vT(vvT)*  (* denotes the pseudo-inverse)

 So… E(A, B) = E(A, B(A)) = E′(A)
 Which makes our weight space much smaller
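The least-squares step above can be sketched in miniature. This assumes fH is the identity on the output layer (so fH-1 drops out), m = 2 hidden nodes, and targets generated from a known weight vector so recovery can be checked; all names and values are illustrative.

```python
# With the hidden outputs v fixed, the optimal output weights follow directly:
# B' = t v^T (v v^T)^-1 (the pseudo-inverse reduces to an inverse when v v^T
# is a full-rank 2x2 matrix, as here).

# Hidden-layer outputs for p = 4 patterns (rows: hidden node, cols: pattern)
v = [[0.2, 0.9, 0.4, 0.7],
     [0.5, 0.1, 0.8, 0.3]]
true_B = [1.5, -2.0]                                   # pretend-unknown output weights
t = [sum(true_B[i] * v[i][k] for i in range(2)) for k in range(4)]  # targets (q = 1)

# t v^T (1x2) and v v^T (2x2)
tvT = [sum(t[k] * v[i][k] for k in range(4)) for i in range(2)]
vvT = [[sum(v[i][k] * v[j][k] for k in range(4)) for j in range(2)] for i in range(2)]

# Invert the 2x2 matrix v v^T and form B' = (t v^T)(v v^T)^-1
det = vvT[0][0] * vvT[1][1] - vvT[0][1] * vvT[1][0]
inv = [[vvT[1][1] / det, -vvT[0][1] / det],
       [-vvT[1][0] / det, vvT[0][0] / det]]
B = [sum(tvT[i] * inv[i][j] for i in range(2)) for j in range(2)]
print(B)   # recovers true_B up to rounding: [1.5, -2.0]
```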
Hybrid LS RS/SA/GA Training
 Delta rule training may converge to a local minimum

 Hybrid Global Learning (HGL) will converge on a global minimum
• Randomize A [-0.5, 0.5]
• Minimize the error function E′(A)
Random Sampling
 Randomize A [-0.5, 0.5]
 Loop for a long time
• Anew = A + N(0, σ)
• σ = σ(1 – dec)
• Compute Bnew = B(Anew)
• if E(Anew, Bnew) < E(A, B)
 A = Anew; B = Bnew; σ = σbound
• if E(A, B) < THRESH
 Training complete
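The loop above can be made runnable on a toy one-dimensional "error function": perturb A with Gaussian noise, keep the move only if the error drops, shrink σ between accepted moves, and reset it on acceptance. E, the constants, and the scalar A are all illustrative stand-ins for the slide's weight matrices.

```python
import random

def E(a):
    return (a - 2.0) ** 2           # stand-in for E'(A); minimum at a = 2

random.seed(1)
A = random.uniform(-0.5, 0.5)       # Randomize A [-0.5, 0.5]
e0 = E(A)
sigma, sigma_bound, dec, THRESH = 1.0, 1.0, 0.01, 1e-4

for step in range(10_000):          # loop for a long time
    if E(A) < THRESH:
        break                       # training complete
    A_new = A + random.gauss(0, sigma)
    sigma *= (1 - dec)
    if E(A_new) < E(A):
        A = A_new                   # accept the better sample
        sigma = sigma_bound         # reset the step size
print(A, E(A))
```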
Other methods
 Simulated Annealing
• More accurate results
• Much slower
 Genetic Algorithms
• More accurate results
• Slower
Alternative Activation functions
 Radial Basis Functions
• Square
• Triangle
• Gaussian!

 (μ, σ) can be varied at each hidden node to guide training

[Diagram: Input 0, Input 1, ..., Input n feed a layer of fRBF(x) nodes, whose outputs feed fH(x) nodes]
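A Gaussian RBF node can be sketched as follows; the function name is an illustrative choice.

```python
import math

# Gaussian radial basis function: the response peaks at the centre mu and
# falls off with width sigma, so each hidden node can be tuned to respond
# to a different region of the input space.

def gaussian_rbf(x, mu, sigma):
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))

print(gaussian_rbf(0.0, 0.0, 1.0))   # 1.0 at the centre
print(gaussian_rbf(3.0, 0.0, 1.0))   # ~0.011 far from the centre
```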
Alternate Topologies
 Inputs analyze the signal at multiple points in time

 RBF functions may be used to select a 'window' in the input data
Typical Topologies
 Set of inputs
 Set of hidden nodes
 Set of outputs

 Too many nodes make the network hard to train
Supervised vs. Unsupervised
 Previously discussed networks are 'supervised'
• Need to be trained ahead of time with lots of data
 Unsupervised networks adapt to the input
• Applications in clustering and dimensionality reduction
• Learning may be very slow
Applications:
 Investment Analysis
• Predicting movement of stocks
• Replacing earlier linear models
 Signature Analysis
• Bank Checks, VISA, etc.
 Process Control
• Chemistry related
 Monitoring
• Sensor networks may gather more data than can be
processed by operators
• Inputs: cues from camera data, vibration levels, sound, radar, lidar, etc.
• Output: Number of people at a terminal, engine warning
light, control for light switch
References

[1] L. Smith, ed. (1996, 2001), "An Introduction to Neural Networks", URL: http://www.cs.stir.ac.uk/~lss/NNIntro/InvSlides.html
[2] W. S. Sarle, ed. (1997), "Neural Network FAQ", URL: ftp://ftp.sas.com/pub/neural/FAQ.html
[3] StatSoft, "Neural Networks", URL: http://www.statsoftinc.com/textbook/stneunet.html
[4] S. Cho, T. Chow, and C. Leung, "A Neural-Based Crowd Estimation by Hybrid Global Learning Algorithm", IEEE Transactions on Systems, Man and Cybernetics, Part B, No. 4, 1999.
Thank You
