Neural Networks Problem Set 1 Guide

The document provides instructions for a neural networks homework assignment. Students are asked to: 1) Generate training data for a 2D nonlinear function and split it into training, validation, and test sets. 2) Train a multi-layer perceptron on the function using backpropagation, experimenting with different network sizes, initialization parameters, and learning rates. 3) Analyze learning curves and report on how to choose the network size based on factors like training error reduction and avoiding overfitting.

Uploaded by

Hendra Lim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

298 views2 pages

Neural Networks Problem Set 1 Guide

Uploaded by

Hendra Lim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

K. N. Toosi University of Tech.

Control Department, fall 2009 - Neural Networks

Instructor: Dr. Mohammad Teshnehlab
TA: M. Ahmadieh Khanesar, email: [Link]@[Link]
Neural Networks Problem Set #1
(Due before class on Wednesday, 1388/8/6)

Back-propagation Algorithm
In this problem, we will investigate the two identification problems.
Consider a 2 dimensional nonlinear function which is fully describable using the following neural
networks:
2 1 1 2 1 1 2 2
0.5 0.5 0.1
0.1 0.2 0.1
( ) *tansig( ) , , , [0.2 0.2 0.5 0.5], 0.6
0.5 0.3 0.7
1 1 0.5
f x w w x b b w b w b
( (
( (
( (
= + + = = = =
( (
( (

Generate 1000 pairs of data for inputs and compute the output. Determine the proper range for which
no saturation for Tansig functions happens. Use 50 samples for validation and 2/3 of remaining
samples for train and 1/3 for the test.
Write your code so that in each epoch you cycle through every individual data in the training set
(making an online update after each) and through every individual data in the test set (no updates,
just record the error).
We will look at the epoch squared error. The squared error for an epoch is simply the sum of the
squared errors for each individual in the data set. This is precisely the function that is minimized in the
gradient descent calculation.
For the multi-layer perceptron with one hidden layer, experiment with different sizes n of the neurons,
initial conditions for the weights and biases, and learning rate. Settle on particular choices of these
parameters, and run the backprop algorithm until convergence. As your final results, write a report on
how to choose the size of the network. Learning curves of training squared error and test squared
error as function of epoch number must be included in the report. Try to derive information from these
curves as much as possible. For example try to answer these questions: For your best parameters
how does the squared error evolve over time for the training set? What about bad parameters? What
about the test set? How do the two curves compare? Is there any case in which the parameters of
neural networks do not converge to the optimal values but the identification error is good? Is it
possible to compensate the wrong number of neurons with a good selection of epochs? About the
biasing parameters, is it possible to omit the biasing parameters especially the biasing parameters in
the output layer by using more neurons? If yes provide simulations and if not provide reasons. About
the input data for identification, does the convergence to the optimal values depend on them? What
can you add to the above questions?
Here is a new function for identification. Do the above mentioned procedure for the following function:

sin( ) sin( )
( , ) , , [ 2, 2]
x y
f x y x y
x y
= e
Use the rules of thumbs in the next page for this function and investigate them. Which one is violated
here?

Subject: How many hidden units should I use?
The best number of hidden units depends in a complex way on:
- the numbers of input and output units
- the number of training cases
- the amount of noise in the targets
- the complexity of the function or classification to be learned
- the architecture
- the type of hidden unit activation function
- the training algorithm
- regularization
In most situations, there is no way to determine the best number of hidden units without
training several networks and estimating the generalization error of each. If you have too few
hidden units, you will get high training error and high generalization error due to underfitting
and high statistical bias. If you have too many hidden units, you may get low training error
but still have high generalization error due to overfitting and high variance. Geman,
Bienenstock, and Doursat (1992) discuss how the number of hidden units affects the
bias/variance trade-off.
Some books and articles offer "rules of thumb" for choosing an architecture; for example:
- "A rule of thumb is for the size of this [hidden] layer to be somewhere between the
input layer size ... and the output layer size ..." (Blum, 1992, p. 60).
- "To calculate the number of hidden nodes we use a general rule of: (Number of inputs
+ outputs) * (2/3)".
- "you will never require more than twice the number of hidden units as you have
inputs" in an MLP with one hidden layer (Swingler, 1996, p. 53).
- "How large should the hidden layer be? One rule of thumb is that it should never be
more than twice as large as the input layer." (Berry and Linoff, 1997, p. 323).
- "Typically, we specify as many hidden nodes as dimensions [principal components]
needed to capture 70-90% of the variance of the input data set." (Boger and
Guterman, 1997)
Reference
[Link]/faqs/ai-faq/neural-nets/part3/[Link]

Lecture W15ab
No ratings yet
Lecture W15ab
44 pages
MATLAB Neural Networks Guide
No ratings yet
MATLAB Neural Networks Guide
54 pages
Feedforward Neural Networks Overview
100% (1)
Feedforward Neural Networks Overview
27 pages
cst414 - Deep Learning
No ratings yet
cst414 - Deep Learning
34 pages
Unit 4
No ratings yet
Unit 4
13 pages
Fitting a Multilayer Perceptron Model
No ratings yet
Fitting a Multilayer Perceptron Model
9 pages
Deep Learning Challenges & Solutions
No ratings yet
Deep Learning Challenges & Solutions
64 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
26 pages
Unit 3
No ratings yet
Unit 3
110 pages
Understanding Multi-Layer Neural Networks
No ratings yet
Understanding Multi-Layer Neural Networks
14 pages
Multilayer Perceptron Network Analysis
No ratings yet
Multilayer Perceptron Network Analysis
7 pages
Artificial Neural NetworkIV
No ratings yet
Artificial Neural NetworkIV
6 pages
Unit 3
No ratings yet
Unit 3
12 pages
Understanding Multilayer Perceptrons
No ratings yet
Understanding Multilayer Perceptrons
23 pages
Cs3491-Artificial Intelligence and Machine Learning-1221091049-Unit 5 Aiml
No ratings yet
Cs3491-Artificial Intelligence and Machine Learning-1221091049-Unit 5 Aiml
38 pages
Unit 5 (Second Half)
No ratings yet
Unit 5 (Second Half)
10 pages
Approach To The Synthesis of Neural Network Structure During Classification
No ratings yet
Approach To The Synthesis of Neural Network Structure During Classification
7 pages
Unit-5: Introduction To Deep Learning: Artificial Neural Networks
No ratings yet
Unit-5: Introduction To Deep Learning: Artificial Neural Networks
14 pages
Backpropagation in Neural Networks
No ratings yet
Backpropagation in Neural Networks
38 pages
Neural Networks and Deep Learning Overview
No ratings yet
Neural Networks and Deep Learning Overview
9 pages
Lecture 2
No ratings yet
Lecture 2
52 pages
Module 2
No ratings yet
Module 2
44 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
34 pages
Understanding Deep Learning Basics
No ratings yet
Understanding Deep Learning Basics
73 pages
Optimum Architecture of Neural Networks Lane Following System
No ratings yet
Optimum Architecture of Neural Networks Lane Following System
6 pages
Understanding Artificial Neural Networks
No ratings yet
Understanding Artificial Neural Networks
42 pages
Machine Learning Lecture II Overview
No ratings yet
Machine Learning Lecture II Overview
118 pages
Question 105A
No ratings yet
Question 105A
33 pages
Neural Network Basics and Applications
No ratings yet
Neural Network Basics and Applications
53 pages
MATLAB Neural Networks Guide
No ratings yet
MATLAB Neural Networks Guide
4 pages
Neural Network Initialization and Fitting
No ratings yet
Neural Network Initialization and Fitting
3 pages
Neural Network Training Guide
No ratings yet
Neural Network Training Guide
44 pages
Ad3451 ML Unit 4 Notes Eduengg
No ratings yet
Ad3451 ML Unit 4 Notes Eduengg
36 pages
Neural Network Adaptive Filtering Techniques
No ratings yet
Neural Network Adaptive Filtering Techniques
4 pages
Backpropagation in Feedforward Neural Networks
No ratings yet
Backpropagation in Feedforward Neural Networks
19 pages
Key Issues in Neural Networks
No ratings yet
Key Issues in Neural Networks
17 pages
Neural Network Introduction
No ratings yet
Neural Network Introduction
5 pages
CC511 Week 5 - 6 - NN - BP
No ratings yet
CC511 Week 5 - 6 - NN - BP
62 pages
3 Neural
No ratings yet
3 Neural
27 pages
Neural Network
100% (2)
Neural Network
54 pages
Deep Learning: Neural Networks Overview
No ratings yet
Deep Learning: Neural Networks Overview
44 pages
Neural Lab 1
No ratings yet
Neural Lab 1
5 pages
Ai Unit 5
No ratings yet
Ai Unit 5
33 pages
Machine Learning Exam Prep
No ratings yet
Machine Learning Exam Prep
16 pages
NNlec 8
No ratings yet
NNlec 8
12 pages
Deep Neural Network Optimization Guide
100% (1)
Deep Neural Network Optimization Guide
84 pages
Module 2 DL Snotes P1
No ratings yet
Module 2 DL Snotes P1
16 pages
Multi-Layer Perceptron Overview and Learning
No ratings yet
Multi-Layer Perceptron Overview and Learning
39 pages
Introduction to Multilayer Neural Networks
No ratings yet
Introduction to Multilayer Neural Networks
53 pages
Neural Networks and Learning Algorithms
No ratings yet
Neural Networks and Learning Algorithms
33 pages
MATLAB Neural Network Examples
No ratings yet
MATLAB Neural Network Examples
3 pages
DL Lab Manual
No ratings yet
DL Lab Manual
52 pages
Soft Computing Unit 2
No ratings yet
Soft Computing Unit 2
28 pages
Training Feedforward Neural Networks
No ratings yet
Training Feedforward Neural Networks
22 pages
Learning Activation Functions in Neural Networks
No ratings yet
Learning Activation Functions in Neural Networks
10 pages
Matlab (FUAD MAHFUDIANTO)
No ratings yet
Matlab (FUAD MAHFUDIANTO)
14 pages
Unit 3
No ratings yet
Unit 3
7 pages
EPAP Brochure
No ratings yet
EPAP Brochure
21 pages
940648056-Research Methodology and Review
No ratings yet
940648056-Research Methodology and Review
5 pages
Psychology Course Syllabus Overview
No ratings yet
Psychology Course Syllabus Overview
5 pages
Ensuring Quality in Organizations
No ratings yet
Ensuring Quality in Organizations
3 pages
Correlation of Mother's Knowledge On Child
No ratings yet
Correlation of Mother's Knowledge On Child
8 pages
Strategic Management Process Overview
No ratings yet
Strategic Management Process Overview
5 pages
Statistical Concepts For Criminal Justice and Criminology 1st Edition by Frank P. Williams III Question Bank
No ratings yet
Statistical Concepts For Criminal Justice and Criminology 1st Edition by Frank P. Williams III Question Bank
25 pages
LETTER
No ratings yet
LETTER
2 pages
Zie Proposal
No ratings yet
Zie Proposal
18 pages
Role of ICT in The Agriculture Sector
No ratings yet
Role of ICT in The Agriculture Sector
8 pages
SKKU Handbook
No ratings yet
SKKU Handbook
50 pages
Current Insight Into Traditional and Modern Methods
No ratings yet
Current Insight Into Traditional and Modern Methods
50 pages
AP SCERT PhysicalScience AcademicStandards Updated
No ratings yet
AP SCERT PhysicalScience AcademicStandards Updated
2 pages
Safety Culture Awareness in Klang Construction
No ratings yet
Safety Culture Awareness in Klang Construction
28 pages
Final Thesis Paper
85% (34)
Final Thesis Paper
24 pages
Insights from Corpus Software on Language Development
No ratings yet
Insights from Corpus Software on Language Development
29 pages
Optimization Techniques in Pharmaceutical Formulations and Processing
96% (26)
Optimization Techniques in Pharmaceutical Formulations and Processing
27 pages
Cognitive Dysfunction in OCD Explained
No ratings yet
Cognitive Dysfunction in OCD Explained
11 pages
2021 Abbasnasab Sarderehetal Tching Statsacceptedauthor
No ratings yet
2021 Abbasnasab Sarderehetal Tching Statsacceptedauthor
42 pages
Research Program Manager Profile
No ratings yet
Research Program Manager Profile
2 pages
Danstan Assignment
No ratings yet
Danstan Assignment
18 pages
Sample Size Justification Methods
No ratings yet
Sample Size Justification Methods
28 pages
Module 3 NSTP 2
No ratings yet
Module 3 NSTP 2
9 pages
Industrial Organizational Psychology 5th Edition by Paul Levy Question Bank
No ratings yet
Industrial Organizational Psychology 5th Edition by Paul Levy Question Bank
15 pages
MATHEMATICS
No ratings yet
MATHEMATICS
49 pages
Secure Software Dev Practices
No ratings yet
Secure Software Dev Practices
18 pages
The Integrative Strategies of Teaching
100% (1)
The Integrative Strategies of Teaching
22 pages
Importance of Internship
No ratings yet
Importance of Internship
6 pages
Super Long Qualitative Study SHS Strand Availability
No ratings yet
Super Long Qualitative Study SHS Strand Availability
2 pages
Chapter 7 SM Test
No ratings yet
Chapter 7 SM Test
28 pages

Neural Networks Problem Set 1 Guide

Uploaded by

Neural Networks Problem Set 1 Guide

Uploaded by

K. N. Toosi University of Tech.

Control Department, fall 2009 - Neural Networks

You might also like