k
Back Propagation
Neural Network
For the multi
–
layer neural network that you will be implementing in the following problems, you may use 
either the hyperbolic tangent or the sigmoid for the activation function.  Obviously you will be implementing 
the back propagation method to train the n
etwork.  Your code should include an arbitrary method that allows 
you to code it for any number of input dimensions, hidden layers, neurons and output neurons, (i.e. you need 
it to be able to allow you to change a variable and it changes the number of laye
rs, or the number of neurons 
at a given layer, or the number of dimensions, or the number of output layers…in other words this cannot be 
hard coded
for a specific data set
).   
1.
(25pts) 
Develop a multi
–
layer neural network to solve the 
regression problem: f
(x) = 1/x
, (for only the 
positive values of x)
.  Be sure to create a testing and training set.  
I am 
NOT
providing you with data. 
The 
number of hidden layers and neurons 
is your discretion, however, you must provide an explanation as to why 
the number of l
ayers and nodes you used was a good choice.  You will need to create multiple different 
structures that you train and p
lot this error for the different models to show why you chose the final model 
that you did for this problem.
Report your network configu
ration, and comment on your observations 
regarding the performance of your network as you try to determine the number of hidden layers and hidden 
nodes of your final network, once you determine the correct model
:
a.
(10pts) 
T
rack 
your training and testing history.  
This means that you 
check your training performance 
and your testing performance at multiples of some fixed number of 
iterations
(and over many Epochs 
also)
[thus you will provide a graph of this information]
, implement 
the online learning method.
Be 
sure to label plots appropriately.
(Remember to scale the data in the range that provides the best 
results for your activation function
… normalization is a must)
b.
(5pts) 
What did you observe regarding the value of the learni
ng parameter and how the network 
performed given
: 
(do one of each)  
i.
(2.5pts)
a fixed value, 
ii.
(2.5pts) 
and a time decreasing value 
c.
(2.5pts) 
Choose a couple of points beyond your training set (i.e., if your max training input 
is 
x=10, 
try testing your netw
ork with, say, x=10.5, x=
12
, and x=
30
).  What do you observe regarding the 
networks ability to generalize for data that is beyond its training set (note, you may have to increase 
the v
alue of x 
to a large number 
to get a good idea
to answer this
)
.
d.
(2.5pts)
Briefly comment on 
the extrapolation capability 
compared 
to the interpolation capability
of 
the network.
e.
(5pts) 
Plot the final results
showing the capability of your network to determine the function f(x) = 
1/x versus the function f(x) = 1/x.
2.
(25pts)
Develop a multi
–
layer neural network to
classif
y
the 
IRIS 
data set.  
Use K
–
Fold Cross Validation for 
this problem.
The code you use for thi
s problem will be an extension of the code you created for problem 1.
a.
(12pts) Implement the NN and r
eport your network configuration: number of hidden layers, number 
of nodes per hidden layer, learning rate/learning schedule, encoding of the output
, etc.
b.
(
5pts) 
Plot 
the 
error metric versus the number of training steps (similar to problem 
1
).
c.
(8pts) 
Comment on how well your network learned the data.  Things to think about: did the network 
classify the data well (or not), and why (or why not); how well did it
classify each class independently 
(you might consider contingency
table
s for this
); and what observation do you have regarding number 
of training samples
: was the normalization technique I used adequat