Some Quick Notes:
- Training has been completed
- No, the two layer neural net cannot model non-linear decision boundaries, because it only learns coefficients on linear variables: the output is a weighted sum of the inputs, so the decision boundary is always a straight line (a hyperplane). Without a hidden layer there is no intermediate representation to build non-linear features from, so the model is limited to a linear boundary.
- Yes, the 3 layer model can learn non-linear decision boundaries, as the hidden layer enables higher order representations like curves. Practically, the hidden layer passes weighted sums through non-linear activations, creating compounded activations, which is akin to creating new non-linear variables for the output layer to combine. (See the sketch below.)
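A minimal sketch of this difference, using the classic XOR problem (my own illustration, not part of the original exercise): no weighted sum of the two inputs can separate XOR, but a hand-wired network with one ReLU hidden layer represents it exactly.

```python
import numpy as np

# XOR: no single line separates the 1s from the 0s.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

def relu(z):
    return np.maximum(0, z)

# Hand-picked weights for a 2-2-1 network:
# h1 = relu(x1 + x2), h2 = relu(x1 + x2 - 1), out = h1 - 2*h2
W1 = np.array([[1.0, 1.0], [1.0, 1.0]])  # input -> hidden weights
b1 = np.array([0.0, -1.0])               # hidden biases
w2 = np.array([1.0, -2.0])               # hidden -> output weights

hidden = relu(X @ W1 + b1)  # the non-linear features a 2-layer net lacks
out = hidden @ w2
print(out)  # [0. 1. 1. 0.] -- matches XOR exactly
```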
- Learning rate affects the aggressiveness of training; lower rates yield more reliable training, but longer processing time. Higher rates train faster but risk overshooting the minimum and failing to converge. (See the sketch below.)
- A smaller rate (e.g. 0.001) means smaller gradient steps per update
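A toy illustration of that trade-off (my own sketch, not from the exercise): plain gradient descent on f(w) = w², whose gradient is 2w, run with three different learning rates.

```python
# Gradient descent on f(w) = w^2 (gradient: 2w), starting at w = 1.0.
def descend(lr, steps=50):
    w = 1.0
    for _ in range(steps):
        w -= lr * 2 * w  # step against the gradient
    return w

for lr in (0.001, 0.1, 1.1):
    print(f"lr={lr}: w after 50 steps = {descend(lr):.4g}")
# lr=0.001 -> ~0.905  (barely moved: stable but slow)
# lr=0.1   -> ~1.4e-05 (converged quickly)
# lr=1.1   -> ~9100   (each step overshoots and grows: diverged)
```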
- Changing the number of hidden nodes changes the complexity of the curve; more nodes results in a curve with more local minima or maxima (or in plain English: bendiness). See the sketch below.
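A rough way to see this (again my own sketch): a one-hidden-layer tanh network in 1-D is a sum of S-shaped bumps, so more hidden units allow more bends. Here I count direction changes in randomly-initialized networks of different widths.

```python
import numpy as np

rng = np.random.default_rng(0)
xs = np.linspace(-3, 3, 400)

def random_net(width):
    """f(x) = sum_i v_i * tanh(w_i * x + b_i), with random weights."""
    w, b, v = rng.normal(size=(3, width)) * 2
    return np.tanh(np.outer(xs, w) + b) @ v

for width in (1, 3, 10, 30):
    f = random_net(width)
    bends = np.sum(np.diff(np.sign(np.diff(f))) != 0)
    print(f"{width:>2} hidden nodes: {bends} direction changes")
# More hidden units -> more bends available in the learned curve.
```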
- What is overfitting & why does it occur?
- Overfitting is when the error on a training set is low, but the error rate on new data is high. It occurs when the model has enough capacity to memorize the noise and quirks of the training set instead of the underlying pattern, so what it learns does not generalize. (See the sketch below.)
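A quick illustration (my own, with made-up data): fitting a high-degree polynomial to a handful of noisy points drives training error to roughly zero while error on fresh points from the same curve blows up.

```python
import numpy as np

rng = np.random.default_rng(1)
true_fn = np.sin

# 10 noisy training points, 100 clean test points from the same curve.
x_train = np.linspace(0, 3, 10)
y_train = true_fn(x_train) + rng.normal(scale=0.1, size=10)
x_test = np.linspace(0, 3, 100)
y_test = true_fn(x_test)

for degree in (2, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree}: train MSE {train_err:.4f}, test MSE {test_err:.4f}")
# The degree-9 fit passes through every training point (train MSE ~ 0)
# but wiggles between them, so its test MSE is much worse.
```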
- What are three possible solutions?
- The first solution is to simply increase the size and variation of the training data set. It’s a really straightforward fix, since it only means gathering more (and more varied) inputs.
- The next two methods work by biasing the model towards a particular kind of solution:
- The second possible solution is L1 regularization, which penalizes the sum of the absolute values of the weights, pushing as many weights as possible to exactly zero so that the fewest inputs have non-zero weights. This is commonly known as creating a sparse solution.
- The third possible solution is L2 regularization, which penalizes the sum of the squared magnitudes of the weights. It differs from L1 because the penalty is smooth and can be calculated analytically, and it shrinks weights towards zero without forcing them exactly to zero.
- L2 regularization acts as weight decay. Because large weights are penalized quadratically, the network spreads the work across as many nodes as it can, thereby keeping the weights small. In my run this created significantly more jagged edges, but ultimately increased the accuracy. (As an FYI, this is my approximation, as I had L2 regularization enabled as the default.) A comparison sketch follows below.
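To make the L1-vs-L2 contrast concrete, here is a small sketch of my own (not how the playground implements it): linear regression trained by gradient descent with each penalty. The L1 version uses a soft-threshold (proximal) step, so weights can land exactly on zero; the L2 penalty just adds a decay term to each weight's gradient.

```python
import numpy as np

rng = np.random.default_rng(2)

# Synthetic data: 10 features, but only 2 actually matter.
X = rng.normal(size=(200, 10))
true_w = np.zeros(10)
true_w[[0, 1]] = [3.0, -2.0]
y = X @ true_w + rng.normal(scale=0.1, size=200)

lr, lam, steps = 0.01, 0.5, 2000

def grad(w):
    return X.T @ (X @ w - y) / len(y)  # gradient of mean squared error

# L2: penalty lam*||w||^2 adds 2*lam*w to the gradient (weight decay).
w2 = np.zeros(10)
for _ in range(steps):
    w2 -= lr * (grad(w2) + 2 * lam * w2)

# L1: after each gradient step, soft-threshold the weights, so any
# weight whose pull from the data is weaker than lam snaps to exactly 0.
w1 = np.zeros(10)
for _ in range(steps):
    w1 -= lr * grad(w1)
    w1 = np.sign(w1) * np.maximum(np.abs(w1) - lr * lam, 0)

print("L1 weights:", np.round(w1, 3))  # sparse: most entries exactly 0
print("L2 weights:", np.round(w2, 3))  # all shrunk, none forced to 0
print("exact zeros with L1:", int(np.sum(w1 == 0)))  # typically 8 of 10
print("exact zeros with L2:", int(np.sum(w2 == 0)))  # typically 0 of 10
```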