Out of memory issue while training a Neural Network (NN), array exceeds maximum array size preference using backpropJacobianStatic

6 visualizzazioni (ultimi 30 giorni)
Hello, this is my first time asking a question here, I will try to be brief and clear !
I am currently trying to train a NN of 2 hidden layers with 256 neurones both, in input and output i have a 22*size(trainSet) data set. This represent an amout of 77334 weigth + biais, this shouldn't be a problem to train a NN like that since I saw on some post people training much larger NN. But the issue is that when I call the train function somewhere inside the matlab code (in the backpropJacobianStatic) there is a matrix multiplication that create an array of size 77334*77334 (77334 the number of weight), and that takes all the memory creating an out of memory issue (picture bellow of the issue) :
My question is the following : is there a way not to create this matrix of size numberOfWeight*numberOfWeight that take all the memory during the training ? Because I don't really understand why we would need to store this array since we only need a 1*77334 size array to store the weight, no ?
Thank you in advance for your answers, and if you have any question or if I wasn't clear feel free to ask me for more information !

Risposta accettata

Timothee Fichot
Timothee Fichot il 21 Lug 2020
Modificato: Timothee Fichot il 21 Lug 2020
If someone else has the same issu I found a way to solve it : just dont use the default training method which is the Levenberg-Marquardt backpropagation one. This method requires to comput the Jacobian matrix which in my case was too large. To avoid that you can use e.g. trainscg (or any other solver that does not require the jacobian) which does not require to comput the Jacobian matrix but only the gradient, which is much smaler !

Più risposte (2)

Greg Heath
Greg Heath il 17 Lug 2020
A single hidden layer is sufficient.
Hope this helps
Thank you for formally accepting my answer
Greg

Greg Heath
Greg Heath il 17 Lug 2020
A single hidden layer is sufficient.
Hope this helps
Thank you for formally accepting my answer
Greg

Categorie

Scopri di più su Sequence and Numeric Feature Data Workflows in Help Center e File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by