Why Training Set accuracy decrease dramatically after stopping the trainNetwork?

Question

Apri in MATLAB Online

0 voti

After stopping manually trainNetworktrainNetwork, the validation error dropped dramatically:

I tested the Training Set accuracy, and got also about 60%:

predY = classify(net,xTrain);

Any ideas what I'am doing wrong?

4 Commenti
Mostra 2 commenti meno recenti Nascondi 2 commenti meno recenti

Don Mathis il 23 Gen 2019

What is your network architecture? Does it contain dropoutLayers and later BatchNormlization layers?

Sergy Stepura il 1 Feb 2019

Modificato: Sergy Stepura il 4 Feb 2019

Apri in MATLAB Online

The network has simple architecture, 5 fully connected layers with batch normalization + Input layer + Output layer (softmax):

 ''   Image Input             120x1x4 images with 'zerocenter' normalization
 ''   Fully Connected         65 fully connected layer
 ''   Batch Normalization     Batch normalization
 ''   ReLU                    ReLU
 ''   Fully Connected         65 fully connected layer
 ''   Batch Normalization     Batch normalization
 ''   ReLU                    ReLU
 ''   Fully Connected         65 fully connected layer
 ''   Batch Normalization     Batch normalization
 ''   ReLU                    ReLU
 ''   Fully Connected         65 fully connected layer
 ''   Batch Normalization     Batch normalization
 ''   ReLU                    ReLU
 ''   Fully Connected         65 fully connected layer
 ''   Batch Normalization     Batch normalization
 ''   ReLU                    ReLU
 ''   Fully Connected         3 fully connected layer
 ''   Softmax                 softmax
 ''   Classification Output   crossentropyex

Accedi per commentare.

Accedi per rispondere a questa domanda.

Follow Question

Answer 1

Don Mathis il 8 Feb 2019

0 voti

Maybe your minibatch size is too small. The accuracy drop may be due to batchnormalization layers getting finalized, during which time the mean and variance of the incoming activations of each batchnorm layer are computed using the whole training set. If those full-batch statistics don't match the minibatch statistics very well, the finalized batchnorm layers will not be performing a very good normalization.

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

Don Mathis il 11 Feb 2019

You could try increasing the batch size iteratively to see whether that fixes the problem. I would try exponentially increasing: 1000, 2000, 4000, 8000, etc. Or you can just try the largest amount that will fit in your GPU memory right away.

Don Mathis il 11 Feb 2019

Also: Why does your plot show "Iterations per epoch: 1"? Were you using miniBatchSize=30000 in that run?

What are you passing to trainingOptions()?

Accedi per commentare.

Why Training Set accuracy decrease dramatically after stopping the trainNetwork?

4 Commenti
Mostra 2 commenti meno recenti Nascondi 2 commenti meno recenti

Risposte (1)

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

Categorie

Prodotti

Release

Tag

Community Treasure Hunt

Why Training Set accuracy decrease dramatically after stopping the trainNetwork?

4 Commenti Mostra 2 commenti meno recenti Nascondi 2 commenti meno recenti

Risposte (1)

3 Commenti Mostra 1 commento meno recente Nascondi 1 commento meno recente

Categorie

Prodotti

Release

Tag

Vedere anche

Community Treasure Hunt

4 Commenti
Mostra 2 commenti meno recenti Nascondi 2 commenti meno recenti

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente