Disparity between training and evaluation episodes

Question

Ari il 1 Mag 2025

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2176813-disparity-between-training-and-evaluation-episodes

Modificato: Jacob Mathew il 13 Mag 2025

I tried to re-train this DDPG control example using exact same configuration (except the max training episode set to 500) and this is the result I got:

It is quite different compared to the provided screen capture (below), despite the random seed has been fixed reproducibility purpose.

Why are there huge disparities between training and evaluation episodes on my training? Eventhough my training process was successful (evaluation statistic hit 2000), the last training episodes still couldn't match evaluation episodes. How to properly interpret this result? Is this normal?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Jacob Mathew il 13 Mag 2025

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2176813-disparity-between-training-and-evaluation-episodes#answer_1565129

Modificato: Jacob Mathew il 13 Mag 2025

Hey Ari,

I tried running the example on my end and got a different graph in the training compared to both what you have seen and the screenshot in the example guide. But my output was reproduceable across multiple runs of the example:

However, while the path taken during training does seem to vary, the results logged in the scope are inline with that the example expects, which showcases that the training was successful and predictions are in line:

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Disparity between training and evaluation episodes

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

Disparity between training and evaluation episodes

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti