DQN learns at first but then worsens.

Question

Khandakar Rashid il 20 Apr 2021

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens

Commentato: Emmanouil Tzorakoleftherakis il 23 Apr 2021

Hi, I am training a DQN agent with a simevent model. I am testing out different hyperparameters, but everytime the agent learns (reward goes higher) at first for a while, but then goes down. I have tested different learning rate, exploration epsilon, and discount factors. But the shape of training progress is pretty much same in all combinations. Is there any potential way I can fix this issue?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Emmanouil Tzorakoleftherakis il 22 Apr 2021

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens#answer_682275

Modificato: Emmanouil Tzorakoleftherakis il 22 Apr 2021

To confirm that this is an exploration issue, can you try setting the EpsilonMin param to a high value? e.g. 0.99. If after doing that you still see the same result, there is likely something else going on.

2 Commenti
Mostra NessunoNascondi Nessuno

Khandakar Rashid il 23 Apr 2021

Thank you Emmanouil for the suggestion. I have tried Epsilon = 1, EpsilonMin=0.99. Unfortunately, no luck :(

Do you have any other tips?

Emmanouil Tzorakoleftherakis il 23 Apr 2021

Hard to tell, but it's strange to me that the episode curve is similar every time. That makes me think that there is something specific about the way you have modeled your environment model that guides the training through a similar path each time.

Accedi per commentare.

DQN learns at first but then worsens.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

2 Commenti
Mostra NessunoNascondi Nessuno

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

DQN learns at first but then worsens.

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

2 Commenti Mostra NessunoNascondi Nessuno

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

2 Commenti
Mostra NessunoNascondi Nessuno