Episode Q0 increases exponentially

18 views (last 30 days)


DAMODARAN B.K on 16 Feb 2021

Direct link to this question:

https://it.mathworks.com/matlabcentral/answers/746697-episode-q0-increases-exponentially

Edited: DAMODARAN B.K on 17 Feb 2021

Can anyone explain why Episode Q0 in RL training increases exponentially after the reward has converged to a suboptimal policy?


Answers (1)

Emmanouil Tzorakoleftherakis on 16 Feb 2021

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/746697-episode-q0-increases-exponentially#answer_625027

Hello,

Please take a look at this answer for some suggestions. Normalizing observations, rewards, and actions can also help avoid situations like these.
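As a rough illustration of the normalization suggestion, here is a minimal sketch in plain Python (not Reinforcement Learning Toolbox code). It assumes the observation and action bounds are known ahead of time; the helper names, the bounds, and the reward scale of 1000 are all hypothetical values chosen only for the example.

```python
def scale_to_unit(x, low, high):
    """Map x linearly from [low, high] onto [-1, 1]."""
    return 2.0 * (x - low) / (high - low) - 1.0


def unscale_from_unit(u, low, high):
    """Inverse of scale_to_unit: map u from [-1, 1] back to [low, high]."""
    return low + (u + 1.0) * (high - low) / 2.0


def normalize_observation(obs, lows, highs):
    """Scale each observation channel with its own (assumed known) bounds."""
    return [scale_to_unit(x, lo, hi) for x, lo, hi in zip(obs, lows, highs)]


def scale_reward(reward, reward_scale):
    """Divide the raw reward by a constant so the returns the critic must fit stay small."""
    return reward / reward_scale


# Hypothetical bounds for a two-channel observation and a single action.
obs_low, obs_high = [0.0, -50.0], [1000.0, 50.0]
act_low, act_high = -200.0, 200.0

# The agent sees observations in [-1, 1] instead of raw engineering units.
norm_obs = normalize_observation([750.0, -25.0], obs_low, obs_high)

# The agent outputs actions in [-1, 1]; they are mapped back before being applied.
applied_action = unscale_from_unit(0.5, act_low, act_high)

# A raw reward of -1200 becomes -1.2 under the assumed scale of 1000.
scaled_reward = scale_reward(-1200.0, 1000.0)
```

The idea is that the critic only ever sees inputs and targets of order one, so a small estimation error is less likely to be amplified into a very large Q0 value. When bounds are not known in advance, running mean and standard-deviation statistics are a common alternative.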

Hope this helps.

1 Comment

DAMODARAN B.K on 17 Feb 2021

Edited: DAMODARAN B.K on 17 Feb 2021

Is Episode Q0 the critic network output or the target value?


Products

Reinforcement Learning Toolbox
