Transient value problem of the variable in reward function of reinforcement learning

Yihao Wan on 22 Mar 2021

https://it.mathworks.com/matlabcentral/answers/779882-transient-value-problem-of-the-variable-in-reward-function-of-reinforcement-learning

Commented: Yihao Wan on 23 Mar 2021

Accepted Answer: Emmanouil Tzorakoleftherakis

Hello, I encountered a problem when designing the reward function. In the Simulink environment, I want to incorporate some variables in the reward function. During training of the RL agent, these variables only converge after about 0.06 s, while the agent is trained from 0 s. Putting the RL block in a subsystem with an Enable block doesn't help.

From my understanding, this transient will influence the value of the reward function, which may result in a poorly trained agent. Does anyone have any suggestions regarding this question?

Thank you very much.

Accepted Answer

Emmanouil Tzorakoleftherakis on 22 Mar 2021

https://it.mathworks.com/matlabcentral/answers/779882-transient-value-problem-of-the-variable-in-reward-function-of-reinforcement-learning#answer_654817

You can put the RL Agent block under a triggered subsystem and set it to begin training after 0.06 seconds.
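
The idea behind this suggestion can be sketched outside Simulink. The snippet below is plain Python, not a Simulink model, and it is only a rough illustration: the function names, the sample times, and the raw reward values are all hypothetical, and the 0.06 s threshold is taken from the question. Note that it gates the reward signal rather than the agent block itself, which is a related but simpler technique than the triggered subsystem described above.

```python
def gate_signal(t: float, settle_time: float = 0.06) -> bool:
    """Return True once simulation time t has reached the end of the transient."""
    return t >= settle_time


def gated_reward(t: float, raw_reward: float, settle_time: float = 0.06) -> float:
    """Pass the reward through only after the transient; return 0.0 before that."""
    return raw_reward if gate_signal(t, settle_time) else 0.0


# Hypothetical sample times (s) and raw rewards, for illustration only.
times = [0.00, 0.02, 0.04, 0.06, 0.08]
raw = [-5.0, -3.0, -1.5, -0.2, -0.1]

rewards = [gated_reward(t, r) for t, r in zip(times, raw)]
print(rewards)
```

In a Simulink model, the same comparison (simulation time against the settling time) is the kind of signal that could drive the trigger of the subsystem, so that the agent only starts acting and learning once the variables have settled.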

Emmanouil Tzorakoleftherakis on 23 Mar 2021

I believe it should be 40, yes. There is a counter implemented internally that keeps track of how many times the RL Agent block will run.

Yihao Wan on 23 Mar 2021

Thank you very much for your help.

Products: Simulink

Release: R2021a
