How to use the reinforcement learning toolbox in Matlab to implement delayed reward
Mostra commenti meno recenti
I want to implement delayed reward with matlab code. For example, I need to wait until the end of my current episode before giving the reward for each action in this episode. How can I achieve this?
Risposta accettata
Più risposte (1)
MOHAMMADREZA
il 5 Mar 2025
0 voti
Hi, I am having the same problem. Hwever, I am using the Matlab heper (class) for environment. I do not know how to handle reward so that at the end of episode the reward is used for updating the parameters. More specifically, when using class template, I have step, reset,... functions. when the parameters is updated? is it after running step function? I wrote the reward in the step function. but I need to update the parameters only at the end of episode.
Categorie
Scopri di più su Reinforcement Learning in Centro assistenza e File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!