- Ensure that “cos(psi(t)) - cos(psi(t-1)) > 0” is met frequently by checking if psi(t) increases over time.
- Verify that the initial conditions of the delay block are properly set to prevent incorrect first-step evaluations.
- Modify the reward function threshold to make the positive reward condition more lenient.
reward error during training
2 visualizzazioni (ultimi 30 giorni)
Mostra commenti meno recenti
Hello,
Im using reinforcemet designer to train my model,
and here is my problem.
Q. I dont not why my reward cannot go up to 0.1, why is this happen?? How can I fix this??


0 Commenti
Risposte (1)
Harsh
il 28 Feb 2025
Hi @기범
In Reinforcement Learning (RL), the reward is a signal that guides the agent’s learning by providing feedback on its actions. It changes dynamically based on the agent’s behavior and the environment’s response. A well-designed reward function encourages desired actions and discourages unwanted ones, leading to improved performance over time.
You can try the following to improve your performance:
For more information on how to craft a reward function please refer to the following MATLAB tech talks webinar - https://www.mathworks.com/videos/reinforcement-learning-part-2-understanding-the-environment-and-rewards-1551976590603.html
0 Commenti
Vedere anche
Categorie
Scopri di più su Environments in Help Center e File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!