Reinforcement Learning agents generating zero episodes
Hello MATLAB community,
I ran into an issue while training a multi-agent problem in MATLAB/Simulink. I am trying to solve a very simple problem; however, training stops at episode 1.
Suppose that we have three discrete variables A=[1 2 3], B=[1 2 3], C=[1 2 3].
Reward function = A*B*C;
Observation= A+B+C;
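For reference, the problem as stated is small enough to enumerate by hand. The sketch below is a plain Python illustration, not the attached Simulink model. It assumes that A, B and C are chosen independently from [1 2 3] and that the reward and observation are exactly the two expressions above. The names `values`, `reward`, `observation` and `rewards_by_obs` are hypothetical and used only for illustration.

```python
from itertools import product

# Assumed domain shared by A, B and C (taken from the problem statement).
values = (1, 2, 3)


def reward(a, b, c):
    """Reward as stated in the post: A*B*C."""
    return a * b * c


def observation(a, b, c):
    """Observation as stated in the post: A+B+C."""
    return a + b + c


# All 3^3 joint choices of (A, B, C).
combos = list(product(values, repeat=3))

# Joint choice with the highest reward.
best = max(combos, key=lambda abc: reward(*abc))

# Group the rewards that can occur under each observation value, to see
# whether the scalar observation identifies the reward uniquely.
rewards_by_obs = {}
for abc in combos:
    rewards_by_obs.setdefault(observation(*abc), set()).add(reward(*abc))

print("number of joint choices:", len(combos))
print("best joint choice:", best, "reward:", reward(*best))
for obs in sorted(rewards_by_obs):
    print("observation", obs, "-> possible rewards", sorted(rewards_by_obs[obs]))
```

Under these assumptions the reward is maximized at A = B = C = 3, where it equals 27. The grouping also shows that the summed observation does not always determine the reward: an observation of 6 can come from (1, 2, 3) with reward 6 or from (2, 2, 2) with reward 8. This gives a baseline for checking what the trained agents should converge to once training runs past the first episode.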
I tried different parameters, but it didn’t work. I have attached a sample file for reference. I would very much appreciate any suggestions for resolving the convergence issue.
Thanks for your time 😊
