Tuning the ExperienceHorizon hyperparameter for a PPO agent (Reinforcement Learning)

Nicolas CRETIN on 18 Jul 2024 at 14:39

Direct link to this question

https://it.mathworks.com/matlabcentral/answers/2138486-tuning-experiencehorizon-hyperparamter-for-ppo-agent-reinforcement-learning

Hello everyone,

I'm trying to train a PPO agent, and I would like to change the value of the ExperienceHorizon hyperparameter (Options for PPO agent - MATLAB - MathWorks Switzerland).

When I set a value other than the default, the agent waits until the end of the episode to update its policy. For example, ExperienceHorizon=1024 doesn't work for me, even though the episode is longer than 1024 steps. I'm also not using parallel training.
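To make the expectation concrete, here is a minimal Python sketch (not MATLAB, and not the toolbox's implementation). It assumes the agent should learn after every ExperienceHorizon steps, and once more at the end of the episode if steps remain since the last update. The function name and the 3000-step episode length are hypothetical, chosen only for illustration.

```python
def expected_update_steps(episode_length, experience_horizon):
    """Return the step counts at which a policy update is expected.

    Assumption (a reading of the documentation, not the toolbox's code):
    the agent learns after every `experience_horizon` steps, and also at
    the end of the episode if steps remain since the last update.
    """
    if episode_length <= 0 or experience_horizon <= 0:
        raise ValueError("both arguments must be positive")
    # Updates triggered by filling the experience horizon.
    steps = list(range(experience_horizon, episode_length + 1, experience_horizon))
    # Final update at episode end, unless one already landed exactly there.
    if not steps or steps[-1] != episode_length:
        steps.append(episode_length)
    return steps


# Hypothetical episode of 3000 steps with ExperienceHorizon = 1024.
expected = expected_update_steps(3000, 1024)
observed = [3000]  # what the question describes: a single update at episode end
print("expected updates at steps:", expected)
print("observed updates at steps:", observed)
```

Under that assumption, a 3000-step episode would give updates at steps 1024, 2048 and 3000, whereas the behavior described above corresponds to a single update at step 3000.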

I also get the same issue if I change the MiniBatchSize from its default value.

Is there anything I've missed about this parameter?

More information on the PPO algorithm: Proximal Policy Optimization (PPO) Agents - MATLAB & Simulink - MathWorks Switzerland

If anyone could help, that would be very nice!

Thanks a lot in advance,

Nicolas

Products

Reinforcement Learning Toolbox

Release

R2023b
