When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

Question

DHRUV LAAD il 2 Gen 2020

2
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/498677-when-training-an-agent-using-the-reinforcement-learning-toolbox-how-can-i-use-a-custom-stopping-cri

Commentato: goc3 il 14 Lug 2020

The current options only allow for 5 predefined choices ("AverageSteps", "AverageReward", "EpisodeReward", "GlobalStepCount", "EpisodeCount"). I want to include a stopping criterion different from these. Is there any option to do the same?

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

goc3 il 14 Lug 2020

I was about to ask a similar question... The "accepted" answer below doesn't actually answer the question—instead, it confirms that those are the only available stop criteria.

It would be great if additional options and/or support for custom stopping criteria were added.

As an example, for a particular application, I would like to stop training once the episode reward plateaus. It is not known beforehand at what value it will plateau, so having to set a constant before training is very limiting for any application that is programmed to be dynamic or to proceed automatically based on training results.

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Rajani Mishra il 6 Gen 2020

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/498677-when-training-an-agent-using-the-reinforcement-learning-toolbox-how-can-i-use-a-custom-stopping-cri#answer_408842

trainOpts = rlTrainingOptions(Name,Value) creates an option set for training using specified name-value pairs.

Arguments like - 'StopTrainingCriteria', 'StopTrainingValue', 'MaxEpisodes' should be specified for defining stopping criterion while training an agent.

StopTrainingCriteria: Specifies the termination condition. Takes one of the choices as you have mentioned

StopTrainingValue: Specifies the Critical value of training termination condition. Training terminates when the termination condition specified by the StopTrainingCriteria option equals or exceeds this value

MaxEpisodes: Specifies maximum number of episodes to train the agent, once the number of episodes reached training terminates

For more information please refer to

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Tuwe Löfström il 13 Lug 2020

So there is no way of adding a custom stopping criteria, in a similar way as you can define custom reset and step functions?

Accedi per commentare.

When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Risposte (1)

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Risposte (1)

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti