Reinforcement Learning experience buffer length and parallelisation toolbox

Question

Tech Logg Ding on 2 Dec 2020

Direct link to this question

https://it.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox

Edited: Emmanouil Tzorakoleftherakis on 3 Dec 2020

Accepted Answer: Emmanouil Tzorakoleftherakis

I am training a DDPG agent in parallel with the following settings:

trainOpts.UseParallel = true;
trainOpts.ParallelizationOptions.Mode = 'async';
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = -1;
trainOpts.ParallelizationOptions.DataToSendFromWorkers = 'Experiences';

Does each parallel simulation have its own experience buffer? That would take up more memory, so I am hoping that only one experience buffer is stored and used to update the critic network.

From the documentation, it seems that there is only one experience buffer, since the experiences are sent back to the host.

Answer 1

Emmanouil Tzorakoleftherakis on 3 Dec 2020

Direct link to this answer

https://it.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox#answer_564503

Edited: Emmanouil Tzorakoleftherakis on 3 Dec 2020

Hello,

There is one big experience buffer on the host, whose size you set as usual in your agent options. Each worker has a much smaller buffer that only collects experiences until "StepsUntilDataIsSent" is reached, at which point they are sent to the host.
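
As a rough illustration of where those two sizes are set, here is a minimal configuration sketch. It assumes the option names available in the Reinforcement Learning Toolbox release current at the time of this thread (later releases may differ), and the numeric values are illustrative assumptions rather than recommendations from the answer.

```matlab
% Sketch only: numeric values are assumptions, not taken from the thread.

% Host-side experience buffer: its capacity is set in the agent options.
agentOpts = rlDDPGAgentOptions;
agentOpts.ExperienceBufferLength = 1e6;   % assumed capacity, in experiences

% Parallel training options, mirroring the settings in the question.
trainOpts = rlTrainingOptions;
trainOpts.UseParallel = true;
trainOpts.ParallelizationOptions.Mode = 'async';
trainOpts.ParallelizationOptions.DataToSendFromWorkers = 'experiences';

% Each worker holds experiences only until it sends them to the host.
% A positive value caps that worker-side buffer at this many steps
% (32 is an assumed value); -1, as in the question, makes the worker
% wait until the end of the episode before sending.
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = 32;
```

With the question's setting of -1, the worker-side buffer can grow to a full episode's worth of experiences, so a positive StepsUntilDataIsSent may be worth trying if per-worker memory is the concern.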
