Question about the external action in DDPG

Does anyone know what loss function the Q-network uses when I set external action = 1 during the training process (DDPG)?

Accepted Answer

Emmanouil Tzorakoleftherakis
The loss function does not change. The only difference is that the experience buffer is populated with the action from the external signal, together with the corresponding observations and reward.
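
To illustrate the point, here is a minimal Python sketch. It is not the toolbox implementation, and every name in it is hypothetical: the toy linear `critic` and `actor`, the `collect` helper, and the made-up dynamics and reward. It assumes the standard DDPG critic loss, the mean squared error between Q(s, a) and the target r + gamma * Q'(s', pi'(s')). The external action only changes which action is stored in the buffer; the same `critic_loss` function is then applied to the stored experience.

```python
GAMMA = 0.99  # discount factor (assumed value for this sketch)

def critic(w, obs, act):
    """Toy linear Q-function: Q(s, a) = w[0]*s + w[1]*a + w[2]."""
    return w[0] * obs + w[1] * act + w[2]

def actor(theta, obs):
    """Toy linear deterministic policy: a = theta * s."""
    return theta * obs

def critic_loss(batch, w, w_target, theta_target, gamma=GAMMA):
    """Mean squared TD error over a minibatch of (s, a, r, s_next, done)."""
    total = 0.0
    for s, a, r, s_next, done in batch:
        # The target uses the target actor's action at s_next,
        # while Q(s, a) is evaluated at the action stored in the buffer.
        if done:
            y = r
        else:
            y = r + gamma * critic(w_target, s_next, actor(theta_target, s_next))
        total += (y - critic(w, s, a)) ** 2
    return total / len(batch)

def collect(buffer, obs, theta, external_action=None):
    """Store one transition; the external action, if given, replaces the actor's."""
    act = external_action if external_action is not None else actor(theta, obs)
    next_obs = obs + act        # hypothetical toy dynamics
    reward = -next_obs ** 2     # hypothetical toy reward
    buffer.append((obs, act, reward, next_obs, False))
    return next_obs
```

Calling `collect(buffer, obs, theta, external_action=2.0)` stores 2.0 as the action, and `critic_loss` is then computed on that experience exactly as it would be for an action chosen by the actor.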


Release

R2021b
