The definition of the Target update frequency in Reinforcement Learning Designer.

Question

Xian Zheng Hong il 7 Mar 2024

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer

Commentato: Xian Zheng Hong il 16 Mar 2024

Risposta accettata: UDAYA PEDDIRAJU

In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.

The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.

Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

UDAYA PEDDIRAJU il 12 Mar 2024

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer#answer_1424086

Hi Xian,

No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Xian Zheng Hong il 16 Mar 2024

Thanks for answering. Here is my another question.

Are the Online policy and Online Q updated at every time step in Reinforcement Learning Designer Toolbox?

Accedi per commentare.

The definition of the Target update frequency in Reinforcement Learning Designer.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Community Treasure Hunt

The definition of the Target update frequency in Reinforcement Learning Designer.

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti