multi-agent deep reinforcement learning

Question

beni hadi il 4 Nov 2020

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/635900-multi-agent-deep-reinforcement-learning

Risposto: Ari Biswas il 5 Nov 2020

I designed the deep reinforcement learning multi-agent system with three DDPG agents. Each agent does an independent task. I prepared a counter to calculate the total rewards of each agent in each episode in the Simulink. The calculated total rewards in each episode for each agent are different from the calculated rewards of each agent in the Matlab training-progress of Reinforcement Learning Episode Manager. But for a single agent in the reinforcement learning system, these rewards were the same.

1) Are the rewards calculated by the algorithm in the multi-agent reinforcement system influenced by the calculations of other agent's rewards?

2) Is the update of the weights of the network of each agent not independent of the other agents?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Ari Biswas il 5 Nov 2020

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/635900-multi-agent-deep-reinforcement-learning#answer_535685

Assuming you are training multiple agents in Simulink using the Reinforcement Learning Toolbox in R2020b:

The rewards are calculated by the environment, not the agent algorithm so they should not be affected unless the environment is changing them. When you compare rewards between single and multi-agents please ensure that the state-action pairs are the same. Rewards depend on states and actions and you may get different results for different state-action pairs.
In R2020b, the agent neural networks are updated independently.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

multi-agent deep reinforcement learning

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Community Treasure Hunt

multi-agent deep reinforcement learning

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti