References to multi-agent reinforcement learning schemes in the reinforcement learning toolbox

Question

0 voti

Can somebody provide several references on multi-agent reinforcement learning schemes in reinforcement learning toolbox？

2 Commenti
Mostra Nessuno Nascondi Nessuno

Umar il 22 Lug 2024

Hi Lin,

Plese refer to following links regarding multi-agent reinforcement learning,

https://www.mathworks.com/help/reinforcement-learning/ug/train-agent-to-play-turn-based-game.html https://www.mathworks.com/help/releases/R2023b/reinforcement-learning/ref/rl.env.rlmultiagentfunctionenv.html

Please let me know if you have any further questions.

Lin il 29 Lug 2024

Thank you for you answer!Unfortunately, I could not find any reference even after your advices.I'm looking for references & informations related to the "LearningStrategy:centralized&decentralized" property of object "rlMultiAgentTrainingOptions".Please let me know if you have further tips.Thanks again!

Accedi per commentare.

Accedi per rispondere a questa domanda.

Follow Question

Answer 1

Emmanouil Tzorakoleftherakis il 24 Lug 2024

0 voti

The following examples are all on multi-agent reinforcement learning (centralized or decentralized):

https://www.mathworks.com/help/reinforcement-learning/ug/train-agents-for-path-following.html

https://www.mathworks.com/help/reinforcement-learning/ug/train-3-agents-for-area-coverage.html

https://www.mathworks.com/help/reinforcement-learning/ug/train-2-agents-to-collaborate.html

https://www.mathworks.com/help/reinforcement-learning/ug/train-agent-to-play-turn-based-game.html

12 Commenti
Mostra 10 commenti meno recenti Nascondi 10 commenti meno recenti

Umar il 29 Lug 2024

Hi @Lin,

The centralized critic provides feedback to all agents based on the collective experience of the entire team. This approach enables agents to coordinate their actions effectively and learn from a global perspective. Here is an example of how you can set the "LearningStrategy" property to "centralized" in MATLAB:

options = rlMultiAgentTrainingOptions('LearningStrategy', 'centralized');

Conversely, in a decentralized learning strategy, each agent learns independently based on its local observations and rewards. There is no centralized critic providing feedback to all agents. Each agent makes decisions autonomously without considering the global state of the environment. This approach is useful when agents have limited communication or when scalability is a concern.To set the "LearningStrategy" property to "decentralized" in MATLAB, you can use the following code snippet:

options = rlMultiAgentTrainingOptions('LearningStrategy', 'decentralized');

To showcase the "LearningStrategy" property of the "rlMultiAgentTrainingOptions" object in Matlab, I will create a simple example that involves setting up a multi-agent training scenario with both centralized and decentralized learning strategies and then visualize the training progress using plots. So, first setting up the environment by defining the environment and agents for our multi-agent system. I will create a simple environment with two agents.

% Define the environment

env = rlPredefinedEnv("SimpleMultiAgentEnvironment");

% Define the agents

agent1 = rlQAgent(env);

agent2 = rlQAgent(env);

Note: rlPredefinedEnv requires Reinforcement Learning Toolbox.

Then, creating the multi-agent training options by creating the "rlMultiAgentTrainingOptions" object and set the learning strategy to both centralized and decentralized.

% Create multi-agent training options

multiAgentOpts = rlMultiAgentTrainingOptions;

multiAgentOpts.LearningStrategy = ["centralized", "decentralized"];

Now, training the agents using the defined options and visualize the training progress.

% Train the agents

trainingStats = trainMultiAgent(agent1, agent2, env, multiAgentOpts);

% Plot the training progress

figure;

subplot(2,1,1);

plot(trainingStats.CumulativeReward);

title('Cumulative Reward');

subplot(2,1,2);

plot(trainingStats.ActorLoss);

hold on;

plot(trainingStats.CriticLoss);

legend('Actor Loss', 'Critic Loss');

title('Actor and Critic Loss');

Finally, observing the results after running the code, you will see two plots showing the cumulative reward and the actor/critic losses for both centralized and decentralized learning strategies. This detailed implementation demonstrates how to utilize the "LearningStrategy" property of the "rlMultiAgentTrainingOptions" object in Matlab to train agents with different strategies and visualize their training progress effectively. Feel free to customize the environment, agents, and training options to explore more complex multi-agent scenarios and further enhance your understanding of centralized and decentralized learning strategies in reinforcement learning.By understanding the nuances of the "LearningStrategy" property and its implications for multi-agent training, you can tailor your approach to suit the specific requirements of your reinforcement learning scenario. Please let me know if you have any further questions.

Lin il 30 Lug 2024

Hello @Umar @Emmanouil Tzorakoleftherakis,

I am very grateful for your help. Your answers have resolved my doubts and filled the gap of lacking references on multi-agent reinforcement learning in Matlab. Thank you again.

Umar il 30 Lug 2024

@Lin,

Glad to help out again, please feel free to ask any questions if you still need any help.

Accedi per commentare.

References to multi-agent reinforcement learning schemes in the reinforcement learning toolbox

2 Commenti
Mostra Nessuno Nascondi Nessuno

Risposta accettata

12 Commenti
Mostra 10 commenti meno recenti Nascondi 10 commenti meno recenti

Più risposte (0)

Categorie

Prodotti

Release

Tag

Community Treasure Hunt

References to multi-agent reinforcement learning schemes in the reinforcement learning toolbox

2 Commenti Mostra Nessuno Nascondi Nessuno

Risposta accettata

12 Commenti Mostra 10 commenti meno recenti Nascondi 10 commenti meno recenti

Più risposte (0)

Categorie

Prodotti

Release

Tag

Vedere anche

Community Treasure Hunt

2 Commenti
Mostra Nessuno Nascondi Nessuno

12 Commenti
Mostra 10 commenti meno recenti Nascondi 10 commenti meno recenti