Different observation matrix in reinforcement learning episode

Question

mohsan niaz on 7 Feb 2022

https://it.mathworks.com/matlabcentral/answers/1644920-different-observation-matrix-in-reinforcement-learning-episode

Answered: Poorna on 29 Sep 2023

Environment.PNG

Hi Everyone,

I want to train an agent in a deep Q-learning reinforcement learning setting. For every episode, I want the agent to observe (read) a different dimension of a large matrix stored in a .mat file. In other words, I want the agent to read a different row or column of the matrix in every new training episode.

Can anyone explain how this can be done with the Reinforcement Learning Toolbox in MATLAB? I am also attaching a screenshot of the Simulink environment for reinforcement learning.

Regards

Mohsan.

Answer 1

Poorna on 29 Sep 2023

https://it.mathworks.com/matlabcentral/answers/1644920-different-observation-matrix-in-reinforcement-learning-episode#answer_1322019

Hi Mohsan,

I understand that you would like the observation at the start of each episode to be a row of your observation matrix, chosen either at random or in a predefined sequence.

To achieve this, you can use the "ResetFcn" callback property of the environment in your model. The reset function sets the environment to an initial state and computes the initial value of the observation.

You can create a custom callback function that contains the logic to select the required observation from the observation matrix and returns it. You can then assign this function to the "ResetFcn" property of the environment. This function will be called by the "train" function at the beginning of each training episode.
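As a rough illustration of the idea above, here is a minimal sketch of such a reset function for a function environment. The names rowResetFcn, obsMatrix, RowIdx and observations.mat are hypothetical and not taken from the question, the random row selection is only one possible choice, and a small stand-in matrix is used so that the snippet runs without the actual .mat file.

```matlab
% Stand-in data so the sketch runs on its own. In practice the matrix would
% come from the .mat file, e.g. (file and variable names are hypothetical):
%   data = load('observations.mat');
%   obsMatrix = data.obsMatrix;
obsMatrix = magic(4);

% Capture the matrix in a function handle, because the reset function of a
% function environment is called without input arguments.
resetHandle = @() rowResetFcn(obsMatrix);

% Each call corresponds to the start of one episode.
[initialObs, loggedSignals] = resetHandle();
fprintf('Episode starts with row %d\n', loggedSignals.RowIdx);

% The handle would then be passed as the reset function when creating the
% environment, together with your own specifications and step function, e.g.:
%   env = rlFunctionEnv(obsInfo, actInfo, stepHandle, resetHandle);

function [initialObs, loggedSignals] = rowResetFcn(obsMatrix)
% Hypothetical reset function: pick one row at random for the new episode.
rowIdx = randi(size(obsMatrix, 1));      % random row index
initialObs = obsMatrix(rowIdx, :).';     % return the row as a column vector
loggedSignals.RowIdx = rowIdx;           % remember which row was used
loggedSignals.State = initialObs;        % make the row available to the step function
end
```

To read a column instead of a row, index with obsMatrix(:, colIdx). For a predefined sequence, the random index could be replaced by a counter that is advanced on every call. Note that for a Simulink environment, as in the question, the function assigned to "ResetFcn" instead receives a Simulink.SimulationInput object and returns it, so the selected row would have to be passed to the model, for example with setVariable.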

For more information on how to define the reset function, please refer to the MATLAB documentation for function environments (rlFunctionEnv): https://www.mathworks.com/help/reinforcement-learning/ref/rl.env.rlfunctionenv.html

Hope this helps!
