Different Sample time for RL environment

Question

Syed Adil Ahmed il 11 Giu 2024

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2127656-different-sample-time-for-rl-environment

Commentato: Syed Adil Ahmed il 13 Giu 2024

Hello Everyone,

I am trying to build a RL agent using DQN currently. My environment model is composed of Nonlinear equations which have faster dynamics around 1ms. I was wondering if it possible to have my RL agent to work at 10ms and let the environment run at 1ms. This will obviously mean that for 10 timesteps the environment will be feeding at a constant control action from the RL agent.

I know I can code this up if I create the agent and environment myself, but currently I'm taking advantage of MATLAB's Reinforcement Learning Toolbox in MATLAB editor and I would save a lot of time if this is possible in the toolbox itself.

Thanks.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Kartik Saxena il 13 Giu 2024

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2127656-different-sample-time-for-rl-environment#answer_1471376

Apri in MATLAB Online

Hi,

When creating a custom environment in MATLAB for use with the Reinforcement Learning Toolbox, you can define a 'step' function that advances the environment's state based on the agent's action. To simulate the environment at a faster rate than the agent operates, modify the 'step' function to perform multiple updates (10 updates of 1ms each) for each call to the 'step' function.

Refer to the following code snippet:

    for i = 1:10
        % Update the environment state based on the action
        % Assume updateEnvironment is a function that updates the environment state
        % for a 1ms timestep given the current state and action
        loggedSignals = updateEnvironment(loggedSignals, action);
    end

Hope it helps!

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Syed Adil Ahmed il 13 Giu 2024

Thanks a lot. That is not the solution I was expecting, but sometimes its great to see small things (like a for loop) making the difference.

Accedi per commentare.

Different Sample time for RL environment

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

Different Sample time for RL environment

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti