RL Agent can't inherint FiM sampling time

Question

Ari il 5 Mag 2025

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2176904-rl-agent-can-t-inherint-fim-sampling-time

Risposto: Emmanouil Tzorakoleftherakis il 7 Mag 2025

Risposta accettata: Emmanouil Tzorakoleftherakis

rlwatertank.slx

I am trying to experiment with this example. I replaced all logical operatios in reward and stop simulation blocks with MATLAB functions to get rid of Fixed in Minor Step (FiM) signals. This is the result:

I re-trained the agent and everything went well. Then, I made modification in agent's SampleTime. Since I want the agent to inherit environment's sampling time (decided by the solver), I set agent.AgentOptions.SampleTime = -1. During the simulation, this error message occured:

An error occurred while running the simulation for model 'rlwatertank' with the following RL agent blocks:

rlwatertank/RL Agent

Caused by:

Error using rl.env.internal.reportSimulinkSimError (line 29)

The block rlwatertank/RL Agent is not supported when inheriting a "Fixed In Minor Step" sample time. Either place a rate transition block at the block inports to inherit a discrete sample time or specify agent.SampleTime > 0.

MATLAB/Simulink said RL Agent cannot inherit FiM sample time, but there is no FiM signal in the model as we can see. How do I solve this problem? How can I make RL Agent inherit environment's sampling time? I'am using R2024b Update 5.

5 Commenti
Mostra 3 commenti meno recentiNascondi 3 commenti meno recenti

Ari il 7 Mag 2025

Consider a scenario where the reference is not stationary. For example it may look like this:

During transient period, the environment needs action update more frequently since the dynamic changes rapidly. On the other hand, during steady period the environment may receive action update less frequent. If I use low sampling rate for the agent, it could "miss" the dynamic in transient region. But, if I use high sampling rate, it may waste computational resources during steady period. I could be wrong, but I think it's a good thing if the solver and agent could have a "communication" like:

solver: hey agent, I just updated the state, give us a new action signal

agent: ok solver, here is your new action

This can be achieved in MATLAB environment using custom step function. It should be possible in Simulink environment as well, I hope.

Emmanouil Tzorakoleftherakis il 7 Mag 2025

Thanks, will reply below. Fyi, what you are describing is easily implementable in MATLAB because in MATLAB environments there is no notion of time like in Simulink - you can define what time is somewhat arbitrarily.

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Emmanouil Tzorakoleftherakis il 7 Mag 2025

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2176904-rl-agent-can-t-inherint-fim-sampling-time#answer_1564836

Based on the comments section above, it seems you are looking to have an "adaptive" sample time for the RL Agent block. The screenshot you have above shows that the input signals to the RL Agent block are continuous - this cannot happen. RL needs to happen at discrete sample time per definition of MDP. A couple of options I see:

1) Add a rate transition block to the output of the water tank subsystem (height signal). Set the sample time in the block to be whatever makes sense to you. The RL Agent block will inherit the same sample time. Note this will still be "fixed" time step.

2) For adaptive sample time, you need to place the RL Agent block inside a triggered subsystem. Doing that you can "call" the RL Agent block at varying sample time based on some event (could be the variable-step sample time of the environment or some other metric you define to recognize transients vs steady state).

Hope this helps

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

RL Agent can't inherint FiM sampling time

5 Commenti
Mostra 3 commenti meno recentiNascondi 3 commenti meno recenti

Risposta accettata

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

RL Agent can't inherint FiM sampling time

5 Commenti Mostra 3 commenti meno recentiNascondi 3 commenti meno recenti

Risposta accettata

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

5 Commenti
Mostra 3 commenti meno recentiNascondi 3 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti