Kundan Panta


Last seen: 28 days ago | Active since 2024

Followers: 0   Following: 0

Statistics

Feeds

View by

Question


Confusion in agent and trainFromData options when using RNN/LSTM
My dataset contains numTraj trajectories, each containing numSteps time-steps. I filled the experience buffer with my data in a ...

28 days ago | 1 answer | 0

Question


Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?
Since TD3, SAC, etc. agents support using recurrent layers by themselves, would using these recurrent base agents still not work...

3 months ago | 0 answers | 0
