receiving different training results while running the same code

Question

Sourabh il 2 Giu 2023

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/1977359-receiving-different-training-results-while-running-the-same-code

Commentato: Emmanouil Tzorakoleftherakis il 5 Giu 2023

Risposta accettata: Steven Lord

I ran the training of my RL model but forgot to save so i thought i would run the same script again

but i am getting a slightly changed response ?

shouldnt i get the same training results?

also what is relation b/w different sampling times of like actor and agent.

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Sourabh il 2 Giu 2023

okay so does it mean higher sample time of my agent better control or what ?

also in training if max steps is 100 does it mean the simulation is running for 100 sec / episode ?

Emmanouil Tzorakoleftherakis il 5 Giu 2023

max steps will depend on your agent sample time. If it's 100, it means thatthe total episode duration will be 100* ts where ts is the agent sample time.

Also, smaller sample time does not necessarily mean better control. As a rule of thumb, your sample time should only be as small as needed to get good results, not smaller than that to avoid wasting computational resources.

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Steven Lord il 2 Giu 2023

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1977359-receiving-different-training-results-while-running-the-same-code#answer_1249229

Apri in MATLAB Online

Are random numbers involved in the process of creating or training your RL model? [My guess is most likely yes.] One way to check this would be to set the state of the random number generator to a known, fixed value using rng then run your code. Reset the generator to that same known, fixed value and run your code again.

rng(0, 'twister');
x = rand(1, 5)
x = 1×5
    0.8147    0.9058    0.1270    0.9134    0.6324
y = rand(1, 5) % not the same as x
y = 1×5
    0.0975    0.2785    0.5469    0.9575    0.9649
isequal(x, y)
ans = logical
   0
rng(0, 'twister');
y = rand(1, 5) % the same as x
y = 1×5
    0.8147    0.9058    0.1270    0.9134    0.6324
isequal(x, y)
ans = logical
   1

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

receiving different training results while running the same code

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Risposta accettata

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

receiving different training results while running the same code

3 Commenti Mostra 1 commento meno recenteNascondi 1 commento meno recente

Risposta accettata

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti