why I get a different action result every new time with same sample observations after deploying trained RL policies?
Mostra commenti meno recenti
load("agent0218_300016_40000.mat","agent");
obsInfo = getObservationInfo(agent);
actInfo = getActionInfo(agent);
ResetHandle = @() myResetFunction(test_sss);
StepHandle = @(Action,LoggedSignals) myStepFunction(Action,LoggedSignals,test_sss);
envT = rlFunctionEnv(obsInfo,actInfo,StepHandle,ResetHandle);
simOpts = rlSimulationOptions('MaxSteps',size(test_sss,1));
experience = sim(envT,agent,simOpts);
ac3=squeeze(experience.Action.bs.Data);
%******************************************************************************
%******************************************************************************
generatePolicyFunction(agent);
%******************************************************************************
%******************************************************************************
for iii=1:size(ac3,1)
observation1=test_sss{iii,:};
action1(iii,1) = evaluatePolicy(observation1);
end
sum(abs(ac3-action1))
Risposta accettata
Più risposte (1)
de y
il 24 Feb 2021
0 voti
Categorie
Scopri di più su Reinforcement Learning Toolbox in Centro assistenza e File Exchange
Prodotti
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!