reinforcement learning toolbox - q table

Question

Xinpeng Wang il 10 Lug 2019

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table

Risposto: Tuong Nguyen il 7 Ott 2022

Risposta accettata: Emmanouil Tzorakoleftherakis

I'm a newbie to RL and the RL toolbox. I played with Q-learning agent with a model in simulink. My question is after training, How can I access to the trained Q table? The qTable used to generate the agent is all ZERO. I cannot figure out where the trained Q values and the policies are stored. Thank you!

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Emmanouil Tzorakoleftherakis il 23 Lug 2019

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_384423

Apri in MATLAB Online

Hi Xinpeng,

To see the trained table, you have to do is extract it using ‘getCritic’. Try:

critic = getCritic(agent);

The variable ‘critic’ has a field which contains the Qtable after training.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Answer 2

carlos pedreira il 13 Gen 2020

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_409897

OK, but, after that, HOW CAN I SEE the table....

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Answer 3

Shikhar Sharma il 24 Gen 2020

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_411873

It should appear under the Workspace tab.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Answer 4

Umut Can Akdag il 18 Mag 2020

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_433182

For those who are still looking for the q table I think this is the solution.

critic = getCritic(agent);

qtable = getLearnableParameters(critic);

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Answer 5

RUBEN HERNANDEZ il 19 Apr 2022

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_946135

Hi everyone

I want to simulate Q-learning agent for control inverted pendulum in simulink (with Q-table) (just for ilustrative example)

I've picked the rlsimplependulumModel.slx predefined in matlab.

This is my code

mdl = 'rlSimplePendulumModel';

open_system(mdl)

obsInfo = rlNumericSpec([3 1]); % vector of 3 observations: sin(theta), cos(theta), d(theta)/dt

actInfo = rlFiniteSetSpec([-2 0 2]); % 3 possible values for torque: -2 Nm, 0 Nm and 2 Nm

obsInfo.Name = 'observations';

actInfo.Name = 'torque';

agentBlk = [mdl '/RL Agent'];

env = rlSimulinkEnv(mdl,agentBlk,obsInfo,actInfo);

env.ResetFcn = @(in)setVariable(in,'theta0',pi,'Workspace',mdl);

Ts = 0.05; % simulation time

Tf = 20; % sample time

% Fix the random generator seed for reproducibility

rng(0)

%% To create a Q-learning agent:

%1 Create a critic using an rlQValueRepresentation object.

qTable = rlTable(obsInfo, actInfo);

qRepresentation = rlQValueRepresentation(qTable, obsInfo, actInfo);

qRepresentation.Options.LearnRate = 0.99;

%% 2 Specify agent options using an rlQAgentOptions object.

agentOpts = rlQAgentOptions;

agentOpts.DiscountFactor = 0.99;

agentOpts.EpsilonGreedyExploration.Epsilon = 0.9;

agentOpts.EpsilonGreedyExploration.EpsilonDecay = 0.01;

%% 3 Create the agent using an rlQAgent object.

qAgent = rlQAgent(qRepresentation,agentOpts);

%% Training Algorithm

% rlQAgentOptions.

trainOpts = rlTrainingOptions;

trainOpts.MaxStepsPerEpisode = ceil(Tf/Ts);

trainOpts.MaxEpisodes = 2000;

trainOpts.StopTrainingCriteria = "AverageReward";

trainOpts.StopTrainingValue = -740;

trainOpts.ScoreAveragingWindowLength = 5;

trainingStats = train(qAgent,env,trainOpts);

AND THIS IS THE ERROR MESSAGE

Error using rlTable/validateInput (line 131)

Input must be a scalar rlFiniteSetSpec.

Error in rlTable (line 51)

validateInput(obj, ObservationInfo)

Error in qlearningpendulum (line 30)

qTable = rlTable(obsInfo, actInfo);

any suggestions?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Answer 6

Tuong Nguyen il 7 Ott 2022

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table#answer_1068895

I think to use tabular Q learning, your observation has to be discrete and finite. That means your obsInfo has to be rlFiniteSetSpec(allStates), where in "allStates" you list out all the possible observations. See https://www.mathworks.com/help/reinforcement-learning/ref/rltable.html for the rlTable and https://www.mathworks.com/help/reinforcement-learning/ref/rl.util.rlfinitesetspec.html for the rlFiniteSetSpec.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

reinforcement learning toolbox - q table

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (5)

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

reinforcement learning toolbox - q table

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Più risposte (5)

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti