Is it possible to train LSTM Network without a Dataset?
1 visualizzazione (ultimi 30 giorni)
Mostra commenti meno recenti
Huzaifah Shamim
il 23 Lug 2020
Modificato: Huzaifah Shamim
il 27 Lug 2020
In the following paper, they utilize Reinforcement Learning and within it, also use an LSTM network. On page 3, they say that they use some kind of loss function that allows the training of the LSTM network without a dataset. I was wondering how that could be possible? If someone could explain, I would greatly appreciate it.
0 Commenti
Risposta accettata
Emmanouil Tzorakoleftherakis
il 27 Lug 2020
In the paper they mention "Although a readily available dataset is required to train an LSTM network, we devised an efficient way to tackle this challenge utilizing the experiences stored in the replay memory of the Q-network".
This is how training works with experience buffers in RL - you don't have data at the beginning, then you run simulations and store the data you collect in the experience buffer, which you are then using to train the policy. So the data is not "readily available" but you are still sing your experience buffer.
1 Commento
Più risposte (0)
Vedere anche
Categorie
Scopri di più su Sequence and Numeric Feature Data Workflows in Help Center e File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!