I would like to train a network using a combined CNN-LSTM architecture. Is this possible in MATLAB?

15 views (last 30 days)
I have image data and I use imageInputLayer as the input for the 2D conv layers; after that I would like to feed the features into an LSTM network. Is this possible in MATLAB? The architecture I have in mind is like the one pictured below (found in a research-paper image on Google). I have tried the following, but unfortunately it was not successful. Can you please give some ideas on how to implement this?
layers = [ ...
    % CNN
    imageInputLayer([129 35 1])
    sequenceInputLayer(inputSize,'Name','input')
    convolution2dLayer(3,32,'Padding','same')
    batchNormalizationLayer
    reluLayer
    maxPooling2dLayer(2,'Stride',2)
    convolution2dLayer(3,32,'Padding','same')
    batchNormalizationLayer
    reluLayer
    maxPooling2dLayer(2,'Stride',2)
    convolution2dLayer(3,64,'Padding','same')
    batchNormalizationLayer
    reluLayer
    maxPooling2dLayer(2,'Stride',2)
    flattenLayer('Name','flatten')
    % LSTM
    lstmLayer(numHiddenUnits,'OutputMode','last','Name','lstm')
    fullyConnectedLayer(numClasses,'Name','fc')
    softmaxLayer('Name','softmax')
    classificationLayer('Name','classification')];
  2 Comments
Ullah Nadeem on 23 Jun 2023
This reply is rather late, but it may help the next person who searches for this.
My problem was resolved by putting a sequenceFoldingLayer right after the imageInputLayer and a sequenceUnfoldingLayer before the flatten layer.
I think this is because LSTM layers need the sequential information preserved in order to capture long-range dependencies.
Cheers~
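A minimal sketch of the fix described in this comment, following the usual trainNetwork pattern of pairing the folding and unfolding layers and connecting their miniBatchSize ports. The values for inputSize, numHiddenUnits, and numClasses are placeholders taken from the question, and the CNN is shortened to a single block for brevity (requires Deep Learning Toolbox):

```matlab
inputSize = [129 35 1];   % from the question
numHiddenUnits = 100;     % placeholder
numClasses = 5;           % placeholder

layers = [ ...
    sequenceInputLayer(inputSize,'Name','input')
    sequenceFoldingLayer('Name','fold')        % fold the sequence into the batch for the CNN
    convolution2dLayer(3,32,'Padding','same','Name','conv1')
    batchNormalizationLayer('Name','bn1')
    reluLayer('Name','relu1')
    maxPooling2dLayer(2,'Stride',2,'Name','pool1')
    sequenceUnfoldingLayer('Name','unfold')    % restore the sequence dimension
    flattenLayer('Name','flatten')
    lstmLayer(numHiddenUnits,'OutputMode','last','Name','lstm')
    fullyConnectedLayer(numClasses,'Name','fc')
    softmaxLayer('Name','softmax')
    classificationLayer('Name','classification')];

lgraph = layerGraph(layers);
% sequenceUnfoldingLayer needs the mini-batch size from the folding layer:
lgraph = connectLayers(lgraph,'fold/miniBatchSize','unfold/miniBatchSize');
```

Note there is a single sequenceInputLayer here; the imageInputLayer from the original attempt is dropped, since a network can only have one input layer when trained this way.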
Ben on 23 Jun 2023
@Ullah Nadeem - thanks for replying, you're right that you need sequenceFoldingLayer and sequenceUnfoldingLayer when using trainNetwork for CNN-LSTM networks. We have this example that shows training an LSTM on CNN embeddings of video frames, the final network combines the CNN and LSTM for prediction using the sequence folding layers. We also have this example demonstrating training a CNN-LSTM on audio data.
Note that you need a sequenceInputLayer to input sequences of images into the CNN-LSTM network.
Also note that you do not need sequenceFoldingLayer or sequenceUnfoldingLayer when using convolution2dLayer in dlnetwork with sequences of images - by default the convolution2dLayer in dlnetwork will "distribute" over the sequence dimension on sequences of images. To train the dlnetwork you will need to use a custom training loop.
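A sketch of the dlnetwork alternative mentioned in this comment, assuming a release where convolution2dLayer distributes over the sequence dimension of image sequences. The layer sizes are placeholders, and the classificationLayer is omitted because dlnetwork does not take output layers; loss is computed in the custom training loop instead:

```matlab
layers = [ ...
    sequenceInputLayer([129 35 1],'Name','input')
    convolution2dLayer(3,32,'Padding','same')   % applied per time step automatically
    reluLayer
    maxPooling2dLayer(2,'Stride',2)
    flattenLayer
    lstmLayer(100,'OutputMode','last')
    fullyConnectedLayer(5)
    softmaxLayer];

net = dlnetwork(layers);   % no folding/unfolding layers needed

% Training then uses a custom loop: evaluate gradients with dlfeval on a
% model-loss function, and update net.Learnables with e.g. adamupdate.
```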


Answers (0)

Products


Release

R2022a
