Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

Question

John Smith il 13 Mar 2023

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension

Commentato: Artem Lensky il 19 Ago 2023

Hello,

While implementing a ViT transformer in Matlab, I found at that the concatLayer does not concatenate over the T dimension. This is needed to concatenate the class token with patch tokens, since the natural representation is CBT with C corresponding to features, B to batch and T to token within a batch (this is also the canonical representation in the attention function).

It's possible to work around this by hacking to e.g. SCB, but then other problems pop up which also need to be hacked around.

Thx

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Ben il 14 Mar 2023

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension#answer_1192820

You can create a layer that concatenates on the T dimension with functionLayer

sequenceCatLayer = functionLayer(@(x,y) cat(3,x,y));

This will work in dlnetwork to concatenate two CBT dlarray-s.

Since you're concatenating the class token, it might also be worth considering creating a custom layer that has the class token embedding as a Learnable property, and performs the concatenation in the predict method.

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Catalytic il 23 Mar 2023

Modificato: Catalytic il 23 Mar 2023

@John Smith - Since Ben's answer yielded a solution for you, you should hit the Accept this Answer button, and likewise with other answers you might not have accepted.

Artem Lensky il 19 Ago 2023

Are there any plans to make concatenationLayer support concatetnation along the T dimension?

Accedi per commentare.

Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

3 Commenti Mostra 1 commento meno recenteNascondi 1 commento meno recente

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente