Purpose and implementation of 're-flatten' Layer in Vision Transformer

3 visualizzazioni (ultimi 30 giorni)
In MATLAB 2023b's Computer Vision Toolbox, there is a pre-trained vision Transformer model that includes patch embedding and re-flatten layers. What is the purpose of the re-flatten layer? Additionally, it seems that there is no "re-flatten" function in the Deep Learning Toolbox. Do I need to write this function myself?
  2 Commenti
xingxingcui
xingxingcui il 4 Gen 2024
I only see patch embedding layer,flatten layer,can't find re-flaten layers in MATLAB 2023b,can you post the re-flaten layer link?
HK ZHOU
HK ZHOU il 5 Gen 2024
Thanks for your comment.
After executing the following codes, in the NetworkAnalyzer, the 're-flatten' function is visible, but cannot find the source code.
net = visionTransformer;
analyzeNetwork(net)

Accedi per commentare.

Risposte (1)

Tijue
Tijue il 12 Feb 2024
Hello Zhou,
You can open the "visionTransformer" function and find the "re-flatten" layer at line 67.

Categorie

Scopri di più su Deep Learning Toolbox in Help Center e File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by