Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet (Paper)

2021 · 2021 IEEE/CVF International Conference on Computer Vision (ICCV) · Citations: 2229
Multimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet · Related Techniques