The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official impleme...
SwinIR: Image Restoration Using Swin Transformer (official repository)
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer...
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal a...
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Runner-Up...
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
[TNSRE 23] EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activatio...
[ECCV 2022] Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Official implementation of PVT series
PaddleSlim is an open-source library for deep model compression and architecture search.
Graph Transformer Architecture. Source code for "A Generalization of Transformer Networks to Graphs", DLG-AAAI'21.
list of efficient attention modules
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting (NeurIPS 2019)
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
[ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Predicti...
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
VRT: A Video Restoration Transformer (official repository)
Multimodal-GPT
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
第 1-20 条,共 26938 条