transformers · Product
Source: GitHub · OPEN_SOURCE · Python · Apache-2.0 · Released 2018-10-29
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Stars: 160573
Forks: 33196
Tech stack: 3
Alternatives: 0
Related events: 50
transformers · Related events
WAV: Multi-Resolution Block Residual Routing for Deep Decoder-Only Transformers
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Inside the Visual Mind: Neuroscience-Motivated Concept Circuits for Interpreting and Steering Vision Transformers
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
When is 3D Worth It? A Resource-Performance Frontier for CNNs and Transformers in Lung CT
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Planning-aligned Token Compression for Long-Context Autonomous Driving
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Discovering Interpretable Algorithms by Decompiling Transformers to RASP
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
2026-06-08 · REGULATION · Impact: MEDIUM
TrioPose: Native Triple-Stream Diffusion Transformers for Pose-Guided Text-to-Image Generation
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Does Appearance Help? A Systematic Study of Image-Based Re-Identification in Online 3D Multi-Pedestrian Tracking
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
The Topological Trouble With Transformers
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
When Attention Beats Fourier: Multi-Scale Transformers for PDE Solving on Irregular Domains
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Where does Absolute Position come from in decoder-only Transformers?
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
What Makes Two Language Models Think Alike?
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
SpanNorm: Reconciling Training Stability and Performance in Deep Transformers
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Unpaired RGB-Thermal Gaussian-Splatting Using Visual Geometric Transformers
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
BMCR: Adaptive Backbone Module Composition via Reinforcement Learning for Remote Sensing Object Detection
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
KV-Control: Parameter-Efficient K/V Injection for Trajectory-Controlled Text-to-Motion
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
FATE: Focal-modulated Attention Encoder for Multivariate Time-series Forecasting
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Do Transformers Need Three Projections? Systematic Study of QKV Variants
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Unlocking Feature Learning in Gated Delta Networks at Scale
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Adaptive Patching Is Harder Than It Looks For Time-Series Forecasting
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Generalizable Multi-Task Learning for Wireless Networks Using Prompt Decision Transformers
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Low-Rank Decay for Grokking in Scale-Invariant Transformers: A Spectral-Geometric View
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
ChessMimic: Per-Rating Transformer Models for Human Move, Clock, and Outcome Prediction in Online Blitz Chess
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
An Empirical Audit of Input Encoders for Multi-Channel Signal Transformers
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Success Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Platonic Transformers: A Solid Choice For Equivariance
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
What Structural Inductive Bias Helps Transformers Reason Over Knowledge Graphs? A Study with Tabula RASA
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Depth-Attention: Cross-Layer Value Mixing for Language Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
DSA: Dynamic Step Allocation for Fast Autoregressive Video Generation
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
DSA: Dynamic Step Allocation for Fast Autoregressive Video Generation
2026-06-04 · BREAKTHROUGH · Impact: HIGH
An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers
2026-06-04 · OPEN_SOURCE · Impact: MEDIUM
Towards Evaluating the Robustness of Visual State Space Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Vision Transformers and Convolutional Neural Networks for Land Use Scene Classification
2026-06-04 · BREAKTHROUGH · Impact: HIGH
Vision Transformers and Convolutional Neural Networks for Land Use Scene Classification
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Cosmos 3: Omnimodal World Models for Physical AI
2026-06-03 · OPEN_SOURCE · Impact: MEDIUM
Cosmos 3: Omnimodal World Models for Physical AI
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Cosmos 3: Omnimodal World Models for Physical AI
2026-06-03 · BREAKTHROUGH · Impact: HIGH