FastSLM: Hierarchical Temporal Abstraction for Efficient Long-Form Speech Adaptation (Event)

Name: FastSLM: Hierarchical Temporal Abstraction for Efficient Long-Form Speech Adaptation
Start: 2026-06-02

BREAKTHROUGH · 2026-06-02 · Impact: HIGH

arXiv:2601.06199v3. Announce Type: replace-cross.

Abstract: Scaling Multimodal Large Language Models (MLLMs) to long-form speech is bottlenecked by the explosive growth of input tokens. Unlike images or videos, audio lacks overlapping information, making extreme 1-token compression highly susceptible to the loss of fine-grained acoustic cues. To overcome this, we propose FastSLM, a token-efficient architecture fe

Artificial Intelligence
