Escape the Language Prior: Mitigating Late-Stage Modality Collapse in Audio Reasoning via Modality-Aware Policy Optimization (Event)

PRODUCT_LAUNCH · 2026-05-28 · Impact: MEDIUM

arXiv:2605.27741v1 · Announce Type: new

Abstract: Audio and omni-modal large language models exhibit impressive cross-modal reasoning capabilities. However, applying standard reinforcement learning post-training algorithms to these models exposes a critical structural vulnerability: methods like GRPO apply uniform policy gradients across all tokens, ignoring their unequal d

Related companies

arXiv · NONPROFIT
Framework · COMPANY
EARN · NONPROFIT
ACT · NONPROFIT
Unifor · NONPROFIT
Ratio · RESEARCH_INSTITUTE
chain · COMPANY
VIA · COMPANY