SDR: Set-Distance Rewards for Radiology Report Generation 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

SDR: Set-Distance Rewards for Radiology Report Generation arXiv:2606.00440v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards has rapidly advanced reasoning in vision--language models. However, for chest X-ray report generation, the standard rewards (i.e. exact-match accuracy and step-level processes) are incompatible because the reports consist of unordered and orthogonal findings, rather than a causal reasoning chain. We address this gap with a set-based view: each