RISE: Reliable Improvement in Self-Evolving Vision-Language Models (Event)

PRODUCT_LAUNCH 2026-05-27 Impact: MEDIUM

arXiv:2605.20914v2 Announce Type: replace

Abstract: Vision-language models (VLMs) have achieved strong multimodal reasoning capabilities, but further improving them still relies heavily on large-scale human-constructed supervision for post-training. Such supervision is costly to obtain, especially for reasoning-intensive multimodal tasks where questions, answers, and feedback signals must be carefully designed. This motivates se