TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL 文章

ArXiv CS.AI2026-06-02NEWSen作者: Tianze Yang, Yucheng Shi, Ruitong Sun, Jingyuan Huang, Ninghao Liu, Jin Sun

TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL · 相关事件

相关事件