Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

ArXiv CS.AI · 2026-06-04 · Authors: Guangcheng Zhu, Shenzhi Yang, Haobo Wang, Xing Zheng, Yingfan MA, Xuening Feng, Zhongqi Chen, Bowen Song, Weiqiang Wang, Gang Chen

Related Technologies