Escaping the Mode Lottery: Multi-Response Training Improves Language Model Generalization 文章

ArXiv CS.CL2026-06-02NEWSen作者: Hasan Amin, Kian Ahrabian, Ming Yin, Rajiv Khanna

Escaping the Mode Lottery: Multi-Response Training Improves Language Model Generalization · 相关事件