Restoring the Sweet Spot: Pass-Rate Weighted Self-Distillation for LLM Reasoning 文章

ArXiv CS.AI2026-05-28NEWSen作者: Zehao Liu, Yuanpu Cao, Jinghui Chen, Vasant G. Honavar

Restoring the Sweet Spot: Pass-Rate Weighted Self-Distillation for LLM Reasoning · 相关人物

暂无数据