Rethinking the Role of Temperature in Large Language Model Distillation

ArXiv CS.AI · 2026-06-02 · News · Authors: Hoang-Chau Luong, Lingwei Chen

Rethinking the Role of Temperature in Large Language Model Distillation · Related Technologies