Logit Distillation on Manifolds: Mapping by Learning 文章

ArXiv CS.AI2026-06-02NEWSen作者: Yiru Yang, Junling Wang, Nishant Kumar Singh, Luohong Wu, Haoran Yan

摘要

arXiv:2606.00771v1 Announce Type: cross Abstract: A simple way to improve the performance of almost any machine learning model is not to train a single but several models with diverse algorithms which will make slightly distinct kinds of predictions and errors on the same data, and thus improve the average predictions and robustness. However, making predictions using a whole ensemble of models is cumbersome and computationally too expensive to allow deployment to a large number of users, especially if the models are large neural nets. In response to this, we introduce a layer and point wise projection mapping, which maps student and teacher representations into an aligned high-dimensional embedding space during training process.

相关事件查看全部 (1)

Logit Distillation on Manifolds: Mapping by Learning
2026-06-02PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据

相关技术

暂无数据