Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

ArXiv CS.AI · 2026-06-01 · Authors: Albus Yizhuo Li, Matthew Wicker
