Federated Language Models Under Bandwidth Budgets: Distillation Rates and Conformal Coverage 文章

ArXiv CS.CL2026-05-28NEWSen作者: Prasanjit Dubey, Xiaoming Huo

摘要

arXiv:2605.09986v2 Announce Type: replace-cross Abstract: Training a language model on data scattered across bandwidth-limited nodes that cannot be centralized is a setting that arises in clinical networks, enterprise knowledge bases, and scientific consortia. We study the regime in which data must remain distributed across nodes, and ask what statistical guarantees are in principle achievable under explicit bandwidth budgets; we aim to characterize what is provably possible, not to demonstrate a deployment-ready system. Existing theory treats either training-time consistency or inference-time calibration in isolation, and no prior work makes bandwidth a first-class statistical parameter. We analyze two protocols, Federated Probe-Logit Distillation (FPLD) for training and Federated Conformal RAG (FC-RAG) for inference, as the analytical vehicles for our results.

Federated Language Models Under Bandwidth Budgets: Distillation Rates and Conformal Coverage 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (2)