Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences arXiv:2605.30873v1 Announce Type: cross Abstract: Federated Learning (FL) offers a privacy-preserving pathway for aligning Large Language Models (LLMs); however, existing frameworks typically enforce a monolithic reward model, inevitably averaging out inherently conflicting user preferences (e.g., helpfulness vs. harmlessness). While Variational Preference Learning (VPL) offers a pathway to pe

Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences · 相关产品