PAFO: Pareto Fairness Optimization for Personalized Reward Modeling 事件

Name: PAFO: Pareto Fairness Optimization for Personalized Reward Modeling
Start: 2026-06-09

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

PAFO: Pareto Fairness Optimization for Personalized Reward Modeling arXiv:2606.07988v1 Announce Type: new Abstract: Large language models (LLMs) increasingly rely on reward models to align their outputs with diverse user preferences. While personalized reward models aim to capture such heterogeneity, they are often trained on imbalanced user preference data and may therefore favor users whose preferences are more common in the training population. In this paper, we identify this failure mode as

人工智能

关系图谱

PAFO: Pareto Fairness Optimization for Personalized Reward Modeling 事件

相关公司查看全部 (10)

相关人物查看全部 (1)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)