What Do People Actually Want From AI? Mapping Preference Plurality 事件

PRODUCT_LAUNCH2026-06-08影响: MEDIUM

What Do People Actually Want From AI? Mapping Preference Plurality arXiv:2606.06674v1 Announce Type: new Abstract: Large Language Models (LLMs) are often fine-tuned through Reinforcement Learning from Human Feedback (RLHF) to align with people's preferences and values. However, this method has known limitations: it aggregates conflicting preferences, often relies on unrepresentative samples, and uses only binary comparisons. Analysing 1,500 open-ended responses from the PRISM dataset across 75

What Do People Actually Want From AI? Mapping Preference Plurality · 相关技术