What Do People Actually Want From AI? Mapping Preference Plurality 事件
PRODUCT_LAUNCH2026-06-08影响: MEDIUM
What Do People Actually Want From AI? Mapping Preference Plurality arXiv:2606.06674v1 Announce Type: new Abstract: Large Language Models (LLMs) are often fine-tuned through Reinforcement Learning from Human Feedback (RLHF) to align with people's preferences and values. However, this method has known limitations: it aggregates conflicting preferences, often relies on unrepresentative samples, and uses only binary comparisons. Analysing 1,500 open-ended responses from the PRISM dataset across 75
相关产品查看全部 (10)
相关报道查看全部 (1)
What Do People Actually Want From AI? Mapping Preference Plurality
ArXiv CS.CL2026-06-08