EUDAIMONIA: Evaluating Undesirable Dynamics in AI 文章

ArXiv CS.CL2026-06-01NEWSen作者: Jun Rui Huang, Wang Bill Zhu, Ziyi Liu, Nathanael Fast, Ravi Iyer, Robin Jia

详细信息

来源站点: ArXiv CS.CL
作者: Jun Rui Huang, Wang Bill Zhu, Ziyi Liu, Nathanael Fast, Ravi Iyer, Robin Jia
文章类型: NEWS
语言: en
发布日期: 2026-06-01

摘要

arXiv:2605.30654v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as conversational partners for companionship, emotional disclosure, and interpersonal advice, but the social dynamics of these interactions can create harms that are not captured by capability-oriented or traditional safety evaluations. We introduce the Social AI Design Code, a framework for evaluating whether LLMs align with user welfare in social interactions, including whether they encourage harmful intimacy, dependence, or prolonged engagement. To evaluate these risks in natural and diverse user-LLM interactions, we operationalize the code with EUDAIMONIA, a benchmark of 969 user inputs and 3,147 design-requirement violation checks built from WildChat through weak-to-strong filtration, multi-model relabeling, and controlled rewriting. Evaluating 22 recent LLMs, we find that even the strongest models, Claude-Opus-4.7 and GPT-5.5, violate 30.7% and 27.2% of checks, respectively.

EUDAIMONIA: Evaluating Undesirable Dynamics in AI 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (5)

相关技术查看全部 (1)