When AI Says It Feels 文章

ArXiv CS.CL2026-06-05NEWSen作者: Shin-nosuke Ishikawa, Seiya Ikeda, Hirotsugu Ohba

摘要

arXiv:2606.05734v1 Announce Type: cross Abstract: Large language models (LLMs) are generally constrained from expressing feelings through human-preference alignment in post-training processes. This policy is designed using a top-down approach and may conflict with the goal of training models to exhibit human-like intelligence using human-generated texts. Here, we performed an experiment called Human-like Model eXpressions of Feeling (HMX-feel), in which LLMs were encouraged to express feelings, intentions, and self-awareness through self-rewarded reinforcement learning. We successfully enhanced these capabilities using a rubric-based self-rewarding training scheme with Group Relative Policy Optimization (GRPO). By comparing the trained models with contrastively trained models, we investigated the effects of this approach on performance across various tasks.

相关事件查看全部 (1)

When AI Says It Feels
2026-06-05PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据