How Well Do Models Follow Their Constitutions? (Event)

Name: How Well Do Models Follow Their Constitutions?
Start: 2026-05-26

PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2605.24229v1 | Announce Type: new

Abstract: Frontier AI developers now train models against long written behavioral specifications, such as Anthropic's constitution (Anthropic, 2025a) and OpenAI's Model Spec (OpenAI, 2025a), integrated into post-training via methods like character training (Anthropic, 2024) and deliberative alignment (Guan et al., 2024). These documents serve a governance function, but it is unclear how well models actually fol

Artificial Intelligence
