Evaluating chain-of-thought monitorability (Article)

OpenAI Blog | 2025-12-18 | BLOG | en

Summary

OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. The findings show that monitoring a model’s internal reasoning is far more effective than monitoring outputs alone, offering a promising path toward scalable control as AI systems grow more capable.

Related events

Evaluating chain-of-thought monitorability
2025-12-18 | PRODUCT_LAUNCH | Impact: MEDIUM
