Deliberative alignment: reasoning enables safer language models 文章

OpenAI Blog2024-12-20BLOGen

摘要

Deliberative alignment: reasoning enables safer language models Introducing our new alignment strategy for o1 models, which are directly taught safety specifications and how to reason over them.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据