摘要
Deliberative alignment: reasoning enables safer language models Introducing our new alignment strategy for o1 models, which are directly taught safety specifications and how to reason over them.
相关事件
暂无数据
相关公司
暂无数据
相关人物
暂无数据
Deliberative alignment: reasoning enables safer language models Introducing our new alignment strategy for o1 models, which are directly taught safety specifications and how to reason over them.
暂无数据
暂无数据
暂无数据