Deliberative alignment: reasoning enables safer language models 文章

OpenAI Blog2024-12-20BLOGen

Deliberative alignment: reasoning enables safer language models · 相关产品