Learning to summarize with human feedback 文章

OpenAI Blog2020-09-04BLOGen

摘要

We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据