Event: Estimating worst case frontier risks of open weight LLMs
PRODUCT_LAUNCH, 2025-08-05, Impact: MEDIUM
In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), where we attempt to elicit maximum capabilities by fine-tuning gpt-oss to be as capable as possible in two domains: biology and cybersecurity.
Related coverage (1)
Estimating worst case frontier risks of open weight LLMs
OpenAI Blog, 2025-08-05