Model Unlearning Objectives Vary for Distinct Language Functions (Event)

Name: Model Unlearning Objectives Vary for Distinct Language Functions
Start: 2026-05-27

Type: PRODUCT_LAUNCH
Date: 2026-05-27
Impact: MEDIUM

arXiv:2605.26454v1
Announce Type: new
Abstract: Large language models (LLMs) learn undesirable properties during pretraining, including dangerous knowledge and toxic text generation. Just as post-training uses different objectives to shape different behaviors, we argue that unlearning methods should be designed for the language function at issue. To study this, we consider two mechanistically distinct unlearning goals, dangerous-k

Large language models
