Constitutional On-Policy Safe Distillation 事件

Name: Constitutional On-Policy Safe Distillation
Start: 2026-06-03

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

Constitutional On-Policy Safe Distillation arXiv:2606.03089v1 Announce Type: cross Abstract: On-policy self-distillation (OPSD) has emerged as an efficient post-training paradigm by using a teacher conditioned on privileged information to provide dense token-level supervision. Prior work has shown that OPSD can collapse in verifiable reasoning tasks, but safety alignment differs in that it is guided by high-level constitutions rather than explicit target answers, making it a natural setting to

人工智能

关系图谱

Constitutional On-Policy Safe Distillation 事件

相关公司查看全部 (10)

相关人物查看全部 (2)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)