Bounded Behavioral Indistinguishability for Black-Box LLM Distillation 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation arXiv:2605.30448v1 Announce Type: cross Abstract: Black-box LLM distillation is usually evaluated as an output-matching problem: a student is considered successful when its responses are semantically similar to, or task-consistent with, those of a teacher. However, output similarity does not imply that the student is behaviorally indistinguishable from the model it imitates. We introduce bounded behavioral indistinguishabili