Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning · Event

PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM

arXiv:2605.09270v2 · Announce Type: replace-cross

Abstract: Supervised Fine-Tuning (SFT) is widely used for task-specific adaptation, yet recent work shows it systematically undermines reasoning generalization. We argue the root cause is not memorization itself, but its target: vanilla SFT drives models to exploit and memorize spurious surface correlations in problem-solution pairs, leaving them brittle to

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning · Related Technologies