Selection Without Signal, Recovery Through Expression: A Measurement Study of Post-Hoc Falsification Operators for Frozen Small Code Models

ArXiv CS.CL | 2026-06-16 | NEWS | en | Author: Mehmet Iscan

Details

Source site
ArXiv CS.CL
Author
Mehmet Iscan
Article type
NEWS
Language
en
Publication date
2026-06-16

Abstract

arXiv:2606.16999v1 (announce type: cross)

Frozen small code models (=45. Two operators help on a different axis, outside the semantic output space. An expression-layer recovery (M1), the only accuracy gain here, recovers correct programs that the standard extractor discards, using robust extraction and public-test signature alignment. It does no harm (b10=0), is leakage-free, and lifts DeepSeek-Coder-1.3B by +12 tasks on HumanEval+ (p=2.4e-4). An adaptive consensus early-stop (ACE) is a calibrated compute-saving control (~19% saving, zero harm). M1 and the selection negative replicate on HumanEval+ and MBPP+ across three model cells. The lesson: fix the harness and measure coverage before blaming semantic post-hoc reasoning.
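
The reported significance for M1 can be sanity-checked with a few lines of arithmetic. The sketch below rests on assumptions the abstract does not state: that `b10` counts tasks the baseline solves but M1 breaks, that `b01` (a name introduced here) counts tasks only M1 solves, and that the p-value comes from a one-sided exact sign test (the exact form of McNemar's test) on those discordant pairs. With b10=0, a net gain of +12 tasks implies b01=12.

```python
from math import comb


def one_sided_sign_test(b01: int, b10: int) -> float:
    """Exact one-sided p-value P(X >= b01) for X ~ Binomial(b01 + b10, 0.5).

    b01: tasks solved only by the treatment (hypothetical name).
    b10: tasks solved only by the baseline.
    """
    n = b01 + b10
    if n == 0:
        # No discordant pairs: no evidence either way.
        return 1.0
    # Upper tail of a fair-coin binomial over the discordant pairs.
    return sum(comb(n, k) for k in range(b01, n + 1)) / 2 ** n


# Assumed counts: 12 tasks recovered, 0 tasks harmed.
p = one_sided_sign_test(b01=12, b10=0)
print(f"{p:.1e}")  # 2.4e-04
```

Under these assumptions the result is 0.5^12, roughly 2.4e-4, which agrees with the figure quoted in the abstract. A two-sided version of the same test would give twice that value.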
