Self-Consistency via Marginal Sharpening 文章

ArXiv CS.CL2026-05-28NEWSen作者: Aleksei Arzhantsev, Otmane Sakhi, Nicolas Chopin

摘要

arXiv:2605.28142v1 Announce Type: cross Abstract: Inference-time sampling can elicit strong reasoning abilities from language models without additional training. Existing power-sampling methods do so by sharpening the distribution over full generated outputs, favoring completions that are individually likely under the model. We argue that this is the wrong object to target for reasoning: a completion entangles a reasoning trace with a final answer, whereas what matters is whether an answer is supported by many plausible reasoning paths. We therefore shift the target from the full-output distribution to the sharpened answer marginal, making self-consistency an inference-time objective rather than a post-hoc voting criterion.

相关事件查看全部 (1)

Self-Consistency via Marginal Sharpening
2026-05-28PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据

相关技术

暂无数据