FASE: Fast Adaptive Semantic Entropy for Code Quality 文章

ArXiv CS.AI2026-06-09NEWSen作者: Shizhe Lin, Ladan Tahvildari

详细信息

来源站点
ArXiv CS.AI
作者
Shizhe Lin, Ladan Tahvildari
文章类型
NEWS
语言
en
发布日期
2026-06-09

摘要

arXiv:2606.09800v1 Announce Type: cross Abstract: Multi-agent code generation offers a promising paradigm for autonomous software development by simulating the human software engineering lifecycle. However, system reliability remains hindered by LLM hallucinations and error propagation across interacting agents. While semantic entropy provides a principled way to quantify uncertainty without ground-truth answers, current methods often rely on costly LLM-driven equivalence checks. In this work, we introduce Fast Adaptive Semantic Entropy (FASE), a novel metric that approximates functional correctness based on the minimum spanning tree of structural and semantic dissimilarity graphs. Evaluations on HumanEval and BigCodeBench demonstrate that FASE outperforms state-of-the-art semantic entropy by LLM entailment, achieving a 25% average improvement in Spearman correlation and a 19% increase in ROCAUC score against Pass@1 from ground-truth test cases when using the Qwen3-Embedding-8B model.