CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions 事件

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions arXiv:2602.20213v2 Announce Type: replace-cross Abstract: The evaluation of Large Language Models (LLMs) for code generation relies heavily on the quality and robustness of test cases. However, existing benchmarks often lack coverage for subtle corner cases, allowing incorrect solutions to pass. To bridge this gap, we propose CodeHacker, an automated agent framework dedicated to generat

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions · 相关报道