FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks 文章

ArXiv CS.AI2026-05-29NEWSen作者: Nishal Thomas, Noel Thomas

FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks · 相关技术