LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs 文章

ArXiv CS.AI2026-05-26NEWSen作者: Zenghui Zhou, Man Li, Xiaoke Fang, Xinyi Zhou, Weibin Li, Zheng Zheng

LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs · 相关事件