FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games

arXiv cs.AI · 2026-06-04 · Authors: Leonardo Bertolazzi, Katya Tentori, Raffaella Bernardi
