FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games 文章

ArXiv CS.AI2026-06-04NEWSen作者: Leonardo Bertolazzi, Katya Tentori, Raffaella Bernardi

查看原文 →

FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games · 相关事件

相关事件

FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games

2026-06-04PRODUCT_LAUNCH影响: MEDIUM