ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale 文章

ArXiv CS.AI2026-05-26NEWSen作者: Noel Thomas

ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale · 相关技术