EconCausal: A Context-Aware Economic Reasoning Benchmark for Large Language Models

ArXiv CS.CL | 2026-05-27 | NEWS | en | Authors: Donggyu Lee, Hyeok Yun, Meeyoung Cha, Sungwon Park, Sangyoon Park, Jihee Kim

Details

Source site: ArXiv CS.CL
Authors: Donggyu Lee, Hyeok Yun, Meeyoung Cha, Sungwon Park, Sangyoon Park, Jihee Kim
Article type: NEWS
Language: en
Publication date: 2026-05-27

Abstract

arXiv:2510.07231v4 (announce type: replace)

Socio-economic causal effects depend heavily on their institutional and environmental contexts. The same intervention can produce different, even opposite, effects across regulatory regimes, market conditions, time periods, or populations. This poses a challenge for large language models (LLMs) in decision-support roles: can they infer the direction of a causal effect under a specified context, and revise that judgment when the context changes? To address this, we introduce EconCausal, a large-scale benchmark of 10,490 context-annotated causal triplets extracted from 2,595 high-quality empirical studies in top-tier economics and finance journals, constructed through a rigorous four-stage pipeline with multi-run consensus, context refinement, and multi-critic filtering. Across models, LLMs often fail to condition their predictions on context. While top models reach 88% accuracy in fixed, explicit contexts, accuracy falls by 32.
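The task described in the abstract, predicting the direction of a causal effect under a stated context, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `CausalTriplet` field names, the `+`/`-` label set, the two toy items (a textbook-style example invented for this sketch, not drawn from EconCausal), and both predictor functions. It shows only why a predictor that ignores context loses accuracy when the same intervention has opposite effects across contexts.

```python
from dataclasses import dataclass
from typing import Callable, Iterable

# Hypothetical schema: field names and the "+"/"-" label set are
# assumptions, not the benchmark's actual data format.
@dataclass(frozen=True)
class CausalTriplet:
    treatment: str
    outcome: str
    context: str
    direction: str  # assumed labels: "+" (increases) or "-" (decreases)

# A predictor maps (treatment, outcome, context) to a direction label.
Predictor = Callable[[str, str, str], str]

def accuracy(predict: Predictor, items: Iterable[CausalTriplet]) -> float:
    """Fraction of items whose predicted direction matches the label."""
    items = list(items)
    if not items:
        raise ValueError("no items to score")
    correct = sum(
        predict(t.treatment, t.outcome, t.context) == t.direction
        for t in items
    )
    return correct / len(items)

# Invented toy items (NOT from EconCausal): the same intervention is
# labeled with opposite directions under two different contexts.
ITEMS = [
    CausalTriplet("minimum wage increase", "employment",
                  "competitive labor market", "-"),
    CausalTriplet("minimum wage increase", "employment",
                  "monopsonistic labor market", "+"),
]

def context_blind(treatment: str, outcome: str, context: str) -> str:
    # Ignores the context entirely and always answers "-".
    return "-"

def context_aware(treatment: str, outcome: str, context: str) -> str:
    # Toy rule that conditions the answer on the stated context.
    return "+" if "monopson" in context else "-"

print(accuracy(context_blind, ITEMS))  # 0.5
print(accuracy(context_aware, ITEMS))  # 1.0
```

In an actual evaluation, the predictor would wrap a model call and the items would come from the benchmark's released data; neither interface is specified in this excerpt.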
