Herculean: An Agentic Benchmark for Financial Intelligence 事件

SHUTDOWN2026-06-02影响: LOW

Herculean: An Agentic Benchmark for Financial Intelligence arXiv:2605.14355v3 Announce Type: replace-cross Abstract: As AI agents improve, the central question is no longer whether they can solve isolated well-defined financial tasks, but whether they can reliably carry out financial professional work. Existing financial benchmarks offer only a partial view of this ability, as they primarily evaluate static competencies such as question answering, retrieval, summarization, and classification. W