BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents

Event: PRODUCT_LAUNCH, 2026-06-03. Impact: MEDIUM

arXiv:2606.03829v1 Announce Type: new

Abstract: Financial-research answers are decision-relevant only when another analyst can audit how they were produced: which source was chosen, which period and accounting definition were used, which assumptions were made, and how the calculation was performed. Existing finance benchmarks largely evaluate isolated subskills or final answers, leaving the auditable derivation itself