AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models arXiv:2602.18481v2 Announce Type: replace-cross Abstract: The rapid advancement of Large Language Models (LLMs) has led to a surge of financial benchmarks, evolving from static knowledge evaluation toward interactive trading simulations. However, existing frameworks for evaluating real-time trading largely overlook a critical failure mode: the severe behavioral instability of LLMs in sequential decision-