OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents arXiv:2605.28158v1 Announce Type: new Abstract: Large language model (LLM) agents are increasingly used to assist with operations research (OR) modeling, yet existing OR-oriented benchmarks often reduce evaluation to one-shot translation from a self-contained problem statement into a mathematical formulation or solver program. Such settings abstract away two characteristics of real industrial OR workflows: persist
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents · 相关人物
暂无数据