JobBench: Aligning Agent Work With Human Will 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
JobBench: Aligning Agent Work With Human Will arXiv:2605.26329v1 Announce Type: new Abstract: Current benchmarks for occupational AI agents are scoped primarily by economic values, telling a replacement story. We introduce JobBench, which evaluates AI agents on the workflows that experts identify as high-priority for delegation, empowering humans based on their needs instead of replacing them with GDP value. JobBench covers 130 agentic tasks across 35 occupations. Each task is packaged as a wor