TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories arXiv:2605.31308v1 Announce Type: new Abstract: Agent benchmarks increasingly record rich interaction trajectories, yet evaluation often reduces each rollout to a pass rate or reward score. We introduce TraceGraph, a graph-based framework that turns released multi-model agent trajectories into shared decision landscapes. For each task, TraceGraph builds a graph over observable action-observation states from p