Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

ArXiv CS.AI · 2026-06-06 · Authors: Yongjie Wang, Xinyue Zhang, Kunhong Yao, Zhiwei Zeng, Kaisong Song, Jun Lin, Zhiqi Shen
