Closing the Auto-Research Loop: An AI Co-Scientist for Production Search Ranking 文章

ArXiv CS.AI2026-06-16NEWSen作者: Liwei Wu, Cho-Jui Hsieh

详细信息

来源站点
ArXiv CS.AI
作者
Liwei Wu, Cho-Jui Hsieh
文章类型
NEWS
语言
en
发布日期
2026-06-16

摘要

arXiv:2603.22376v2 Announce Type: replace-cross Abstract: We present an AI Co-Scientist framework that closes the research loop for the production search-ranking system of a large online travel platform -- pairing LLM agents with direct cloud-compute access so that idea generation, code implementation, GPU experimentation, and result analysis iterate end-to-end with a human scientist in the loop. The framework uses a hybrid agent architecture: single-LLM agents handle routine work, while multi-LLM consensus (GPT-5.2, Gemini Pro 3, Claude Opus 4.5) is invoked for higher-stakes decisions. On the production ranking task, a human-designed transformer baseline (V2) yielded $+0.118\%$ over a pre-transformer baseline (V1); the AI Co-Scientist's automated loop on top of V2 contributed an additional $+0.083\%$, for a combined $+0.201\%$ offline gain delivered in roughly one extra week of wall-clock time (single-run numbers; statistical limits discussed in the paper).