GRACE-DS: a Guarded Reward-guided Agent Correction Environment in Data Science 文章

ArXiv CS.CL2026-06-16NEWSen作者: Aleksandr Tsymbalov, Danis Zaripov, Artem Epifanov, Anastasya Palienko

详细信息

来源站点: ArXiv CS.CL
作者: Aleksandr Tsymbalov, Danis Zaripov, Artem Epifanov, Anastasya Palienko
文章类型: NEWS
语言: en
发布日期: 2026-06-16

摘要

arXiv:2606.16000v1 Announce Type: new Abstract: We introduce GRACE-DS, a Guarded Reward-guided Agent Correction Environment in Data Science for pre-deployment evaluation of LLM-powered AutoML agents. GRACE-DS is a set of evaluation metrics in an isolated environment that can be applied to tabular ML tasks specific to a particular organization. It exposes agents to realistic workflow stages, from planning and data inspection through feature engineering, model development, validation, and code repair to final submission, while hidden executable validators measure not only final predictive performance but also leakage avoidance, reproducibility, protocol validity, correction behavior, and reward alignment. The strongest structured regime, flexible iterative interaction (our approach), achieves higher end-to-end normalized hidden-test quality than single-shot generation, unstructured interaction, and restart-based baselines, while also improving protocol-valid completion.

GRACE-DS: a Guarded Reward-guided Agent Correction Environment in Data Science 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (3)

相关技术