Can Generalist Agents Automate Data Curation? Event

PRODUCT_LAUNCH | 2026-06-04 | Impact: MEDIUM

arXiv:2606.04261v1 | Announce Type: cross

Abstract: Curating training data is among the most consequential yet labor-intensive parts of modern AI development: practitioners iteratively propose, implement, evaluate, and revise data policies against noisy benchmark feedback. We ask whether generalist coding agents can automate this data-curation loop. We introduce *Curation-Bench*, an agent-centric benchmark that fixes the model, training recipe, and ev