MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering (Event)

PRODUCT_LAUNCH 2024-10-10 Impact: MEDIUM

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.