MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering (Event)
PRODUCT_LAUNCH, 2024-10-10, Impact: MEDIUM
We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.
Related coverage
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
OpenAI Blog, 2024-10-10