MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering 文章

OpenAI Blog2024-10-10BLOGen

摘要

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据