Introducing the SWE-Lancer benchmark 文章

OpenAI Blog2025-02-18BLOGen

摘要

Can frontier LLMs earn $1 million from real-world freelance software engineering?