Evaluating AI’s ability to perform scientific research tasks 文章

OpenAI Blog2025-12-16BLOGen

摘要

OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific research.