Measuring the performance of our models on real-world tasks 事件
PRODUCT_LAUNCH2025-09-25影响: MEDIUM
Measuring the performance of our models on real-world tasks OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.
Measuring the performance of our models on real-world tasks · 相关报道
相关报道
Measuring the performance of our models on real-world tasks
OpenAI Blog2025-09-25