OpenAI today announced an improved version of its most capable artificial intelligence model to dateâone that takes even more time to deliberate over questionsâjust a day after Google announced its first model of this type.
OpenAIâs new model, called o3, replaces o1, which the company introduced in September. Like o1, the new model spends time ruminating over a problem in order to deliver better answers to questions that require step-by-step logical reasoning. (OpenAI chose to skip the âo2â moniker because it’s already the name of a mobile carrier in the UK.)
âWe view this as the beginning of the next phase of AI,â said OpenAI CEO Sam Altman on a livestream Friday. âWhere you can use these models to do increasingly complex tasks that require a lot of reasoning.â
The o3 model scores much higher on several measures than its predecessor, OpenAI says, including ones that measure complex coding-related skills and advanced math and science competency. It is three times better than o1 at answering questions posed by ARC-AGI, a benchmark designed to test an AI modelsâ ability to reason over extremely difficult mathematical and logic problems theyâre encountering for the first time.
Google is pursuing a similar line of research. Noam Shazeer, a Google researcher, yesterday revealed in a post on X that the company has developed its own reasoning model, called Gemini 2.0 Flash Thinking. Googleâs CEO, Sundar Pichai, called it âour most thoughtful model yetâ in his own post. Googleâs new model achieved a high score on SWE-Bench, a test that measures a modelsâ agentic abilities.
However, OpenAIâs new o3 model is 20 percent better than o1. âo3 blew it out of the water,â says Ofir Press, a post-doctoral researcher at Princeton University who helped develop SWE-Bench. âVery surprising increase, not sure how they did it.â
The two dueling models show competition between OpenAI and Google to be fiercer than ever. It is crucial for OpenAI to demonstrate that it can keep making advances as it seeks to attract more investment and build a profitable business. Google is meanwhile desperate to show that it remains at the forefront of AI research.
The new models also show how AI companies are increasingly looking beyond simply scaling up AI models in order to wring greater intelligence out of them.
