OpenAI has recently unveiled its latest AI model series called o1, which represents a significant advancement in the field of artificial intelligence by focusing on complex reasoning capabilities.
Unlike its predecessors, the o1 series is designed to spend more time “thinking” before responding, much like a human would when faced with difficult problems. This shift marks a bold move towards developing AI that can engage in logical problem-solving, a step beyond merely predicting the next word in a sentence.

A leap in reasoning
The key feature of OpenAI’s o1 models is their ability to perform chain-of-thought reasoning. OpenAI explains that these models are trained to break down complex problems into simpler steps, explore different strategies, and learn from their mistakes. This process enables them to handle more complicated tasks where backtracking or deeper thinking is necessary. For instance, in benchmark tests, the o1 model demonstrated impressive problem-solving abilities in areas such as physics, chemistry, biology, math, and coding.
To illustrate, in a qualifying exam for the International Mathematics Olympiad, OpenAI’s previous flagship model, GPT-4o, solved only 13% of the problems, whereas the o1 model solved a remarkable 83%. Similarly, in coding challenges, o1 scored in the 89th percentile in Codeforces competitions. These results showcase how o1 excels in handling intricate and multi-step tasks.

Limitations and trade-offs
Despite its advanced reasoning capabilities, the o1 model comes with certain trade-offs. Unlike GPT-4o, o1 lacks key functionalities like web browsing and file or image uploads. Additionally, it’s significantly slower than GPT-4o, as the model takes extra time to process prompts in order to think through its responses. This slower response time is by design, as it allows the model to refine its thought process, but it can be a disadvantage for users who need quick answers.
Another challenge is that the o1 model, like many AI models, still struggles with hallucinations, where it fabricates information. OpenAI’s Chief Research Officer, Bob McGrew, acknowledged this limitation, stating that the model hasn’t fully solved the problem of generating inaccurate information. This indicates that while the model’s reasoning is more advanced, there’s still progress to be made in ensuring the accuracy of its outputs.
A focus on safety
One of the most noteworthy aspects of the o1 series is its emphasis on safety. OpenAI has integrated a new safety training approach that leverages the model’s reasoning capabilities to adhere to safety guidelines. During testing, the o1-preview model outperformed GPT-4o in resisting attempts to bypass safety protocols, scoring 84 out of 100 on one of the most challenging jailbreaking tests, compared to GPT-4o’s score of 22.
In collaboration with AI Safety Institutes in the U.S. and the U.K., OpenAI has granted early access to a research version of the o1 model to ensure rigorous safety evaluations. These partnerships aim to develop processes for research, evaluation, and testing, ultimately strengthening AI safety across future models.
Who benefits from o1?
The o1 model is particularly well-suited for users in fields such as science, coding, and mathematics, where reasoning through multi-step problems is critical. For example, healthcare researchers can use the model to annotate cell sequencing data, while physicists can employ it to generate complex mathematical formulas for quantum optics. Developers, too, can benefit from o1’s ability to execute multi-step workflows and debug complex code.
To cater specifically to developers, OpenAI has also released o1-mini, a smaller, more cost-effective version of the o1 model. While o1-mini lacks broad world knowledge, it excels at generating and debugging complex code. Its reasoning abilities make it a powerful tool for coding applications, offering 80% lower costs than the o1-preview model.
What’s next for OpenAI?
OpenAI is committed to continuously improving the o1 series, with plans to introduce additional features like web browsing, file uploading, and image inputs. These updates are expected to make the o1 models more versatile and user-friendly, bridging the gap between advanced reasoning and practical functionality.
In addition to further developments in the o1 series, OpenAI plans to continue its work on the GPT series, ensuring that both branches of AI models evolve to meet a broad range of needs. As the o1 models continue to evolve, users can expect regular updates and improvements that will enhance their problem-solving capabilities.