Tools

The Truth About OpenAI’s o1: Is It Worth the Hype?

Explore the capabilities and limitations of OpenAI’s o1 model, its impact on the AI community, and its potential role in the future of AI in Asia.

Published

on

TL;DR:

  • OpenAI’s o1 model excels at complex reasoning but is more expensive and slower than GPT-4o.
  • Open AI’s o1 is best suited for big, complicated tasks rather than simpler questions.
  • The AI community has mixed feelings about o1’s capabilities and its high cost.

The Arrival of OpenAI’s o1: A Step Forward or Back?

OpenAI recently released its new o1 models, nicknamed “Strawberry,” which pause to “think” before answering. While there’s been much anticipation, the model has received mixed reviews. Compared to GPT-4o, o1 is better at reasoning and complex questions but is roughly four times more expensive. It also lacks the tools, multimodal capabilities, and speed that made GPT-4o impressive. OpenAI even admits that GPT-4o is still the best option for most prompts.

Ravid Shwartz Ziv, an NYU professor studying AI models, shares, “It’s impressive, but I think the improvement is not very significant. It’s better at certain problems, but you don’t have this across-the-board improvement.”

Thinking Through Big Ideas

OpenAI o1 stands out because it breaks down big problems into small steps, attempting to identify when it gets a step right or wrong. This “multi-step reasoning” isn’t new but hasn’t been practical until recently. Kian Katanforoosh, Workera CEO and Stanford adjunct lecturer, explains, “If you can train a reinforcement learning algorithm paired with some of the language model techniques that OpenAI has, you can technically create step-by-step thinking and allow the AI model to walk backwards from big ideas you’re trying to work through.”

However, o1 is pricey. It charges for “reasoning tokens,” which are the small steps the model breaks big problems into. This makes it crucial to use o1 wisely to avoid high costs.

OpenAI o1 in Action

To test o1, I asked ChatGPT o1 preview to help plan Thanksgiving dinner for 11 people. After 12 seconds of “thinking,” it provided a detailed response, breaking down its thinking at each step. It suggested prioritizing oven space and even proposed renting a portable oven. While it performed better than GPT-4o, it also suggested overwhelming solutions for simpler tasks.

Advertisement

For instance, when asked where to find cedar trees in America, o1 delivered an 800+ word response, outlining every variation of cedar tree. GPT-4o provided a concise, three-sentence answer.

Tempering Expectations

The hype around o1 started in November 2023, leading some to speculate that it was a form of AGI. However, OpenAI CEO Sam Altman clarified that o1 is not AGI and is still flawed and limited. The AI community is coming to terms with a less exciting launch than expected.

Rohan Pandey, a research engineer with AI startup ReWorkd, notes, “The hype sort of grew out of OpenAI’s control.” Mike Conover, Brightwave CEO, adds, “Everybody is waiting for a step function change for capabilities, and it is unclear that this represents that.”

The Value of OpenAI o1

The principles behind o1 date back years. Google used similar techniques in 2016 to create AlphaGo. Andy Harrison, former Googler and CEO of the venture firm S32, points out that this brings up an age-old debate in the AI world. One camp believes in automating workflows through an agentic process, while the other thinks generalized intelligence and reasoning would eliminate the need for workflows.

Katanforoosh sees o1 as a tool to question your thinking on big decisions. For example, it can help assess a data scientist’s skills in a 30-minute interview. However, the question remains whether this helpful tool is worth the hefty price tag.

Advertisement

The Future of AI in Asia

The release of o1 raises questions about the future of AI, particularly in Asia. As AI models become more capable, they also become more expensive. The trade-off between cost and capability will shape how AI is adopted and used in the region.

Comment and Share:

What are your thoughts on OpenAI’s o1 model? Have you tried it yet? Share your experiences and thoughts on the future of AI and AGI in the comments below. Don’t forget to subscribe for updates on AI and AGI developments.

You may also like:

Author

Trending

Exit mobile version