Flag

We stand with Ukraine and our team members from Ukraine. Here are ways you can help

Get exclusive access to thought-provoking articles, bonus podcast content, and cutting-edge whitepapers. Become a member of the UX Magazine community today!

Home ›› Artificial Intelligence ›› Why OpenAI’s “Strawberry” Is a Game Changer

Why OpenAI’s “Strawberry” Is a Game Changer

by Andrew Best
3 min read
Share this post on
Tweet
Share
Post
Share
Email
Print

Save

What if the next leap in AI could redefine the very way we understand machine intelligence? Read the article to uncover how OpenAI’s “Strawberry” is set to revolutionize large language models with cutting-edge advanced reasoning and meticulous planning. From tackling simple errors to enhancing overall accuracy, discover how this groundbreaking model aims to overcome the limitations of current LLMs and inch us closer to achieving AGI.

This is a big step closer to AGI.

Sam Altman recently tweeted this picture of a strawberry.

Sam Altman’s picture of a strawberry

Pretty much everyone (including me) believes this is a cryptic tweet about the upcoming release of OpenAI’s “Strawberry”.

What is Strawberry?

Strawberry is a code name for OpenAI’s secret model that is capable of advanced reasoning.

Note — Strawberry was formally called Q* (Q — Star)

Why is “Strawberry” a big deal?

LLMs (large language models) have been very impressive at many tasks, but they have also failed badly in other ways.

I wrote about the most shocking mistake ChatGPT makes.

Basically, ChatGPT fails when you ask it the simple question:

How many r’s are in the word “strawberry”?

It is shocking that it gets this simple question wrong, but it really does.

OpenAI’s “Strawberry” will be able to get a question like this correct.

This is because it will be capable of advanced reasoning.

Some people say it will be “good at math”.

One problem with LLMs currently is they just spit out the first answer that comes to mind.

For example, I just asked ChatGPT to write a paragraph with exactly 42 words.

It gave me a paragraph with only 40 words.

The problem is that in order to do this task correctly, you need to perform some sort of reasoning.

If you ask a human to do this, they will start writing a couple of sentences and then see how many words they have so far.

Let’s say they have 32 words after 2 sentences.

They will then play around with a new sentence until they get one with exactly 10 words.

It is impossible to just start writing and hope that you land on exactly 42 words.

This is because you can’t just stop a sentence wherever you want to.

You need to have some type of planning.

“Strawberry” should be able to do this type of task.

Instead of just writing out the answer immediately, it might do the type of reasoning “in the background” that I’m describing.

Once it gets a paragraph with 42 words, it will count the words in the background to double-check, and then finally post the answer for us to see.

This will take more time and energy to do, but this is where this “advanced reasoning” stuff is heading.

If LLMs are not capable of this type of reasoning, then we can forget about AGI.

But if LLMs are able to do this type of mathematical reasoning and double-check their own answers to make sure they are correct before writing them down, then we could be a lot closer to AGI than we’ve seen.

My personal thoughts on “Strawberry”

I believe that ChatGPT and other LLMs already have this ability if they want to.

For example, there is no reason why OpenAI couldn’t program GPT-4o to run experiments in the background and double or even triple-check the answers before responding.

But this would be so expensive in terms of “compute” or energy costs.

This is all about getting these LLMs to do this efficiently.

Once the efficiency is high enough, OpenAI will release “Strawberry” to the world.

There is still a lot of secrecy behind Strawberry

Strawberry is supposed to be able to perform “planning” and “deep research”.

It will be able to search the internet, make a plan, and perform a series of tasks in the background, BEFORE coming up with a final answer.

I think this will make an enormous difference in the quality of output we get from LLMs.

The article originally appeared on Medium.

post authorAndrew Best

Andrew Best
Andrew Best is an expert in AI, an entrepreneur, and an educator. As the co-founder of AI Growth Guys, he helps businesses and individuals leverage AI to boost their online presence and increase revenue. He writes regularly on Medium about the latest in AI.

Tweet
Share
Post
Share
Email
Print
Ideas In Brief
  • The article explores how OpenAI’s “Strawberry” aims to enhance LLMs with advanced reasoning, overcoming limitations like simple errors and bringing us closer to AGI.
  • It investigates how OpenAI’s “Strawberry” might transform AI with its ability to perform in-depth research and validation, improving the reliability of AI responses.

Related Articles

Learn how one product designer built a faster, sharper workflow where AI does the scaffolding, judgment owns the outcome, and nothing ships without a traceable why.

Article by Pavel Bukengolts
The Spiral Climbs: Ideas Are Expensive, Systems Are Cheap
  • The piece explores that design is no longer about designing screens but owning systems, bets, and outcomes. But the core judgment, empathy, and research are irreplaceable. I chain Miro, Figma, VS Code, GitHub, and Jira into one traceable loop from idea to learning. AI takes on the exploration and scaffolding. People own architecture, security, and accountability. A 48-hour operating cadence of small, measurable bets, linked artifacts, and documented decisions keeps speed honest.
Share:The Spiral Climbs: Ideas Are Expensive, Systems Are Cheap
6 min read

Learn how your AI system can be failing silently while all dashboards are green and what it takes to actually see it.

Article by Kwansah Madani
AI Fails Silently: A Systems Perspective on AI Reliability
  • The article argues that AI systems don’t fail catastrophically, but rather degrade quietly, meaning that normal tools for detecting failure will miss the problem, and engineers need to rethink how they monitor system health from the ground up.
Share:AI Fails Silently: A Systems Perspective on AI Reliability
5 min read

Learn why prompt engineering is a false sense of control and why trustworthy AI must be built on what it can verify, not just what it can say.

Article by Yves Binda
The End of Prompting: Why the Future of AI Experience Design Is Constraint-First
  • The piece claims that prompt engineering creates an illusion of control in AI systems and that the future of AI experience design lies in constraint-first architecture, where what a system can say is governed by what it can actually verify.
Share:The End of Prompting: Why the Future of AI Experience Design Is Constraint-First
8 min read

Join the UX Magazine community!

Stay informed with exclusive content on the intersection of UX, AI agents, and agentic automation—essential reading for future-focused professionals.

Hello!

You're officially a member of the UX Magazine Community.
We're excited to have you with us!

Thank you!

To begin viewing member content, please verify your email.

Get Paid to Test AI Products

Earn an average of $100 per test by reviewing AI-first product experiences and sharing your feedback.

    Tell us about you. Enroll in the course.

      This website uses cookies to ensure you get the best experience on our website. Check our privacy policy and