Why OpenAI’s “Strawberry” Is a Game Changer

by Andrew Best

What if the next leap in AI could redefine the very way we understand machine intelligence? Read the article to uncover how OpenAI’s “Strawberry” is set to revolutionize large language models with cutting-edge advanced reasoning and meticulous planning. From tackling simple errors to enhancing overall accuracy, discover how this groundbreaking model aims to overcome the limitations of current LLMs and inch us closer to achieving AGI.

This is a big step closer to AGI.

Sam Altman recently tweeted this picture of a strawberry.

Sam Altman’s picture of a strawberry

Pretty much everyone (including me) believes this is a cryptic tweet about the upcoming release of OpenAI’s “Strawberry”.

What is Strawberry?

Strawberry is a code name for OpenAI’s secret model that is capable of advanced reasoning.

Note: Strawberry was formerly called Q* (Q-Star).

Why is “Strawberry” a big deal?

LLMs (large language models) have been very impressive at many tasks, but they have also failed badly in other ways.

I wrote about the most shocking mistake ChatGPT makes.

Basically, ChatGPT fails when you ask it the simple question:

How many r’s are in the word “strawberry”?

It is shocking that it gets this simple question wrong, but it really does.
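For contrast, ordinary code gets this right every time, because it actually looks at each letter. Here is a minimal Python sketch; the function name `count_letter` is only an illustration and has nothing to do with how OpenAI's models work.

```python
def count_letter(word: str, letter: str) -> int:
    """Count how many times `letter` appears in `word`, ignoring case."""
    return word.lower().count(letter.lower())


# Check the question from the article.
print(count_letter("strawberry", "r"))  # prints 3
```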

OpenAI’s “Strawberry” will be able to get a question like this correct.

This is because it will be capable of advanced reasoning.

Some people say it will be “good at math”.

One problem with LLMs currently is they just spit out the first answer that comes to mind.

For example, I just asked ChatGPT to write a paragraph with exactly 42 words.

It gave me a paragraph with only 40 words.

The problem is that in order to do this task correctly, you need to perform some sort of reasoning.

If you ask a human to do this, they will start writing a couple of sentences and then see how many words they have so far.

Let’s say they have 32 words after 2 sentences.

They will then play around with a new sentence until they get one with exactly 10 words.

It is impossible to just start writing and hope that you land on exactly 42 words.

This is because you can’t just stop a sentence wherever you want to.

You need to have some type of planning.

“Strawberry” should be able to do this type of task.

Instead of just writing out the answer immediately, it might do the type of reasoning “in the background” that I’m describing.

Once it gets a paragraph with 42 words, it will count the words in the background to double-check, and then finally post the answer for us to see.
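The draft, count, and double-check routine described above can be sketched in a few lines of Python. This is only an illustration of the idea, not a description of how "Strawberry" works: the names `count_words` and `plan_paragraph` are made up for this example, and the list of candidate sentences stands in for the model "playing around" with a closing sentence.

```python
from typing import List, Optional


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def plan_paragraph(opening: List[str], candidates: List[str], target: int) -> Optional[str]:
    """Keep the opening sentences, then pick a closing sentence that
    brings the paragraph to exactly `target` words."""
    draft = " ".join(opening)
    remaining = target - count_words(draft)  # words still needed
    for sentence in candidates:
        if count_words(sentence) == remaining:
            paragraph = " ".join([*opening, sentence])
            # Double-check the total before "posting" the answer.
            if count_words(paragraph) == target:
                return paragraph
    return None  # no candidate fits; a real system would try new sentences


# Small example: 6 words so far, so the last sentence needs 4 words.
result = plan_paragraph(
    opening=["The cat sat on the mat."],
    candidates=["It purred.", "Then it fell asleep."],
    target=10,
)
print(result)
```

The point is the order of operations: the count happens before the answer is shown, not after.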

This will take more time and energy to do, but this is where this “advanced reasoning” stuff is heading.

If LLMs are not capable of this type of reasoning, then we can forget about AGI.

But if LLMs are able to do this type of mathematical reasoning and double-check their own answers to make sure they are correct before writing them down, then we could be a lot closer to AGI than it seems.

My personal thoughts on “Strawberry”

I believe that ChatGPT and other LLMs could already do this if their developers chose to build it in.

For example, there is no reason why OpenAI couldn’t program GPT-4o to run experiments in the background and double or even triple-check the answers before responding.

But this would be very expensive in terms of “compute” and energy costs.
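To see why checking costs more, here is a rough Python sketch of a generate-and-verify loop. Everything in it is an assumption for illustration: `generate` is a stand-in for a call to a model, `verify` is whatever check you care about, and `answer_with_checks` is a made-up name. It is not OpenAI's method.

```python
from typing import Callable, Optional, Tuple


def answer_with_checks(
    generate: Callable[[int], str],
    verify: Callable[[str], bool],
    max_attempts: int = 3,
) -> Tuple[Optional[str], int]:
    """Call `generate` until `verify` accepts the result.
    Returns the accepted answer (or None) and the number of calls made."""
    calls = 0
    for attempt in range(max_attempts):
        candidate = generate(attempt)  # each call costs compute
        calls += 1
        if verify(candidate):
            return candidate, calls
    return None, calls


# Stand-in "model": canned drafts instead of a real LLM call.
drafts = ["one two three", "one two three four five", "one two"]
answer, calls = answer_with_checks(
    generate=lambda attempt: drafts[attempt],
    verify=lambda text: len(text.split()) == 5,  # want exactly 5 words
)
print(answer, calls)
```

Here the second draft passes, so the answer took two calls instead of one. Every extra check multiplies the cost of a single response.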

This is all about getting these LLMs to do this efficiently.

Once the efficiency is high enough, OpenAI will release “Strawberry” to the world.

There is still a lot of secrecy behind Strawberry

Strawberry is supposed to be able to perform “planning” and “deep research”.

It will be able to search the internet, make a plan, and perform a series of tasks in the background, BEFORE coming up with a final answer.

I think this will make an enormous difference in the quality of output we get from LLMs.

The article originally appeared on Medium.

Andrew Best
Andrew Best is an expert in AI, an entrepreneur, and an educator. As the co-founder of AI Growth Guys, he helps businesses and individuals leverage AI to boost their online presence and increase revenue. He writes regularly on Medium about the latest in AI.

Ideas In Brief
  • The article explores how OpenAI’s “Strawberry” aims to enhance LLMs with advanced reasoning, overcoming limitations like simple errors and bringing us closer to AGI.
  • It investigates how OpenAI’s “Strawberry” might transform AI with its ability to perform in-depth research and validation, improving the reliability of AI responses.
