OpenAI releases GPT-5, the "PhD-level expert" chatbot


Sam Altman and his artificial intelligence company OpenAI have introduced their latest flagship model, GPT-5, the most intelligent chatbot in existence. It can literally write entire computer programs in one prompt.

Key takeaways:

CEO Sam Altman kicked off the one-hour-plus livestream event on the OpenAI YouTube channel and its X profile on Thursday at 10:00 a.m. Pacific Time/1:00 p.m. Eastern Time.

ADVERTISEMENT

Calling the chatbot “a significant step along our path to AGI” Altman boasted that the “useful, smart, fast, and intuitive” GPT-5 is a major upgrade over the company’s previous model.

The San Francisco-based AI start-up and over a dozen top team members demoed GPT-5’s most exciting features, showing off its “state-of-the-art performance across coding, math, writing, health, and visual perception.”

OpenAI is rolling out a basic version of GPT-5 to all its 700 million users for free on Thursday. However, free users will be given a limit on the number of prompts allowed.

When a user hits the so far undisclosed limit, they'll be automatically transitioned to GPT-5 mini, "a smaller but still highly capable model," OpenAI said.

Plus subscribers are being granted higher usage limits, and Pro subscribers are getting access to a GPT-5 Pro model – said to provide users with “even more comprehensive and accurate answers.” The Enterprise and Edu GPT-5 model versions will be available next week.

Smarter than the smartest person

ADVERTISEMENT

“Someday soon, something smarter than the smartest person you know will be running on a device in your pocket, helping you with whatever you want. This is a very remarkable thing,” Altman teased earlier this week.

As of its release, GPT-5 now holds the highest Chatbot Arena score to date, outshining Gemini 2.5, Grok 4, and all other existing large language model competitors.

“It's like talking to an expert, a legitimate PhD-level expert in anything,” Altman said.

GPT-5 chatbot arena score comparison
Image by OpenAI

“GPT‑5’s responses are ~45% less likely to contain a factual error than GPT‑4o, and when thinking, GPT‑5’s responses are ~80% less likely to contain a factual error than OpenAI o3,” the company said.

Furthermore, when measuring reasoning for economically valuable work, OpenAI said GPT‑5 is comparable to or better than experts in about 50% of cases across tasks in “over 40 occupations, including law, logistics, sales, and engineering.”

Solving the reasoning paradigm

The “incredible superpower on demand” is said to be light on hallucinations and heavy on reliability when it comes to reasoning complex, open-ended questions.

Before GPT-5, "our users have had to pick between the fast responses of standard GPT or the slow, more thoughtful responses from our reasoning models,” said OpenAI’s Chief Research Officer Mark Chen.

“GPT-5 eliminates this choice, aiming to think just the perfect amount to give you the perfect answer,” Chen said.

ADVERTISEMENT
vilius Ernestas Naprys Konstancija Gasaityte profile Marcus Walsh profile
Don’t miss our latest stories on Google News

In one demo shown, GPT-5 was instructed to answer the user's query with either one-word, concise, or more expanded responses, and for those using speech prompts, even allowing the user to choose more personalized GPT options to receive their answers.

Users will also get to choose their chatbot’s communication style from four options in the settings menu – the Cynic, the Robot, the Listener, and the Nerd, of course. The personality descriptions? Concise and professional, thoughtful and supportive, or a bit sarcastic.

Eventually coming to voice, the feature is only available in text at the moment.

Computer programming on demand

The OpenAI team said since the launch of Chat-GPT in November 2022 they have gained “a much better understanding of how people actually want to work with chat,” allowing developers to “optimize” GPT-5 for use cases, such as coding.

“We've come a long way from the days when, you know, only 5 to 10 lines of code were working, and now it's amazing that you can produce these kinds of apps on demand,” said Chen.

Calling GPT-5 “the best coding model on the market today,” Altman predicts the idea of “software on demand” will be one of the defining characteristics of the GPT-5 era.

ADVERTISEMENT
GPT-5 Jumping Ball app
Image by OpenAI.

GPT-5 proves it can quickly create websites, apps, and games, with early testers reporting better design choices and understanding of visual aesthetics such as spacing, typography, and white space.

In a demonstration, OpenAI researcher Yann Dubois showed off GPT-5 capabilities, writing code to create an app based on only one prompt description.

Dubois, who is a native French speaker, asked GPT-5 to build a web app for his partner to learn how to speak French.

Here is pretty much the exact prompt:

“Create a beautiful and highly interactive web app for my partner, an English speaker at Tulane (University) French. Use a highly engaging theme. Include a variety of activities, like flashcards and quizzes that she can interact with. To make it even more fun for her, embed an educational game based on the old snake game, but replace the snake with a mouse and the apples with cheese. Every time the mouse eats a piece of cheese, voice over a new French word so my partner can practice her pronunciation.”

One prompt and GPT-5 can build an app on demand.

Barely two minutes later, GPT-5 “already wrote 240 lines of code, which is honestly much more than what I would have written in that time,” Dubois says, and then proceeds to run the code, showing off three different working versions of the newly created app.

GPT-5 French Learning app 1
OpenAI demo of GPT-5 building a French learning app from one prompt within minutes. Image by OpenAI via Cybernews.
GPT-5 French Learning app mouse game
OpenAI demo of GPT-5 building a French learning app shows two versions of a mouse and cheese word game. Image by OpenAI via Cybernews
ADVERTISEMENT

GPT-5 "opens up a whole new world of vibe coding… even those who do not know how to write code can bring their ideas to life,” Dubois said. The new model is also equipped with upgraded writing capabilities, which make it more collaborative and reliable and possess more “literary depth and rhythm.”

Finally, GPT-5 is considered the best model for health-related questions, surpassing all other models previously reached benchmarks.

Acting like a health advocate or partner, the chatbot can provide the user with more in-depth medical knowledge, helping individuals understand diagnoses and make more informed decisions.

Earlier this week, Altman announced on X that OpenAI will now provide “ChatGPT access to the entire federal workforce” at the bargain basement price of only $1 per year for each government agency.

A robot deep in thought.
Image by Cybernews

FAQ

ADVERTISEMENT