OpenAI's performance charts in the GPT-5 launch video are such a mess you have to think GPT-5 itself probably made them, and the company's attempted fixes raise even more questions

What to make of OpenAI's latest GPT-5 chatbot ? Let's just say the reception from users has been sufficiently mixed to have OpenAI head honcho Sam Altman posting apologetically on X. And more than once . But one thing we can say for sure, the charts in the launch video were a bizarre mess that OpenAI has since attempted to tidy up, to mixed avail.

Most obviously, the claimed SWE-bench performance of GPT-5 versus older model shown on launch day was badly botched. The chart showed accuracy figures of 74.9% for ChatGPT 5, 69.1% for OpenAi o3 and 30.8% for GPT-4o.

Problem is, the bar graph heights were exactly the same for the latter two, giving the at-a-glance impression of total dominance for GPT-5 when in fact it is only marginally superior to OpenAI o3.

It's a basic enough mistake that

See Full Page

Interests (0)

Settings

OpenAI's performance charts in the GPT-5 launch video are such a mess you have to think GPT-5 itself probably made them, and the company's attempted fixes raise even more questions

Top White House official admits Trump can't accomplish this key goal without Congress

Trump's GOP nemesis vows to parade Epstein victims in Congress

Trump's Plan to Address Homelessness in D.C. Raises Concerns

Trump wants Medicare to pay for your Ozempic treatment. Taxpayers may foot the bill for billions in fraud

Rural Emergency Rooms Are Increasingly Run Without Doctors

I don’t normally strength train on vacation, but this time I did

'Stirred rage': Extremists — 'emboldened' by Trump — 'increasingly view him as an enemy'

Rockies first-round pick Ethan Holliday collects two hits in pro debut

Scrubby is a top robot. Scrubby scrubs whiteboards. Scrubby un-scrubs whiteboards. Scrubby sleeps to scrub again.

GameSir's G7 Pro has removable faceplates, TMR hall effect sticks, and 1000Ghz polling rate for under $80

Today's Wordle clues, hints and answer for August 13 #1516

Rescue Efforts Ongoing After Explosion at Steel Plant

Trump has been on a roll for the ages — but blowback could be looming

Tropical Storm Erin May Become First Hurricane of 2025

Danielle Spencer, 'What's Happening!!' Star, Dies at 60

Mom Changes Baby's Diaper in SUV Trunk—Stunned by What Stranger Yells

Happy move for Cristiano Ronaldo as Georgina Rodríguez announces their engagement

An awful Trump secret is about to come crashing into the open

Two Killed in Target Shooting in Austin, Suspect Detained

One killed, five injured in Chicago mass shooting

Police Recover $30,000 in Stolen Labubu Collectibles