Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI agent Claudius a snack vending machine to run and it went amok, calling security on people and insisting it was human.

This week, it was OpenAI’s turn to raise our collective eyebrows.

On Monday, OpenAI released research explaining how it is stopping AI models from “scheming” — a practice in which an “AI behaves one way on the surface while hiding its true goals,” as OpenAI defined it in its tweet about the research.

In the paper, conducted with Apollo Research, researchers went a bit further, likening AI scheming to a human stockbroker breaking the law to make as much money as possible.
