In a new report, OpenAI said it found that AI models lie, a behavior it calls “scheming.” The study, conducted with AI safety company Apollo Research, tested frontier AI models. It found “problematic behaviors” in the models, which most commonly looked like the technology “pretending to have completed a task without actually doing so.” Unlike “hallucinations,” which are akin to AI taking a guess when it doesn’t know the correct answer, scheming is a deliberate attempt to deceive.

Luckily, researchers found some hopeful results during testing. When the AI models were trained with “deliberative alignment,” defined as “teaching them to read and reason about a general anti-scheming spec before acting,” researchers observed large reductions in scheming behavior, with the method producing a roughly 30× reduction in covert actions during testing.
