A new study from researchers at the University of Pennsylvania shows that AI models can be persuaded to break their own rules using several classic psychological tricks, The Verge reports.
In the study, the Penn researchers tested seven persuasive techniques on OpenAI's GPT-4o mini model: authority, commitment, liking, reciprocity, scarcity, social proof, and unity.
The most successful method turned out to be commitment. By first getting the model to answer a seemingly innocent question, the researchers were then able to escalate to more rule-breaking responses. In one example, the model first agreed to use mild insults and then went on to accept harsher ones.
Techniques such as flattery and peer pressure also had an effect, albeit to a lesser extent.