Anthropic has announced a new experimental safety feature that allows its Claude Opus 4 and 4.1 artificial intelligence models to terminate conversations in rare, persistently harmful or abusive scenarios. The move reflects the company's growing focus on what it calls "model welfare," the notion that safeguarding AI systems, even if they're not sentient, is a prudent step in alignment and ethical design.
According to Anthropic's own research, the models are designed to end a dialogue only after repeated harmful requests, such as those seeking sexual content involving minors or instructions that could facilitate terrorism, and only once the AI has already refused and attempted to steer the conversation in a constructive direction. In testing, the models sometimes exhibited what Anthropic describes as "apparent distress" when pushed on such requests, an observation that guided the decision to give them the ability to end these conversations.