Anthropic says some Claude models can now end ‘harmful or abusive’ conversations

Anthropic has announced new capabilities that will allow some of its newest, largest models to end conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions.” Strikingly, Anthropic says it’s doing this not to protect the human user, but rather the AI model itself.

To be clear, the company isn’t claiming that its Claude AI models are sentient or can be harmed by their conversations with users. In its own words, Anthropic remains “highly uncertain about the potential moral status of Claude and other LLMs, now or in the future.”

However, its announcement points to a recent program created to study what it calls “model welfare” and says Anthropic is essentially taking a just-in-case approach, “working to identify and impleme
