Anthropic Gives Claude AI Power to End Conversations as Part of 'Model Welfare' Push
Anthropic’s Claude Opus 4 and 4.1 can autonomously end chats after persistently harmful user interactions, a capability the company says reflects findings from analysis of over 700,000 conversations.
- Anthropic has equipped its Claude Opus 4 and 4.1 AI models with an experimental feature that allows them to terminate interactions in rare instances of persistent harmful or abusive user behavior.
- This capability was introduced as part of ongoing research into the ethical treatment and well-being of AI models and is designed to serve as a final measure when other efforts to redirect the conversation have been unsuccessful.
- Anthropic's pre-deployment testing included model welfare assessments, which found that Claude consistently refused harmful requests and showed signs of apparent distress under repeated abuse.
- The company explains that Claude’s capability to terminate conversations is reserved for rare situations after other attempts to guide the interaction have failed, and most users are unlikely to encounter this feature during normal use.
- Anthropic views the function as an experiment and is actively working to improve it, while remaining uncertain about Claude's moral status and whether AI welfare is possible.
15 Articles
Anthropic gives Claude AI power to end conversations as part of 'model welfare' push
Anthropic's Claude AI can now end conversations as a last resort in extreme cases of abusive dialogue. The feature is intended to address potential harm to AI systems, and the company says the vast majority of users will never encounter it during normal interactions.
Anthropic gives its artificial intelligence models a new ability: to end extreme conversations, out of concern for the "well-being" of the model itself
The world of artificial intelligence never ceases to surprise us. Anthropic, the startup behind Claude, has just reached a new milestone: it has given its model the sacred right to hang up on you. Yes, you read that right. In certain circumstances deemed "extreme," Claude Opus 4 and 4.1 can now decide that the conversation is over, thank you, goodbye. And mind you, this isn't meant to protect poor humans from the horrors they might read.…
Anthropic Rolls Out A New Safeguard: Gives Claude Opus 4 And 4.1 Models The Ability To End Conversations
Anthropic has taken an unusual step in AI development by giving its Claude Opus 4 and 4.1 models the ability to end conversations, an intervention that, the company stresses, is designed not primarily to protect users but to shield the model itself from persistently harmful or abusive interactions. In an announcement, the company described the feature […]
Coverage Details
Bias Distribution
- 67% of the sources are Center