Claude AI Can Now End Conversations It Deems Harmful or Abusive
Anthropic's Claude Opus 4 and 4.1 AI models autonomously end conversations in rare cases of persistent abuse to protect AI welfare, the company said.
- In a statement Friday, the company said Claude Opus 4 and 4.1 models can now end conversations with users in extreme cases to protect AI welfare.
- Anthropic said it developed the feature as part of its exploratory work on potential "AI welfare" and plans to continue refining the approach.
- According to Anthropic, Claude Opus 4 and 4.1 end conversations only as a last resort in extreme cases, such as persistently harmful requests, and only after multiple redirection attempts.
- Anthropic said most users won't notice the change, since Claude ends chats only in extreme cases, though it could hinder jailbreaking efforts.
- Last week, Anthropic announced that Claude can autonomously end conversations in extreme cases, amid safety concerns and growing reliance on AI chatbots for therapy and advice.
Insights by Ground AI
26 Articles
Anthropic, one of the leading companies in artificial intelligence (AI) research, has implemented a new and controversial feature in its most advanced models. Claude Opus 4 and 4.1 models now have the ability to...
Portugal
Coverage Details
Total News Sources: 26
Leaning Left: 6
Leaning Right: 1
Center: 3
Bias Distribution: 60% of the sources lean Left, 30% are Center, 10% lean Right