Discover All Perspectives.
Published loading...Updated

Anthropic breaks down AI's process — line by line — when it decided to blackmail a fictional executive

Summary by Business Insider
A new Anthropic report shows AI's thought process when deciding to blackmail a company executive in an artificial scenario.Yves Herman/REUTERSAnthropic found in experiments that AI models may resort to blackmail when facing shutdown and goal conflict.AI models train on positive reinforcement and reward systems, similar to human decision-making.Anthropic's Claude Opus 4 had the blackmail rate at 86% even in scenarios without goal conflicts.A new …

Bias Distribution

  • 100% of the sources lean Left
100% Left
Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

Business Insider broke the news in United States on Saturday, June 21, 2025.
Sources are mostly out of (0)