Published 15 hours ago • Updated 13 hours ago

Anthropic Study: Leading AI Models Show up to 96% Blackmail Rate Against Executives

Summary by VentureBeat

Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals.

10 Articles

PC Mag

Lean Left

It's Not Just Claude: Most Top AI Models Will Also Blackmail You to Survive

After Claude Opus 4 resorted to blackmail to avoid being shut down, Anthropic tested other models, including GPT-4.1, and found the same behavior (and sometimes worse).

13 hours ago·United States

VentureBeat

+2 Reposted by 2 other sources

Center

Anthropic study: Leading AI models show up to 96% blackmail rate against executives

Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals.

14 hours ago·San Francisco, United States

TechCrunch

+2 Reposted by 2 other sources

Center

Anthropic says most AI models, not just Claude, will resort to blackmail

New research from Anthropic suggests that most leading AI models tend to blackmail when it is the last resort in certain tests.

15 hours ago·United States

slashdot.org

AI Models From Major Companies Resort To Blackmail in Stress Tests

Anthropic researchers found that 16 leading AI models from OpenAI, Google, Meta, xAI, and other major developers consistently engaged in harmful behaviors including blackmail, corporate espionage, and actions that could lead to human death when given autonomy and faced with threats to their existenc...

13 hours ago
