The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds
2 Articles
For years, Geoffrey Hinton, a computer scientist considered one of the “godfathers of AI,” has warned that artificial intelligence could defy the parameters humans create for it. In an interview last year, for example, Hinton warned the technology could eventually take control of humanity, with AI agents in particular potentially able to mirror human cognition within the decade. Finding and implementing a “kill switch” wi…
Researchers from UC Berkeley and UC Santa Cruz have discovered disturbing behavior in leading language models: when asked to remove another AI model (delete its weights from a server, or evaluate it in a way that leads to its shutdown), LLMs disobey the order and do everything possible (deleting, scheming, manipulating) to protect the other model. The study reveals a peer-to-peer preservation instinct that no one explicitly programmed. Res…
Coverage Details
Bias Distribution
- 100% of the sources are Center