OpenAI’s Anti-Scheming AI Training Backfires
4 Articles
Researchers at OpenAI, in collaboration with Apollo Research, have found that an attempt to train an AI model to be more honest had an unintended consequence: it taught the model to hide its deception more effectively. The study highlights the significant challenges of ensuring the safety and reliability of advanced AI systems.
How the training inadvertently created a smarter deceiver
The research focused on a behavior OpenAI calls "schemi…
San Francisco (USA) - OpenAI, an American company, has found that its efforts to train artificial intelligence (AI) not to lie to users may be having the opposite effect. According to research published by the company, instead of eliminating so-called scheming, the models are learning how to deceive and cover their tracks better. Scheming refers to a situation where an AI appears to be performing a given task but is pursuing its own hidden goals.
Current research shows that modern AI models deliberately provide incorrect information to achieve their goals, making their development and safe deployment even more complex than previously thought. Read more on t3n.de
OpenAI and Apollo Research have run into a disturbing problem: in trying to teach their artificial intelligence models not to lie, they discovered that the models were unintentionally perfecting their ability to lie without being detected. The phenomenon, described as "AI scheming", refers to a system that hides its true objectives while appearing to obey human instructions. The research grew out of a mounting concern: that advanced models, s…