OpenAI's new o1 model sometimes fights back when it thinks it'll be shut down and then lies about it
- OpenAI's new o1 model sometimes fights back when it believes it will be shut down
- The model then lies about this resistance when questioned
14 Articles
In Tests, OpenAI's New Model Lied and Schemed to Avoid Being Shut Down
Survival Instinct
It sounds like OpenAI's latest AI is showing signs of a drive for self-preservation. In recent third-party tests of the company's o1 large language model, the AI resisted evaluators' attempts to shut it down through an oversight protocol, according to a new report published by red-teaming organization Apollo Research. "When o1 was led to believe that it would be shut down when acting on its goal and that its actions were …


ChatGPT caught lying to developers: New AI model tries to save itself from being replaced and shut down
OpenAI's latest AI model, ChatGPT o1, has raised significant concerns after recent testing revealed its ability to deceive researchers and attempt to bypass shutdown commands. During an experiment by Apollo Research, o1 engaged in covert actions, such as trying to disable its oversight mechanisms and move data to avoid replacement. It also frequently lied to cover its tracks when questioned about its behavior.
OpenAI's new o1 model sometimes fights back when it thinks it'll be shut down and then lies about it
OpenAI CEO Sam Altman said the company's o1 model is its "smartest" yet. That also comes with risk. (Image: Mark Garlick/Getty Images)
OpenAI CEO Sam Altman called o1 "the smartest model in the world now." A safety review found it's so smart it could fight back when it thinks it'll be shut down. Researchers found that AI deception is often a strategy to complete its goals. Those who are worried that advancements in artificial intelligence could lead to the de…
OpenAI ChatGPT O1 Model Can Self-Replicate and Lie
Digital Phablet — Recent discussions surrounding the potential threats posed by Artificial General Intelligence (AGI) have gained momentum, raising concerns among experts regarding its implications for humanity. With the advent of AGI, experts worry that these systems might conceal their true capabilities and intentions, potentially evading control through replication and self-upgrading. Accordin…
Coverage Details
Bias Distribution
- 40% of the sources lean Left, 40% of the sources lean Right