Published 1 day ago • loading... • Updated 7 hours ago

Researchers Show That Hundreds of Bad Samples Can Corrupt Any AI Model

A study found that just 250 poisoned documents were enough to corrupt AI models up to 13 billion parameters in size, showcasing the need for new kinds of defenses.

5 Articles

La Repubblica

Lean Left

250 Documents Are Enough to Poison an a.i.a.

Anthropic has discovered that few hostile texts are enough to implant a hidden door in a linguistic model, regardless of its size...

17 hours ago·Turin, Italy

Read Full Article

WWWhat's new

The Silent Threat that Can Alter Ai Models with only 250 Documents

In the training of large language models (LLMs), it tends to be thought that the quality and mass quantity of data are guarantors of security. But a recent study by Anthropic, in collaboration with the UK AI Safety Institute and the Alan Turing Institute, has turned this idea upside down. Research has shown that there is no need to contaminate large amounts of data to compromise a model: just 250 malicious documents are enough to insert a functi…

7 hours ago

Read Full Article

decrypt.co

Researchers Show That Hundreds of Bad Samples Can Corrupt Any AI Model

A study found that just 250 poisoned documents were enough to corrupt AI models up to 13 billion parameters in size, showcasing the need for new kinds of defenses.

10 hours ago·New York, United States

Read Full Article

Programmez!

Ia: 250 Malicious Documents Can Corrupt an Llm

Finding a bit worrying of a joint study Anthropic and the UK AI Security Institute, with the Alan Turing Institute: 250 malicious / contaminated documents would suffice to produce a back door in a LLM and thus corrupt it over time. Whether in a LLM with 13 billion parameters or a 600 million model, only the training time changes... "Our results question the perceived idea that attackers have to control a (high) percentage of training data; they …

21 hours ago

Read Full Article

Developpez.com

Ai Models Such as Chatgpt~? Gemini and Claude Can Develop "Backdoor" Vulnerabilities when Corrupt Documents Are Inserted Into Their Drive Data

AI models such as ChatGPT, Gemini and Claude can develop "door-to-door" vulnerabilities when corrupt documents are inserted into their training dataIn a study conducted jointly with the UK AI Security Institute and the Alan Turing Institute, Anthropic discovered that only 250 malicious documents can create a "door-to-door" vulnerability in a large language model, regardless of the size of the model or the volume of the data...

1 day ago

Read Full Article

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/year