Anthropic says it’s easy to poison LLMs, no matter what size they are

A new study by Anthropic, the AI company behind Claude, has found that poisoning large language models (LLMs) with malicious training data is much easier than previously thought.

Image by Cybernews.

Gintaras Radauskas, Senior Journalist
Oct 10, 2025 (updated Oct 13, 2025)

In LLM training-set-land, dilution isn't the solution to pollution.
John Scott-Railton
