After someone accessed Mythos, cyber pros doubt Anthropic can walk the walk


Anthropic is marketing its new bug-hunting AI model, Mythos, as too powerful for public release. But after a few Discord users simply used a contractor’s credentials and some basic internet sleuthing to get in, cybersecurity professionals are ringing the alarm bells.

The “small group of unauthorized users” was said to have accessed the advanced Mythos AI model the same day Anthropic began rolling out a preview of the model to a limited group of approved companies for testing in late February.

According to Bloomberg, the unnamed Discord users got into Mythos through the access one of them had as a worker at a third-party contractor for Anthropic, and by deploying methods used by cybersecurity researchers.

ADVERTISEMENT

True, the aforementioned group of users hasn’t run any cybersecurity prompts on the model and said they just wanted to play around with the technology, which Anthropic heavily markets as the game-changer in the industry, rather than cause trouble.

Contractor’s credentials were enough

Nonetheless, news of the potential breach will further alarm authorities who have already raised concerns about Mythos’s potential to wreak havoc – and cybersecurity professionals agree. Actually, they warn that the incident is a sign of things to come.

Anthropic Claude Mythos
Image by Cybernews.

Shane Fry, chief technology officer at RunSafe Security, points out that someone was able to access Anthropic’s Mythos model by – reportedly – just changing the model name.

“Even if their intent is just to explore, it shows how easily these systems can be exposed. The reality is these AI capabilities are already out there, ‘hacked’ or not, and they’re going to accelerate how quickly vulnerabilities are found and exploited,” said Fry.

“Software teams will need to look at how to harden their code so those vulnerabilities can’t be used in the first place.”

Another cyber pro entirely unsurprised by this particular incident is Nagomi CEO Emanuel Salmona, who says that how seriously Anthropic talks about security and how it actually performs are two different things.

ADVERTISEMENT

Has your password leaked?

Enter your password to check if it has leaked. Having a leaked password creates the risk of identity theft, financial damages, and worse!
35,607,543,468
Exposed Passwords
Ad
Protect your personal information from cybercriminals and get 50% off the top-rated password manager
link_title link_title

“Anthropic did everything right on paper. A contractor’s credentials were enough to get in anyway,” said Salmona.

“If the most safety-conscious AI lab in the world can’t close that distance between policy and execution, every organization running on AI infrastructure should be asking themselves honestly whether they can.”

False sense of control and security

Indeed, the UK’s AI Security Institute already warned last week that if bad actors were to get their hands on Mythos, the model would most likely be able to carry out attacks on IT systems many times faster than any human professionals.

Tim Mackey, head of software supply chain risk strategy at Black Duck, points his finger at Anthropic’s marketing strategy.

Project Glasswing
Jonathan Raa/NurPhoto via Getty Images

To Mackey, it looks like that Anthropic’s marketing message for Mythos was effectively a challenge not dissimilar to a capture-the-flag exercise, where success includes claims of unauthorized access to Mythos.

“The unfortunate reality is that while it’s great to hear that novel cybersecurity models are being provided to select researchers to evaluate, if your team is on the outside looking in, waiting for the final report might not be top of mind,” said Mackey.

“For defenders, even the specter of unauthorized access to an adversarial model as powerful as Mythos is purported to be, only increases anxiety levels.”

ADVERTISEMENT

Raluca Saceanu, CEO of cybersecurity company Smarttech247, thinks that Anthropic has chosen a pretty confusing course of action in the hopes that outsiders will see the company as behaving responsibly.

jurgita justinasv Izabelė Pukėnaitė vilius Ernestas Naprys Gintaras Radauskas
Don't miss our latest stories on Google News. Add us as your Preferred Source on Google

“Limiting access to a small number of organizations may have created a sense of control, but in reality, it also created a false sense of security. Once a capability like this exists outside the lab, even in a restricted preview, the risk profile changes immediately,” said Saceanu.

“These AI systems are incredibly powerful. The potential isn’t just for traditional hacking – it’s about how these tools can be accessed and used in ways that weren’t intended.”

According to Saceanu, in this particular case, the intent of those Discord users indeed wasn’t criminal, but it shows that simply by existing, Mythos has the potential to be accessed inappropriately by bad actors.

“Our advice to organizations is to closely monitor developments, strengthen internal controls, and assume attackers will continue evolving how they exploit these models,” she said.


Unlock more exclusive Cybernews content on YouTube.

ADVERTISEMENT