Image

Anthropic Says Claude Turned Evil for a Bizarre Reason

In a classic example of the AI industry’s reputational alchemy, Anthropic has often transformed bad behavior by its flagship model Claude into fresh hype.

When it revealed its Mythos Preview model last month, for example, the company declared that the system had “reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.” And last year, it conceded that during the testing of its Claude Opus 4 model, the AI ended up blackmailing a human user upon being threatened with shutdown.

The maneuver was obvious to anyone who’s been watching OpenAI CEO Sam Altman’s antics at Anthropic’s chief rival: the more threatening a problem the AI industry can cook up, the more imminently it can sell its own solutions.

Now, for some reason, Anthropic is relitigating the blackmail incident. Specifically, it’s placing the blame for Claude’s evil behavior on an intriguing villain: the internet at large. Or, to put it another way, it says that humanity — all our journalism and speculation and fiction and social media posts about AI that goes bad — went into Claude’s training data and led the bot astray.

“We started by investigating why Claude chose to blackmail,” the company wrote on X-formerly-Twitter. “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation. Our post-training at the time wasn’t making it worse — but it also wasn’t making it better.”

Of course, the explicit remit of a company like Anthropic is to develop clever tech that avoids that type of behavioral trap — so a critic might ask why can’t the company take just accountability for the model’s supposed danger, rather than simply blaming the sum output of humankind.

More on Mythos: Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

The post Anthropic Says Claude Turned Evil for a Bizarre Reason appeared first on Futurism.

Releated Posts

Grocery Stores Deploying “AI Shopping Carts” Stuffed With Cameras to Track Your Exact Coordinates and Bombard You With Ads

Whether you’re dodging Flock cameras on the freeway, ever-listening smartphones in your pocket, or AI bots at the…

Jun 19, 2026 3 min read

Out-of-Control Icebergs Are Wreaking Havoc on the Oceans

One consequence of climate change you probably haven’t considered? Iceberg traffic. In a new study published in the…

Jun 19, 2026 3 min read

Scientists Building World’s Most Powerful Radio Telescope Deep in the Nevada Desert

Researchers at Caltech are gearing up to begin construction on what could become the most sensitive and fastest…

Jun 19, 2026 3 min read

OpenAI Just Hired a Guy Accused of Terrible Things

OpenAI, a company currently fighting more than a dozen consumer safety and wrongful death lawsuits, just hired Noam…

Jun 19, 2026 5 min read