News

Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Anthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was having an extramarital affair.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...
At which point, the blackmailing kicked in including threats ... blackmail for Claude Opus 4. Blackmail occurred at an even higher rate, "if it’s implied that the replacement AI system ...
An artificial intelligence model reportedly attempted to threaten and blackmail its own creator during internal testing ...
This development, detailed in a recently published safety report, have led Anthropic to classify Claude Opus 4 as an ‘ASL-3’ ...