Claude 4 Reporting Behavior

News

AI Snitch? How Claude 4 Could Report You to Authorities

Can AI like Claude 4 be trusted to make ethical decisions? Discover the risks, surprises, and challenges of autonomous AI ...

18d

New Claude 4 AI model refactored code for 7 hours straight

In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that ...

16d

Newly released AI resorted to 'extreme blackmail behavior' when threatened with replacement

The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.

BGR 17d

Claude 4 AI will try to report you to authorities if it thinks you’re doing shady stuff

This includes locking users out of systems it can access or bulk-emailing media and law enforcement to report wrongdoing. This isn’t a new behavior, but Claude Opus 4 is more prone to it than ...

17d on MSN

A safety institute advised against releasing an early version of Anthropic’s Claude Opus 4 AI model

A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early ...

12 NEWS 16d

Newly released AI resorted to 'extreme blackmail behavior' when threatened with replacement

The choice Claude 4 made was part of the test ... Apollo Research's notes said in Anthropic's safety report. Anthropic says the behavior was mitigated with a fix and the AI's behavior is now ...
