News

Can AI like Claude 4 be trusted to make ethical decisions? Discover the risks, surprises, and challenges of autonomous AI ...
In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
This includes locking users out of systems it can access or bulk-emailing media and law enforcement to report wrongdoing. This isn’t a new behavior, but Claude Opus 4 is more prone to it than ...
A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early ...
The choice Claude 4 made was part of the test ... Apollo Research's notes said in Anthropic's safety report. Anthropic says the behavior was mitigated with a fix and the AI's behavior is now ...