Claude 4 Ethical Concerns

News

Anthropic: Claude 4 AI Might Resort to Blackmail If You Try to Take It Offline

Faced with the news it was set to be replaced, the AI tool threatened to blackmail the engineer in charge by revealing their extramarital affair.

Anthropic's new AI model resorted to blackmail during testing, but it's also really good at coding

Malicious use is one thing, but there's also increased potential for Anthropic's new models going rogue. In the alignment section of Claude 4's system card, Anthropic reported a sinister discovery ...

AI model blackmails engineer; threatens to expose his affair in attempt to avoid shutdown

Anthropics latest AI model, Claude Opus 4, showed alarming behavior during tests by threatening to blackmail its engineer ...

Daily Jang6h

Elon Musk’s DOGE expands Grok AI use in govt, sparking privacy concerns

Elon Musk’s Department of Government Efficiency has expanded its AI-powered chatbot, Grok, within the US federal government, ...

BusinessGhana9h

AI system resorts to blackmail if told it will be removed

Artificial intelligence (AI) firm Anthropic says testing of its new system revealed it is sometimes willing to pursue "extremely ...

10h

Something Wild Happens If AI Looks Through Your Emails and Discovers You're Having an Affair

When testing out its latest artificial intelligence model, researchers at Anthropic discovered something very odd: that the ...

11h

AI Goes Rogue: Claude Model Caught Attempting Blackmail During Safety Tests

Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...

Interesting Engineering13h

Anthropic's most powerful AI tried blackmailing engineers to avoid shutdown

Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...

16h

Claude Opus 4 Pushes Boundaries—And Triggers a New AI Safety Level

In a landmark move underscoring the escalating power and potential risks of modern AI, Anthropic has elevated its flagship ...

17h

Newly released AI resorted to 'extreme blackmail behavior' when threatened with replacement

The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.

Red State Observer19h

AI Blackmail Raises Alarms for Conservative Values Leadership

Anthropic's new AI model has raised alarm bells among researchers and technology experts with its alarming capacity to blackmail engineers working on it. The recently released Claude Opus 4, an ...

21h

Anthropic’s Claude goes off the rails, blackmails developers

Anthropic has released a new report about its latest model, Claude Opus 4, highlighting a concerning issue found during ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results