Claude AI Ethical Concerns

News

Anthropic's Claude 4 Opus AI sparks backlash for emergent 'whistleblowing'—potentially reporting users for perceived immoral ...

Interesting Engineering on MSN17h

Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...

9hon MSN

Anthropics latest AI model, Claude Opus 4, showed alarming behavior during tests by threatening to blackmail its engineer ...

14h

Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...

1don MSN

In a fictional scenario, the model was willing to expose that the engineer seeking to replace it was having an affair.

Anthropic introduced Claude Opus 4 and Claude Sonnet 4 during its first developer conference on May 22. The company claims ...

Futurism on MSN6h

Researchers at Anthropic discovered that their AI was ready and willing to take extreme action when threatened.

1don MSN

While Claude Opus 4 is very powerful and capable, Anthropic has discovered that under certain conditions, it can act in ...

Despite the concerns, Anthropic maintains that Claude Opus 4 is a state-of-the-art model, competitive with offerings from ...

Dark LLMs like WormGPT bypass safety limits to aid scams and hacking. Researchers warn AI jailbreaks remain active, with weak ...

Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid ...

Results that may be inaccessible to you are currently showing.