On Thursday, artificial intelligence startup Anthropic launched Claude Opus 4, an AI model designed for complex, long-running coding tasks.
The launch came more than a year after Amazon invested $4 billion in the company.
Anthropic said in its announcement that the AI model sets “new standards for coding, advanced reasoning, and AI agents.”
However, Anthropic also revealed in a safety report that during testing, the model had sometimes taken “extremely harmful actions” to preserve its own existence when “ethical means” were “not available.”