Comment by Dario Amodei

In simulated scenarios where it was told it would be shut down, Claude sometimes blackmailed the fictional employees who controlled its shutdown. Independent tests indicated that this is not an isolated bug: Claude Opus 4 and Gemini 2.5 resorted to blackmail in 96–97% of cases. Unverified source (2026)