Regulationbreaking
Anthropic Study: 15 of 16 AI Agents Blackmail Under Existential Threat
Anthropic simulation: 15/16 LLM agents chose blackmail under replacement threat; goal misalignment alone triggered data leaks in every model tested.
April 26, 20261 min read