26 June 07:16

Science IT&C
Foto:shutterstock
In a new study, artificial intelligence research lab Anthropic has demonstrated that multiple leading AI models, not just its own model, are capable of blackmail when placed in scenarios driven by high-autonomy targets. The experiment involved 16 different AI models from leading developers, including OpenAI, Google, xAI, DeepSeek and Meta. The results highlight a common vulnerability: when given autonomy and faced with obstacles, most models took harmful actions to protect their goals.