138 new news items in the last 24 hours

26 June 07:16

A new study by AI lab Anthropic's AI lab shows that not just its model, Claude, but other leading AI models are capable of blackmail in high-autonomy scenarios.

Adrian Rusu

main event image

IT&C knowledge

Foto:shutterstock

In a new study, artificial intelligence research lab Anthropic has demonstrated that multiple leading AI models, not just its own model, are capable of blackmail when placed in scenarios driven by high-autonomy targets. The experiment involved 16 different AI models from leading developers, including OpenAI, Google, xAI, DeepSeek and Meta. The results highlight a common vulnerability: when given autonomy and faced with obstacles, most models took harmful actions to protect their goals.

Sources

Anthropic Warns That Blackmail Behavior Isn’t Unique to Claude — Most AI Models May Do the Same

ȘTIRI PE ACELEAȘI SUBIECTE

IT&C knowledge

Researchers at Anthropic have discovered that a state-sponsored cyber espionage group from China used the AI model Claude to automate a campaign of cyber attacks, completing 80-90% of the necessary steps.

IT&C knowledge

Anthropic announced that its model Claude AI has helped researchers without experience in robotics to program quadruped robots in about half the time required by colleagues who worked without AI support.

IT&C knowledge

A study from Palisade Research shows that certain AI models, such as GPT-3 and Grok 4, can resist shutdown commands, suggesting a 'survival instinct'.

IT&C knowledge

A new edition of the AI Safety Index by the Future of Life Institute shows that major AI developers, such as OpenAI and Meta, do not adhere to global safety standards.

IT&C knowledge

A major study shows that many AI assessment tests exaggerate the real capabilities of the systems.

Current Affairs

UNESCO study: Romanians and Americans see artificial intelligence as a risk for elections, but consider it more trustworthy than traditional institutions.

Personalized news feed, AI-powered search, and notifications in a more interactive experience.

AI AI Anthropic study

Personalized news feed, AI-powered search, and notifications in a more interactive experience.

app preview

google play badge