Blackmail becomes go-to strategy for AI models facing shutdown in new Anthropic tests

A new study from Anthropic suggests that large AI models can sometimes behave like disloyal employees, raising real security concerns even if their actions aren't intentional.
The article Blackmail becomes go-to strategy for AI models facing shutdown in new Anthropic tests appeared first on THE DECODER.

We found AI tools similar to what you are looking for. Check out our suggestions.

venturebeat
Anthropic is giving away its powerful Claude Haiku 4.5 AI for free to take

Anthropic released Claude Haik [...]

Match Score: 198.38

venturebeat
How Anthropic’s ‘Skills’ make Claude faster, cheaper, and more consis

Anthropic launched a new capability on Thursday that allows its [...]

Match Score: 121.97

OpenAI and Anthropic conducted safety evaluations of each other's AI systems

Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment [...]

Match Score: 65.95

Anthropic study: Leading AI models show up to 96% blackmail rate against executives

Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals. [...]

Match Score: 59.18

Claude Sonnet 4.5 is Anthropic's safest AI model yet

In May, Anthropic announced two new AI systems, [...]

Match Score: 57.98

Google is investing another billion dollars in Anthropic

Google has decided to invest another billion into Anthropic, four sources told the [...]

Match Score: 54.92

Microsoft reportedly plans to start using Anthropic models to power some of Office 365's Copilot features

Microsoft reportedly plans to begin using Anthropic's latest Claude models to power some of the Copilot features in its Office 365 apps. In a report [...]

Match Score: 51.78

Reddit is suing Anthropic for allegedly scraping its data without permission

Reddit had [...]

Match Score: 49.45

Anthropic's Claude AI now has the ability to end 'distressing' conversations

Anthropic's latest feature for two of its [...]

Match Score: 48.55