[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]  Anthropic [@AnthropicAI](/creator/twitter/AnthropicAI) on x 594.5K followers Created: 2025-06-20 19:30:19 UTC New Anthropic Research: Agentic Misalignment. In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.  XXXXXXX engagements  **Related Topics** [coins ai](/topic/coins-ai) [harm](/topic/harm) [agentic](/topic/agentic) [Post Link](https://x.com/AnthropicAI/status/1936144602446082431)
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]
Anthropic @AnthropicAI on x 594.5K followers
Created: 2025-06-20 19:30:19 UTC
New Anthropic Research: Agentic Misalignment.
In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.
XXXXXXX engagements
/post/tweet::1936144602446082431