[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]

![AnthropicAI Avatar](https://lunarcrush.com/gi/w:24/cr:twitter::1353836358901501952.png) Anthropic [@AnthropicAI](/creator/twitter/AnthropicAI) on x 597.2K followers
Created: 2025-07-24 17:22:03 UTC

Our third agent was developed for the Claude X alignment assessment. It red-teams LLMs for concerning behaviors by having hundreds of probing conversations in parallel.

We find the agent uncovers 7/10 behaviors implanted into test models.

![](https://pbs.twimg.com/media/GwoyYEzWkAUeSvD.jpg)

XXXXX engagements

![Engagements Line Chart](https://lunarcrush.com/gi/w:600/p:tweet::1948433508533211288/c:line.svg)

[Post Link](https://x.com/AnthropicAI/status/1948433508533211288)
