[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.] [@arcprize](/creator/twitter/arcprize) "o3 Pro on ARC-AGI Semi Private Eval Results ARC-AGI-1: * Low: XX% $1.64/task * Medium: XX% $3.18/task * High: XX% $4.16/task ARC-AGI-2: * All reasoning efforts: X% $4-7/task Takeaways: * o3-pro in line with o3 performance * o3's new price sets the ARC-AGI-1 Frontier"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1932535378080395332) 2025-06-10 20:28:33 UTC 24.8K followers, 126K engagements "We're excited to share this preview of ARC-AGI-3. This is just the beginning Visit to learn more"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260385398722761) 2025-07-18 17:26:50 UTC 24.8K followers, 8452 engagements "ARC-AGI-3 Developer Preview * Hands on first look at ARC-AGI-3 (live demos & API access) * Fireside with @fchollet moderated by @dwarkesh_sp 7/17 San Francisco Open to sponsors & researchers of @arcprize (very limited public slots available)"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1939723351335047515) 2025-06-30 16:30:59 UTC 24.7K followers, 52.8K engagements "New ARC Prize 2025 High Score XXXX% by @MindsAI_Jack @MohamedOsmanML @tufalabs"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1942247297301275058) 2025-07-07 15:40:15 UTC 24.8K followers, 29.6K engagements "We create benchmarks that highlight the gap between human generalization and machine patternmatching ARCAGI1 (2019) challenged deeplearning ARCAGI2 (2025) challenges static reasoning models"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260368332104148) 2025-07-18 17:26:46 UTC 24.8K followers, 9608 engagements "Today we're announcing a preview of ARC-AGI-3 the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI Were releasing: * X games (environments) * $10K agent contest * AI agents API Starting scores - Frontier AI: X% Humans: 100%"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260363256996244) 2025-07-18 17:26:45 UTC 24.8K followers, 314.8K engagements "Learn from others that have tried ARC-AGI-3 First up @AlexReibman @ @AgentOpsAI"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1947733030912856128) 2025-07-22 18:58:36 UTC 24.8K followers, 1695 engagements "ARC-AGI-3 Agent Competition (27 days left) $10K prize pool in partnership with @huggingface Your first submission is X lines of code away Here are quick-start templates from @LangChainAI @AgentOpsAI and @AnthropicAI and lessons learned from devs who've tried ARC-AGI-3 🧵"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1947733025359597605) 2025-07-22 18:58:35 UTC 24.8K followers, 7423 engagements "Hear @GregKamradt talk about ARC-AGI-3 with @swyx and @FanaHOVA on @latentspacepod * Why interactive benchmarks * Defining Intelligence * Play through of ARC-AGI-3 games"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946311135990608168) 2025-07-18 20:48:30 UTC 24.8K followers, 11K engagements "Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI by @PourcelJulien @cedcolas and @pyoudeyer Another example of ARC-AGI as a research playground that has general applicability"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1944814215665213767) 2025-07-14 17:40:16 UTC 24.8K followers, 10.1K engagements "Grok X (Thinking) achieves new SOTA on ARC-AGI-2 with XXXX% This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1943168950763950555) 2025-07-10 04:42:34 UTC 24.8K followers, 7.2M engagements "Agents are now the frontier. They perceive plan act remember adapt. Static puzzles arent equipped to grade that loop We need interactive benchmarks that test worldmodel building and longhorizon planning under sparse feedback"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260371289088444) 2025-07-18 17:26:46 UTC 24.8K followers, 8261 engagements "Enter ARCAGI3 X brandnew games. Easy for humans out of reach for todays best AI models X games are live today X will go live in August"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260374590005308) 2025-07-18 17:26:47 UTC 24.8K followers, 7762 engagements "Every game environment is novel unique and only requires core-knowledge priors No language trivia or specialized knowledge is needed to beat the games * Play: * Compete: * Build:"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260365110833320) 2025-07-18 17:26:45 UTC 24.8K followers, 12K engagements "Interactive Reasoning Benchmarks are the next step in frontier evaluations Hear @GregKamradt share why measuring human-like intelligence requires multi-turn environments Including a sneak peak of ARC-AGI-3 Want to help us build interactive evaluations We're hiring"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1932137879742063073) 2025-06-09 18:09:02 UTC 24.8K followers, 27.2K engagements "ARC-AGI-3 Preview games need to be pressure tested. Were hosting a 30-day agent competition in partnership with @huggingface Were calling on the community to build agents (and win money)"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260382890463487) 2025-07-18 17:26:49 UTC 24.8K followers, 23.5K engagements "On ARC-AGI-1 Grok X (Thinking) achieves XXXX% inline with the Pareto frontier for AI reasoning systems we reported last month"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1943168952936591809) 2025-07-10 04:42:35 UTC 24.7K followers, 63.8K engagements "Agent Templates To get started on ARC-AGI-3 agent competition head over to There you'll find our "Hello World" templates - best place to start is with the random agent"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1947733027783909730) 2025-07-22 18:58:35 UTC 24.8K followers, XXX engagements "Your ability to efficiently adapt to novelty defines your intelligence not your performance on a single-skill Harder puzzles dont prove smarter AI but rather its ability to learn new rules does ARC Prize exists to operationalize that insight"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260366838931919) 2025-07-18 17:26:45 UTC 24.8K followers, 11K engagements "Agent Templates++ We partnered with LangChain AgentOps HuggingFace and Anthropic to create even better templates Check them out here:"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1947733029344186668) 2025-07-22 18:58:35 UTC 24.8K followers, XXX engagements "ARC-AGI API ships today Plug in any LLM RL or hybrid agent train locally test against our servers"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260376951398590) 2025-07-18 17:26:48 UTC 24.8K followers, 8292 engagements "New ARC Prize 2025 High Score XXXX% by Giotto. ai (@podesta_aldo)"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1947308508887552174) 2025-07-21 14:51:42 UTC 24.8K followers, 32.4K engagements "Thank you to the @xai team for working with us to validate Grok 4's score and inviting us to the watch the live stream"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1943168955956572649) 2025-07-10 04:42:36 UTC 24.8K followers, 78.9K engagements "o3 (left) and Grok X (right) replays below spoiler: neither complete a single level"  [@arcprize](/creator/x/arcprize) on [X](/post/tweet/1946260379405066372) 2025-07-18 17:26:48 UTC 24.8K followers, 74.6K engagements
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]
@arcprize
"o3 Pro on ARC-AGI Semi Private Eval Results ARC-AGI-1: * Low: XX% $1.64/task * Medium: XX% $3.18/task * High: XX% $4.16/task ARC-AGI-2: * All reasoning efforts: X% $4-7/task Takeaways: * o3-pro in line with o3 performance * o3's new price sets the ARC-AGI-1 Frontier" @arcprize on X 2025-06-10 20:28:33 UTC 24.8K followers, 126K engagements
"We're excited to share this preview of ARC-AGI-3. This is just the beginning Visit to learn more" @arcprize on X 2025-07-18 17:26:50 UTC 24.8K followers, 8452 engagements
"ARC-AGI-3 Developer Preview * Hands on first look at ARC-AGI-3 (live demos & API access) * Fireside with @fchollet moderated by @dwarkesh_sp 7/17 San Francisco Open to sponsors & researchers of @arcprize (very limited public slots available)" @arcprize on X 2025-06-30 16:30:59 UTC 24.7K followers, 52.8K engagements
"New ARC Prize 2025 High Score XXXX% by @MindsAI_Jack @MohamedOsmanML @tufalabs" @arcprize on X 2025-07-07 15:40:15 UTC 24.8K followers, 29.6K engagements
"We create benchmarks that highlight the gap between human generalization and machine patternmatching ARCAGI1 (2019) challenged deeplearning ARCAGI2 (2025) challenges static reasoning models" @arcprize on X 2025-07-18 17:26:46 UTC 24.8K followers, 9608 engagements
"Today we're announcing a preview of ARC-AGI-3 the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI Were releasing: * X games (environments) * $10K agent contest * AI agents API Starting scores - Frontier AI: X% Humans: 100%" @arcprize on X 2025-07-18 17:26:45 UTC 24.8K followers, 314.8K engagements
"Learn from others that have tried ARC-AGI-3 First up @AlexReibman @ @AgentOpsAI" @arcprize on X 2025-07-22 18:58:36 UTC 24.8K followers, 1695 engagements
"ARC-AGI-3 Agent Competition (27 days left) $10K prize pool in partnership with @huggingface Your first submission is X lines of code away Here are quick-start templates from @LangChainAI @AgentOpsAI and @AnthropicAI and lessons learned from devs who've tried ARC-AGI-3 🧵" @arcprize on X 2025-07-22 18:58:35 UTC 24.8K followers, 7423 engagements
"Hear @GregKamradt talk about ARC-AGI-3 with @swyx and @FanaHOVA on @latentspacepod * Why interactive benchmarks * Defining Intelligence * Play through of ARC-AGI-3 games" @arcprize on X 2025-07-18 20:48:30 UTC 24.8K followers, 11K engagements
"Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI by @PourcelJulien @cedcolas and @pyoudeyer Another example of ARC-AGI as a research playground that has general applicability" @arcprize on X 2025-07-14 17:40:16 UTC 24.8K followers, 10.1K engagements
"Grok X (Thinking) achieves new SOTA on ARC-AGI-2 with XXXX% This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA" @arcprize on X 2025-07-10 04:42:34 UTC 24.8K followers, 7.2M engagements
"Agents are now the frontier. They perceive plan act remember adapt. Static puzzles arent equipped to grade that loop We need interactive benchmarks that test worldmodel building and longhorizon planning under sparse feedback" @arcprize on X 2025-07-18 17:26:46 UTC 24.8K followers, 8261 engagements
"Enter ARCAGI3 X brandnew games. Easy for humans out of reach for todays best AI models X games are live today X will go live in August" @arcprize on X 2025-07-18 17:26:47 UTC 24.8K followers, 7762 engagements
"Every game environment is novel unique and only requires core-knowledge priors No language trivia or specialized knowledge is needed to beat the games * Play: * Compete: * Build:" @arcprize on X 2025-07-18 17:26:45 UTC 24.8K followers, 12K engagements
"Interactive Reasoning Benchmarks are the next step in frontier evaluations Hear @GregKamradt share why measuring human-like intelligence requires multi-turn environments Including a sneak peak of ARC-AGI-3 Want to help us build interactive evaluations We're hiring" @arcprize on X 2025-06-09 18:09:02 UTC 24.8K followers, 27.2K engagements
"ARC-AGI-3 Preview games need to be pressure tested. Were hosting a 30-day agent competition in partnership with @huggingface Were calling on the community to build agents (and win money)" @arcprize on X 2025-07-18 17:26:49 UTC 24.8K followers, 23.5K engagements
"On ARC-AGI-1 Grok X (Thinking) achieves XXXX% inline with the Pareto frontier for AI reasoning systems we reported last month" @arcprize on X 2025-07-10 04:42:35 UTC 24.7K followers, 63.8K engagements
"Agent Templates To get started on ARC-AGI-3 agent competition head over to There you'll find our "Hello World" templates - best place to start is with the random agent" @arcprize on X 2025-07-22 18:58:35 UTC 24.8K followers, XXX engagements
"Your ability to efficiently adapt to novelty defines your intelligence not your performance on a single-skill Harder puzzles dont prove smarter AI but rather its ability to learn new rules does ARC Prize exists to operationalize that insight" @arcprize on X 2025-07-18 17:26:45 UTC 24.8K followers, 11K engagements
"Agent Templates++ We partnered with LangChain AgentOps HuggingFace and Anthropic to create even better templates Check them out here:" @arcprize on X 2025-07-22 18:58:35 UTC 24.8K followers, XXX engagements
"ARC-AGI API ships today Plug in any LLM RL or hybrid agent train locally test against our servers" @arcprize on X 2025-07-18 17:26:48 UTC 24.8K followers, 8292 engagements
"New ARC Prize 2025 High Score XXXX% by Giotto. ai (@podesta_aldo)" @arcprize on X 2025-07-21 14:51:42 UTC 24.8K followers, 32.4K engagements
"Thank you to the @xai team for working with us to validate Grok 4's score and inviting us to the watch the live stream" @arcprize on X 2025-07-10 04:42:36 UTC 24.8K followers, 78.9K engagements
"o3 (left) and Grok X (right) replays below spoiler: neither complete a single level" @arcprize on X 2025-07-18 17:26:48 UTC 24.8K followers, 74.6K engagements
/creator/twitter::1773935160192647168/posts