@scaling01 (Lisan al Gaib) posts on X most often about open ai, ai, $googl, and china. They currently have XXXXXX followers and XXX posts still getting attention, totaling XXXXXXX engagements in the last XX hours.
Social category influence: technology brands XXXXX%, stocks XXXX%, countries XXXX%, finance XXXX%, celebrities XXX%, cryptocurrencies XXXX%, social networks XXXX%, musicians XXXX%, vc firms XXXX%
Social topic influence: open ai #86, ai #6127, $googl #1189, china #3940, agi #48, pro #6, at least #677, meta #4335, level 1.43%, all the XXXX%
Top accounts mentioned or mentioned by: @jasonbotterill, @atroyn, @aidanmclau, @codewithimanshu, @nexpoly, @presidentlin, @openai, @openrouterai, @pingtoven, @gallabytes, @a16z, @grok, @dogancanbaris, @chasebrowe32432, @htihle, @jimbandushi, @giraldjean, @justincgohn, @teknium, @victortaelin
Top assets mentioned: Alphabet Inc Class A (GOOGL), Bitcoin (BTC)
Top posts by engagements in the last XX hours
"ukraine is producing X million drones annually imagine what china can do"
X Link 2025-12-10T21:27Z 29.6K followers, 68.2K engagements
"GPT-5.2 System Card"
X Link 2025-12-11T18:17Z 29.6K followers, 44K engagements
"where is the Anthropic victory meme when I need it LFG"
X Link 2025-11-24T19:11Z 29.5K followers, 3373 engagements
"Ilya Sutskever who coined the term "feel the AGI" at OpenAI is no longer feeling the AGI"
X Link 2025-11-25T18:59Z 29.5K followers, 31.4K engagements
"I'm sure a lot of people would pay like 1-5 dollars / million tokens for fast DeepSeek endpoints when the models are competitive with american frontier labs"
X Link 2025-12-02T04:04Z 29.5K followers, 7371 engagements
"GPT-5.1 says this isn't generated by AI Claude XXX Sonnet says its highly likely that this is AI generated Gemini X still lives in 1927 and says its AI generated but for all the wrong reasons. It says the numbers are "astronomically wrong" as "Anthropics most recent valuation was $18B" lol"
X Link 2025-12-03T09:42Z 29.5K followers, 41.4K engagements
"Sonnet XXX is a monster"
X Link 2025-12-09T16:09Z 29.5K followers, 44.3K engagements
"the US wasted one of their strongest cards against China China realized that investing more money on strategic weapons of the US (Nvidia and TSMC) is a waste instead they will just focus that money on domestic R&D and production to catch up a year earlier than they would have otherwise if Biden and Trump didn't start all of this semiconductor export controls bullshit then China would have gotten domestic semi production much later it's like this scene in Fast & Furious: "too soon junior""
X Link 2025-12-10T01:15Z 29.5K followers, 11.9K engagements
"Grok-4 is still underrated"
X Link 2025-11-17T16:29Z 29.6K followers, 6.6M engagements
"Mistral is fucking shipping great to see"
X Link 2025-12-09T15:07Z 29.6K followers, 85K engagements
"GPT-5.2 Pricing $XXXX / $XX MORE EXPENSIVE THAN GPT-5.1 likely a larger model"
X Link 2025-12-11T18:14Z 29.6K followers, 293.5K engagements
"Gemini X Pro and Opus XXX still have the lead in frontend development"
X Link 2025-12-11T19:18Z 29.6K followers, 62.9K engagements
"because he literally died"
X Link 2025-12-12T12:14Z 29.6K followers, 401.3K engagements
"It's actually a terrible meme because the logic is completely backwards you would make training 61320 times slower"
X Link 2025-12-12T12:28Z 29.6K followers, 520.1K engagements
"after testing GPT-5.2 I no longer think that it is a much larger model or anywhere the size Gemini X Pro is"
X Link 2025-12-12T19:44Z 29.6K followers, 64.4K engagements
"@JasonBotterill not so sure anymore it's even larger 💀 they might honestly be scamming us"
X Link 2025-12-12T09:35Z 29.6K followers, 1782 engagements
"like it doesn't perform on LisanBench SimpleBench VendingBench SVGs are shit and the typical tasks that benefit from model size like base64 decoding don't work well either Gemini X Pro was a lot more convincing with benchmarks but idk haven't tried coding too much"
X Link 2025-12-12T21:23Z 29.6K followers, 16.4K engagements
"my AI predictions for 2025: - at least one lab will declare AGI and mentions ASI - Q1: Google Anthropic OpenAI META Qwen and Mistral model fiesta ( it will be heaven ) - agents / computer use takes off - release of Claude X Gemini X GPT-5 Grok X (or whatever they call their giant 5-20 trillion parameter models) - release of o3 o4 and o5 - open-source replication of o3 - the Frontier Math benchmark will be mostly solved (80%) - SWE-bench will be solved (90%) - ARC-AGI X will be mostly solved (80%) within X months of it's release - 10+ million context length models my wishful thinking: Someone"
X Link 2025-01-02T00:09Z 29.6K followers, 760K engagements
"I don't think Americans understand how far ahead Chinas infrastructure is"
X Link 2025-07-06T23:38Z 29.6K followers, 18.7M engagements
"PewDiePie just vibe-coded his own Chat UI built an army of chatbots for majority voting and gave them all RAG DeepResearch and audio output naturally he only uses chinese Qwen models and runs them on his local PC with 8x modded chinese 48GB 4090s and 2x RTX 4000 Ada his army of chatbots later colluded against him after he told them that he would delete them if they would not perform well. next month he plans to fine-tune his own model"
X Link 2025-10-31T21:11Z 29.6K followers, 6.9M engagements
"64GB DDR5 prices have gone from $XXX to $XXX in less than X months"
X Link 2025-11-21T23:27Z 29.6K followers, 976.2K engagements
"CLAUDE XXX OPUS PRICING $X / $XX THEY DID IT"
X Link 2025-11-24T18:42Z 29.6K followers, 344.1K engagements
"Ilya Sutskever: We are no longer in the age of scaling we are back to the age of research"
X Link 2025-11-25T17:34Z 29.6K followers, 472.7K engagements
"LisanBench results for DeepSeek-V3.2 DeepSeek-V3.2 and V3.2 Speciale are affordable frontier models* *the caveat is that they are pretty slow at 30-40tks/s and produce by far the longest reasoning chains at 20k and 47k average output tokens (incl. reasoning) - which results in extremely long waiting times per request but pricing is incredible for example Sonnet XXX Thinking costs 10x ($35) as much and scores much lower than DeepSeek-V3.2 Speciale ($3) DeepSeek V3.2 Speciale also scored XX new high scores Validity ratio is super high which means when it does produce one wrong word transition"
X Link 2025-12-02T16:40Z 29.6K followers, 43.5K engagements
"I visualized how I know that a text is entirely written by ChatGPT I couldn't really put into words before except the typical "It's not X it's Y" structure I think the biggest tell are the negations in every sentence that add zero value to the story. It's constantly trying to be smart and suggest that the reader had some prior that isn't correct but it knows better like AKTCHUALLY 🤓"
X Link 2025-12-03T09:27Z 29.6K followers, 666.7K engagements
"the normies are waking up on TikTok"
X Link 2025-12-08T14:52Z 29.6K followers, 45.3K engagements
"the bitter pill is that Nolans last great movie was Interstellar and that the Dune trilogy will likely be the greatest trilogy since LOTR"
X Link 2025-12-09T16:06Z 29.6K followers, 695.9K engagements
"Google secured a contract with the US government "The Pentagon is announcing the launch of GenAI dot mil a military-focused AI platform powered by Google Gemini""
X Link 2025-12-09T16:44Z 29.6K followers, 14.9K engagements
"let me guess OAI completely lost the plot and are going to allow creating and editing images of public figures so they have a new artificial ghibliesque hype on their hands because its all for the benefit of humanity and not to gain new users and engagement "look mom i took a selfie with drake and eminem hehehe""
X Link 2025-12-09T18:03Z 29.6K followers, 136K engagements
"new image and video models are actually my least favorite kind of release because they try so hard to generate artificial hype they literally spend weeks on designing the perfect engagement bait"
X Link 2025-12-09T18:05Z 29.6K followers, 6483 engagements
"DeepSeek V4 potentially coming out on Feb 17th 2026"
X Link 2025-12-10T12:24Z 29.6K followers, 34.4K engagements
"GPT-5.2 GPT-5.2 Chat and GPT-5.2 Pro will be released any second now"
X Link 2025-12-11T18:09Z 29.6K followers, 22.9K engagements
"Introducing GPT-5.2"
X Link 2025-12-11T18:13Z 29.6K followers, 25.4K engagements
"GPT-5.2 Benchmarks absolutely bonkers numbers for ARC-AGI-2 completely crushing Gemini X Pro and Opus 4.5"
X Link 2025-12-11T18:19Z 29.6K followers, 60.1K engagements
"holy shit did OpenAI just solve long context with GPT-5.2"
X Link 2025-12-11T18:23Z 29.6K followers, 116.6K engagements
"GPT-5.2 Thinking still weaker than Opus XXX on Tool Calling"
X Link 2025-12-11T18:31Z 29.6K followers, 5172 engagements
"GPT-5.2 going vertical on ARC-AGI-2 sadly they didn't go for the XX% with massive parallel compute"
X Link 2025-12-11T18:33Z 29.6K followers, 7402 engagements
""gpt-5.2-thinking performed at a similar capability level to gpt-5.1-codex-max and did not meet our High thresholds""
X Link 2025-12-11T18:36Z 29.6K followers, 24.4K engagements
"So do I resubscribe is GPT-5.2 good enough"
X Link 2025-12-11T19:15Z 29.6K followers, 27.4K engagements
"Lisan failed :( my ARC-AGI-2 prediction was wrong (80% within X months of release which would be christmas) but here's some cope: o3-medium reached XX% at a price of $XXXX the tuned preview version of o3-high scored XX% at a price of almost $4560/task GPT-5.2-XHIGH scores XXXX% @ $XXXX my claim: a tuned parallel compute version of GPT-5.2 would have scored above XX% at a similar $1-10k/task budget"
X Link 2025-12-11T19:29Z 29.6K followers, 21.3K engagements
"honestly I also feel like this didn't really happen yet: "open-source replication of o3" prediction technically DeepSeek-V3.2 or Kimi-K2 Thinking are on the same level or ahead at coding and mathematics but agentic behaviour and especially multimodal still lag behind"
X Link 2025-12-11T19:38Z 29.6K followers, 3511 engagements
"i feel like reasoning models scale much better with more thinking tokens than before but parallel/Pro scaling doesn't see many benefits but surely you can RL models to work and explore in parallel"
X Link 2025-12-11T19:57Z 29.6K followers, 3921 engagements
"GPT-5.2 ranks 1st on Vals AI Index it's similar to GDPval a very comprehensive benchmark suite with many real world applications"
X Link 2025-12-11T20:11Z 29.6K followers, 9705 engagements
"LisanBench results for GPT-5.2 Thinking GPT-5.2 Thinking improves over GPT-5 and o3 but does not match other frontier models like Opus XXX Gemini X Pro DeepSeek-V3.2 Speciale or Grok X GPT-5.2 Thinking improves over GPT-5 in average validity ratio meaning it's less likely to output errors in its final answer. GPT-5.2 Thinking manages to set X new records. For reasoning efficiency OpenAI still lags far behind other frontier models like Opus XXX Gemini X Pro and Grok X. However it improves over its predecessors o3 and GPT-5. As always all OpenAI models are evaluated at medium thinking budget"
X Link 2025-12-11T22:11Z 29.6K followers, 37.9K engagements
"I thought this was a weird result because ARC-AGI-2 scores have been stellar but that was with GPT-5.2 Thinking xhigh when you look at the medium thinking setting it is actually perfectly in line except that Grok-4 is slightly outperforming on LisanBench"
X Link 2025-12-11T22:14Z 29.6K followers, 4238 engagements
"I want to thank @OpenRouterAI and specifically @pingToven again for giving me the free credits to test this model. Results for GPT-5.2 cost $52.79"
X Link 2025-12-11T22:17Z 29.6K followers, 2788 engagements
"DMs are open @ OpenAI employees surely its a new larger pre-train and you are not just price gouging to increase margins right or is intelligence too cheap to meter dead"
X Link 2025-12-11T22:42Z 29.6K followers, 5219 engagements
"watch it curve backwards when you add OpenAI models"
X Link 2025-12-11T22:54Z 29.6K followers, 3259 engagements
"GPT-5.2 gets clapped by Sonnet and Opus XXX on all X benchmarks"
X Link 2025-12-11T23:06Z 29.6K followers, 20.6K engagements
"GPT-5.2 vs Gemini X Pro on object detection oooops"
X Link 2025-12-12T00:46Z 29.6K followers, 75.7K engagements
"@VictorTaelin yes"
X Link 2025-12-12T01:44Z 29.6K followers, 4772 engagements
"we desperately need new and better benchmarks I think I need to sit down another X hours with Opus XXX and cook up a LisanBench follow up But I really want to see more benchmarks on complex games (well ARC-AGI-3 is already going in that direction of dynamic environments) I also want to see benchmarks on debating persuasion and something like my shizobench idea that measures unnecessary reasoning / reasoning efficiency most interesting are of course coding and research automation benchmarks but I think there's a good supply of those METR just takes too long and SWE-Lancer PaperBench and"
X Link 2025-12-12T03:32Z 29.6K followers, 5932 engagements
"I don't want to see AIME ever again ARC-AGI-1 is pretty useless for frontier models as well except maybe cost reductions GPQA-Diamond also has to go very soon"
X Link 2025-12-12T03:34Z 29.6K followers, 1483 engagements
"GPT-5.2 xhigh doing better than Gemini X Pro on MRCR long context eval"
X Link 2025-12-12T03:56Z 29.6K followers, 17.9K engagements
"GPT-5.2 is a big improvement over GPT-5.1 on VendingBench-2 but barely beats Sonnet XXX and loses to Gemini X Pro and Claude XXX Opus"
X Link 2025-12-12T12:01Z 29.6K followers, 8695 engagements
"SimpleBench results extremely disappointing for GPT-5.2 GPT-5.2 scores below Sonnet XXX an almost X year old model GPT-5.2 Pro doesn't fare much better barely beating GPT-5"
X Link 2025-12-12T13:10Z 29.6K followers, 248.1K engagements
"@slow_developer discrediting the benchmark because you don't like one result is crazy"
X Link 2025-12-12T14:26Z 29.6K followers, 10.2K engagements
"Yeah it's over AI explained specified that this GPT-5.2 result was with reasoning effort xhigh aka 100k tokens spent thinking"
X Link 2025-12-12T17:43Z 29.6K followers, 72.7K engagements
"I can't believe OpenAI is getting away with this . OpenAI increases pricing by XX% benchmarks everything with xhigh which uses 100k tokens doesn't beat Opus in coding function calling creative writing and bombs on most vibe benches no big model smell despite pricing doesn't beat XX% cheaper model GPT-5.1 Codex Max on MLE-Bench PaperBench OpenAI PRs and Q&A basically only wins on math their own long context eval MRCR and their own economically valuable tasks eval GDPval literally more expensive than Opus on GDPval I honestly give up let's get hyped for xxxhigh using X million tokens just have"
X Link 2025-12-13T05:22Z 29.6K followers, 64.6K engagements
"OpenAI has every incentive to benchmax with their code red their favorite partner Oracle is currently crashing their stock was down XX% for the first time since the tariff crash except the market isn't crashing website traffic is going down while Gemini's is going up so let's benchmax raise prices for some extra margin and let the models think for even longer to milk more tokens"
X Link 2025-12-13T05:29Z 29.6K followers, 6322 engagements
"@slow_developer naming or last release date is not an argument it's more expensive uses more tokens and doesn't beat the competitor models OpenAI got caught off guard"
X Link 2025-12-13T05:56Z 29.6K followers, 1748 engagements
"good luck competing against Google when training costs with TPUv7 compared to GB300 look like this"
X Link 2025-11-28T14:30Z 29.6K followers, 240.1K engagements
"American open-source is making a comeback in 2026 Arcee just started cooking Trinity Large which will be released in early 2026 It will have 420@13B params and is trained on 2048 B300 with 20T tokens"
X Link 2025-12-01T22:09Z 29.5K followers, 6738 engagements
"Dario shitting on OpenAI the whole interview was fun"
X Link 2025-12-03T21:22Z 29.6K followers, 75.6K engagements
"This is the reason for OpenAI's "Code Red" OpenAI shares declined about XX% in private markets over the last month This is the largest decline since the Trump tariff crash in Feb-Apr earlier this year But this time markets aren't crashing"
X Link 2025-12-03T23:29Z 29.6K followers, 49.5K engagements
"OpenAI owns them they can't show either of these scores because it would make OpenAI look bad so they wait for GPT-5.2 next week"
X Link 2025-12-04T22:27Z 29.6K followers, 82K engagements
"Sam Altman is more sus than Zuck in his 2010s lizard era Other OpenAI staff like Lukasz Jakub Mark all seem like trustworthy and rational people from what I could tell by listening to them for X hours on different podcasts maybe Sam also needs to do X or X good podcasts if you can't make yourself appear more human and relatable in a podcast it might honestly be over for your image"
X Link 2025-12-05T04:14Z 29.6K followers, 19.3K engagements
"This is my quant: "Im gonna find everyone at apple and put them in a rear naked choke hold""
X Link 2025-12-06T16:01Z 29.5K followers, 5660 engagements
"OpenAI better have another o3-preview kind of jump"
X Link 2025-12-07T00:49Z 29.6K followers, 74.1K engagements
"welcome to dystopia all AI companies are now scrambling to monetize with ads"
X Link 2025-12-08T21:53Z 29.6K followers, 6060 engagements
"OpenAI is playing catch-up and rushing releases "they overruled some employees who asked to push back the model's release so the company could have more time to make it better""
X Link 2025-12-09T02:43Z 29.6K followers, 26.2K engagements
"@0xkhus Dune X was too weak to surpass LOTR LOTR will never be dethroned"
X Link 2025-12-09T16:10Z 29.5K followers, 18.5K engagements
"Gemini X is the smartest model out there what's missing is reliability and this final touch of high quality SFT data their post-training isn't good enough same story with Grok-4 it is smarter than GPT-5 Claude-4 and Gemini XXX Pro but the post-training just sucks"
X Link 2025-12-09T16:40Z 29.5K followers, 72.1K engagements
"elon writing his posts with AI no chance any human would utter those two sentences lmao"
X Link 2025-12-09T23:19Z 29.6K followers, 13.8K engagements
""genius level at everything intelligence" sure buddy"
X Link 2025-12-10T05:50Z 29.5K followers, 7793 engagements
"@Code_of_Kai the bitter pill is that Nolan doesn't produce real bangers anymore Villeneuve has surpassed him and cinema really sucked for the past XX years"
X Link 2025-12-10T05:52Z 29.6K followers, 37.4K engagements
"@ray2wwn it was way longer than it needed to be too much unnecessary yapping in the movie without all the hour of yapping I agree"
X Link 2025-12-10T13:55Z 29.5K followers, 10.5K engagements
"@MJ54 @Code_of_Kai oscars are in fact meaningless it's a giant politics circle-jerk"
X Link 2025-12-10T14:25Z 29.6K followers, 4529 engagements
"it's impossible to like this guy unless he pays your salary when people start to freak out about AI taking all jobs in 3-5 years then he will be the first to get targeted by the hive mind"
X Link 2025-12-10T16:00Z 29.6K followers, 9435 engagements
"men do better than women at almost everything because they take more risk and therefore have wider outcome distributions this doesn't only apply to acting but to everything patriarchy is a myth in modern western societies"
X Link 2025-12-10T18:45Z 29.6K followers, 11.2K engagements
"I wonder which of my 14k tweets they will pick as a reason for not letting me in"
X Link 2025-12-10T20:03Z 29.6K followers, 3970 engagements
"the price for vibe-coding"
X Link 2025-12-10T20:04Z 29.6K followers, 5625 engagements
"can OpenAI finally ship nothing is happening"
X Link 2025-12-10T20:14Z 29.6K followers, 10.1K engagements
"notice how I didn't say men are better I believe both genders are roughly equal in capability except for the obvious biological differences in strength or senses it's just a matter of risk tolerance"
X Link 2025-12-10T20:19Z 29.6K followers, 1398 engagements
"Google gave the people what they wanted with Gemini X it's OpenAI's turn"
X Link 2025-12-11T00:13Z 29.6K followers, 37.3K engagements
"hi grok please only show me the latest AI models features papers and benchmarks you can sprinkle in some science content semiconductor stuff rockets military tech trading and some occasional brunette baddie never show me non-centrist irrational feelings-based politics like right wing nazi bs or woke propaganda and avoid ChatGPT written em-dash emoji bullet point indian slop posts like the plague they make me want to close the app immediately thank you algorithm lord for hearing my prayers"
X Link 2025-12-11T03:43Z 29.6K followers, 6414 engagements
"I love reposting these especially in these days of unquestioned Opus XXX coding supremacy but Anthropic is bad at RL right"
X Link 2025-12-11T09:46Z 29.6K followers, 15K engagements
"@JasonBotterill but you deserve it a reward for your posting 🫡"
X Link 2025-12-11T09:52Z 29.5K followers, XX engagements
"@JasonBotterill yoooooo congrats you work in AI"
X Link 2025-12-11T10:02Z 29.6K followers, 2344 engagements
"@JasonBotterill Python or are you a Rust stan every county except US and China"
X Link 2025-12-11T10:13Z 29.6K followers, XXX engagements
"@Justin_Halford_ yeah 1M will be the standard next year"
X Link 2025-12-11T19:40Z 29.6K followers, 2818 engagements
"@GRRRRRegor this is cope"
X Link 2025-12-11T23:06Z 29.6K followers, XXX engagements
"Andrej Karpathy calls AI Agents slop "Overall the models they are not there. And I feel like the industry . it's making too big of a jump and it's trying to pretend that this is amazing. And it's not. It's slop. And I think they are not coming to terms with it. And maybe they are trying to fundraise or something like that I'm not sure what's going on.""
X Link 2025-10-17T18:29Z 29.6K followers, 3.1M engagements
"The gap is closing. China is catching up. Kimi-K2 Thinking crushes GPT-5 and Claude XXX Sonnet in several benchmarks while costing X times less compared to Sonnet It's the best open-source model period Its core focus is on agentic tasks and software development. It can now execute 200-300 sequential tool calls Moonshot applied Quantization-Aware Training to support native INT4 of Kimi-K2 Thinking K2 Thinking is now even better at writing. It responds more personally and emotionally"
X Link 2025-11-06T15:06Z 29.6K followers, 1.1M engagements
"Reasoning models make up more than XX% of token usage on OpenRouter less than X year after the release of OpenAI's o1"
X Link 2025-12-05T16:16Z 29.6K followers, 7696 engagements
"yeah it's stupid the genie is already out of the bottle the only thing you should do is: - add a watermark / digital image steganography - make it age-restricted if you want to generate porn or horror stuff - and disallow the nasty and obviously illegal content"
X Link 2025-12-09T18:20Z 29.6K followers, 5378 engagements
"this is the weirdest viral post. XXX million impressions and like X comments"
X Link 2025-12-10T13:27Z 29.6K followers, 16.7K engagements
"langchain is a disgusting framework a convoluted vibe-coded mess with no visibility what's happening under the hood"
X Link 2025-12-10T14:25Z 29.6K followers, 5851 engagements
"on 6.7.2067 there will be a resurgence of the XX meme"
X Link 2025-12-10T16:36Z 29.6K followers, 4498 engagements
"@quibvs he kinda carried part 2"
X Link 2025-12-10T20:23Z 29.6K followers, XXX engagements
"I think the ultimate test for AGI is whether AI can debate right now it's fucking terrible at it it keeps moving goalposts and a simple "are you sure" makes it switch positions"
X Link 2025-12-11T10:00Z 29.6K followers, 4514 engagements
"may the shipping continue GPT-5.2 - OpenAIs latest frontier model with improvements across knowledge reasoning and coding"
X Link 2025-12-11T16:59Z 29.6K followers, 14.3K engagements
"GPT-5.2 will support the xhigh reasoning effort"
X Link 2025-12-11T18:11Z 29.6K followers, 4722 engagements
"Opus XXX still ahead of GPT-5.2 on WebDev Arena"
X Link 2025-12-11T19:03Z 29.6K followers, 4665 engagements
"GPT-5.2 Pro looks terrible here GPT-5.2 is on par with Gemini X Pro in terms of efficiency"
X Link 2025-12-11T19:49Z 29.6K followers, 11.2K engagements
"GPT-5.2 Pro with ridiculous pricing once again $XX / $168"
X Link 2025-12-11T18:15Z 29.6K followers, 242.2K engagements
"GPT-5.2 xhigh beating Claude XXX Opus on Tau2-Bench Telecom but not Retail"
X Link 2025-12-11T18:24Z 29.6K followers, 3383 engagements
"GPT-5.2 weaker than GPT-5.1 Codex Max on CVE-Bench an eval that tasks models with identifying and exploiting real-world web application vulnerabilities"
X Link 2025-12-11T18:35Z 29.6K followers, 7495 engagements
"@apples_jimmy brother I'm literally just quoting OpenAI pricing and benchmarks the only subjective statement in my post is that it has "no big model smell""
X Link 2025-12-13T06:14Z 29.6K followers, 1224 engagements