[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]

@casper_hansen_ Avatar @casper_hansen_ Casper Hansen

Casper Hansen posts on X about k2, casper, o3, 1m the most. They currently have XXXXX followers and XXX posts still getting attention that total XXXXXX engagements in the last XX hours.

Engagements: XXXXXX #

Engagements Line Chart

Mentions: XX #

Mentions Line Chart

Followers: XXXXX #

Followers Line Chart

CreatorRank: XXXXXXX #

CreatorRank Line Chart

Social Influence #


Social category influence technology brands XXXX% stocks XXXX% travel destinations XXXX% countries XXXX% finance XXXX%

Social topic influence k2 #22, casper #14, o3 #8, 1m 2.88%, lucy 1.92%, open ai 1.92%, dot 1.92%, inference 1.92%, claude 1.92%, $googl XXXX%

Top accounts mentioned or mentioned by @maziyarpanahi @giffmana @thezachmueller @philtrem22 @willccbb @alandao_ai @garyfung @quixiai @xlr8harder @huggingface @hrishbhdalal @altryne @alibabaqwen @liminalsnake @ivanfioravanti @iotcoi @cloneofsimo @chatgpt21 @presidentlin @winglian

Top assets mentioned Alphabet Inc Class A (GOOGL)

Top Social Posts #


Top posts by engagements in the last XX hours

"Recipe to post-train Qwen3 1.7B into a DeepResearch model What does it mean for something small to think deeply Meet Lucy a posttrained Qwen31.7B as a DeepResearch model based on @willccbb's verifiers. Primary Rule-based Rewards: - Answer correctness We check whether the final response literally contains the ground-truth answer. This substring match is cheap and avoids calling a larger LLM judge. - Visit/search ratio If the agent visits at least as many pages as it issues search queries it receives ((visit_search_ratio - 1) / 4) ** XXXX. If it searches more than it visits the score is -0.5."
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 15:07:00 UTC 8849 followers, 39.8K engagements

"@daniel_mac8 it's been said that chinese teams ship fast but which one ships the 1M context model"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 17:39:51 UTC 8858 followers, 3098 engagements

"veRL v0.5.0 is out Agentic multi-turn rollouts with vLLM / SGLang LangGraph multi-turn agent rollouts Async trainer with X step off policy LoRA support for vision-language models Use remote LLM-as-a-judge (e.g. OpenAI models)"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 16:19:00 UTC 8850 followers, XXX engagements

"btw @minishlab would recommend adding an example like the one in my screenshot that shows how to use semhash with a Huggingface dataset"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 07:51:00 UTC 8859 followers, XXX engagements

"@chatgpt21 You think not What would it be then - o3 alpha"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-23 05:58:45 UTC 8859 followers, 1669 engagements

"@Presidentlin @altryne Qwen3 coder is already released after this post. This is from Zhipu AI and has been live on z (dot) ai for a bit"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 15:36:31 UTC 8861 followers, XX engagements

"@willccbb the correct answer is always depends if your job title includes engineer or researcher"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 14:41:01 UTC 8827 followers, XXX engagements

"The RL codebase I like the most: - The NanoGPT of RL - Supports multi-turn RL - Just 1k lines of code in Python - Data Tensor Sequence Parallel"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-17 09:45:32 UTC 8794 followers, 26.4K engagements

"@garyfung @6___0 kimi should have the edge with so many more parameters"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 16:46:43 UTC 8846 followers, XX engagements

"Im seeing lots of people with the worst takes on IMO medals OpenAI and generally AI on the timeline. Where did we go wrong when we critique such a crazy achievement The craziest part that its just natural language"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-20 10:40:44 UTC 8619 followers, 1164 engagements

"@mgoin_ Michael if you ever need more buy-in from the vLLM / PyTorch team on this just reference this X post please :D Very much looking forward to see the progress on this one as I think a ton of people will feel a difference here (serverless RL)"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 15:21:29 UTC 8791 followers, XXX engagements

"Another 20s saved on load already merged in nightly. Next vLLM release (0.9.3) will be amazing"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 14:35:18 UTC 8858 followers, 1451 engagements

"Ever wanted to solve biomedicine Here you go thousands of applications can be created from this"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-17 11:16:56 UTC 8791 followers, 6045 engagements

"@Sebyverse Ever heard of RLHF or RLVR Thats how models like o3 R1 K2 etc. are trained. 80-90% of this training process is inference. So you need to build inference before you build training"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 13:13:27 UTC 8731 followers, XXX engagements

"@menloresearch @Alibaba_Qwen I wrote this post that covers your model Thanks for sharing everything - would recommend releasing a paper"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 16:37:35 UTC 8840 followers, XXX engagements

"@giffmana @Alibaba_Qwen Try to read the chat template and apply it in various ways over multiple turns"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 19:13:05 UTC 8775 followers, XXX engagements

"@Conor_D_Dart we will manifest a release by giving them attention on X. they are testing models by the looks of it (fixed some quantization issue) so it's close"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 17:07:47 UTC 8859 followers, XXX engagements

"@WolframRvnwlf I found this nugget yesterday. Surface-level this looks like everything you would want"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 12:47:23 UTC 8779 followers, XXX engagements

"Does anyone know what the required hardware is to run Qwen3 235B at 256k context length"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-23 08:25:35 UTC 8860 followers, 1154 engagements

"claude opus and kimi k2 have been so heavily RL'd that they will sometimes hallucinate for literally no reason. and people say o3 is bad all models exhibit this behaviour"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 13:01:00 UTC 8859 followers, XXX engagements

"@giffmana @Alibaba_Qwen Unfortunately I don't have public code to trigger this. Will touches on it briefly here. It's mostly horrible in terms of managing enable_thinking between turns and trying to capture format with empty think tags"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 19:12:47 UTC 8779 followers, XXX engagements

"This is not a SMALL update. This is huge Give us this for every model please Qwen team🙏"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 17:32:55 UTC 8859 followers, 38.8K engagements

"Want to get better at managing multiple Claude Code instances Just go play StarCraft thats way harder"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 06:55:11 UTC 8840 followers, 1035 engagements

"@willccbb Model (Apache 2.0): Code: (MIT):"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 15:10:50 UTC 8856 followers, 2585 engagements

"if you ever see "nuggetization" in a paper you know they are not holding back"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 17:04:39 UTC 8864 followers, X engagements

"@boneGPT Sam has a CS degree and previously coded"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-19 18:39:27 UTC 8767 followers, 2584 engagements

"@giffmana @Alibaba_Qwen The Qwen3 hybrid chat template was a nightmare to manage in a multi-turn scenario. Apart from that a hybrid model should be as strong as the standalone version. If you cant do that then a routing system over a model harness is better"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 19:06:21 UTC 8779 followers, 2003 engagements

"@prashant_hq I didnt have the lite version"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 12:40:00 UTC 8723 followers, XXX engagements

"@michaelzluo @willccbb Wish I had those numbers. Paper inference and MCP server is pending release. They released SimpleQA though"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 16:06:16 UTC 8808 followers, XXX engagements

"if you loved kimi k2 you will love what another cracked chinese team is about to release. Models: - o3 competitor: multi-turn reasoning coding search - 106B A12B: XXX experts X active 128k context GQA - 355B A32B: Details unknown about config"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 15:18:15 UTC 8861 followers, 25.8K engagements

"uv for brew uv for apt uv for literally anything is all you need"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 14:51:00 UTC 8859 followers, 1018 engagements

"Vibe check on Kimi K2 vs Qwen3 Coder a 6D projection of space and time folding in half many times over XXX vs XXX lines of pure html rendered in a single gradio space winner winner chicken dinner for kimi "visualizing the multidimensional folding of space-time fabric across six dimensions" "each vertex represents a quantum state each fold a temporal distortion" no chicken dinner for qwen3 in this round effects are just not as cool or spacy to a non-quantum guy like me Why are problems like these hard They test extrapolation abilities - there is no plausible dataset for a 6D projection of"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 17:37:00 UTC 8863 followers, XX engagements

"Step X of many: X. Three weeks ago I released a biomedical dataset of 521k samples. X. Two weeks ago I released full-text embeddings (32k) with 2560 dimension from Qwen3 4B embedding model. X. This week I release 19k semantically clustered texts that can be used to construct multi-article biomedical question-answering"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 15:01:34 UTC 8858 followers, 4457 engagements

"@vikhyatk accurate. you need a hook to capture a diminishing attention span. like i 2x'd my whatever because of 4-bit quant"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 08:59:04 UTC 8860 followers, XXX engagements

"@TheZachMueller Looked it up gigabrain vibes on this one"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 12:42:45 UTC 8724 followers, XXX engagements

"@axolotl_ai Woah I'm a fan of this ALST @winglian is this preferred over sequence parallelism faster/scales better"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-09 19:29:03 UTC 8810 followers, XX engagements

"@alandao_ai Thanks for creating it Alan Do you by chance have a wandb run that you could share Would love to observe the raw data of the training run :)"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-23 08:21:57 UTC 8796 followers, XXX engagements

"@mark_k @ChatGPTapp Not sure I will get it anytime soon as I don't have a subscription"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 10:24:16 UTC 8847 followers, XXX engagements

"We need something better than Nvidia or AMD if we want to scale up AGI and make it accessible. A simple 2x in compute performance every X years is not enough to satisfy where we are headed"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 14:11:36 UTC 8772 followers, 1002 engagements

"@altryne More news to (maybe) chat about :)"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 15:25:03 UTC 8859 followers, XXX engagements

"be Demis Hassabis give Nobel Prize lecture drops a provocative conjecture: "any pattern that can be generated or found in nature can be efficiently discovered and modelled by a classical learning algorithm" protein folding's search space is astronomically large yet nature solves it in milliseconds Why Natural systems aren't random. They have structure shaped by evolution and physics (think protein folding mountain ranges planetary orbits) This underlying structure is learnable by AI He calls it "Survival of the Stablest" AI is reverse-engineering the universe"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 18:18:00 UTC 8863 followers, X engagements

"I will be at the ACL conference in Vienna soon AGI in hotel room vs talking to people will be a real struggle"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 16:28:00 UTC 8849 followers, XXX engagements

"@jon_durbin Do you have a launch command or script for this that works with all the bells and whistles you just described"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-23 10:51:13 UTC 8791 followers, XXX engagements

"TLDR; you can just fill out a form to have coffee with Lex which I think is pretty cool"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 09:36:33 UTC 8863 followers, XXX engagements

"vLLM is finally addressing a long-standing problem: startup times 35s - 2s for CUDA graph capture is a great reduction"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 12:36:35 UTC 8864 followers, 35.2K engagements

""Home light music synchronization with GPT-5 in X minutes" is essentially what's coming very soon. And you were clowning him for not having coding taste Imagine the demos"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 19:54:03 UTC 8857 followers, 75K engagements

"@VoyageAI would love to share this but your HF upload is empty and doesn't have an open-source license"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 08:24:08 UTC 8859 followers, XXX engagements

"be Sam Altman asked what keeps him up at night about AI X scary categories X. Bad guy gets superintelligence first designs a bioweapon take down the United States power grid break into the financial system and take everyone's money X. Sci-fi loss of control The AI is like oh I don't actually want you to turn me off. I'm afraid I can't do that X. Accidental takeover the models kind of accidentally take over the world they just become so ingrained in society Theres young people who just say like I can't make any decision in my life without telling ChatGPT everything that's going on We're so"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 19:41:51 UTC 8859 followers, 97.3K engagements

"@kalomaze it's a great model. the tool calling abilities alone are outstanding but would like to see evals. have you tried the model yet (it's on z (dot) ai)"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 18:32:49 UTC 8864 followers, X engagements

"Google just deactivated my final ad-blocker that worked. So now its really time to move browser - what are people using these days"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-21 12:37:07 UTC 8767 followers, 3752 engagements

"if you loved kimi k2 you will love what a certain chinese team is about to release which is highly competitive with 1M context length"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 16:35:32 UTC 8859 followers, 96K engagements

"Whenever someone says DeepSeek this song runs in my head but with "I follow you Deep Seek baby""
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-22 17:34:21 UTC 8860 followers, 6263 engagements

"@chipro post training because context engineering is a space with space more players as there are many strong players already and post training is mostly an untapped market that's expanding massively with verifiers and multi-turn RL"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 16:01:59 UTC 8864 followers, XXX engagements

"a bunch of tricks to fool AI in a gotcha is not a good eval and not something you should hill climb. HLE might be detrimental to true progress given the low quality"
@casper_hansen_ Avatar @casper_hansen_ on X 2025-07-24 15:57:00 UTC 8859 followers, XXX engagements