[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.] #  @TheAhmadOsman Ahmad Ahmad posts on X about open ai, all the, gpu, $googl the most. They currently have XXXXXX followers and XXX posts still getting attention that total XXXXXXX engagements in the last XX hours. ### Engagements: XXXXXXX [#](/creator/twitter::248951926/interactions)  - X Week XXXXXXXXX +44% - X Month XXXXXXXXX -XX% - X Months XXXXXXXXXX +663% - X Year XXXXXXXXXX +300,520% ### Mentions: XX [#](/creator/twitter::248951926/posts_active)  - X Week XXX +34% - X Month XXX +5.30% - X Months XXX +671% - X Year XXX +3,481% ### Followers: XXXXXX [#](/creator/twitter::248951926/followers)  - X Week XXXXXX +3.10% - X Month XXXXXX +15% - X Months XXXXXX +422% - X Year XXXXXX +4,129% ### CreatorRank: XXXXXX [#](/creator/twitter::248951926/influencer_rank)  ### Social Influence [#](/creator/twitter::248951926/influence) --- **Social category influence** [technology brands](/list/technology-brands) #2269 [stocks](/list/stocks) #2446 [finance](/list/finance) XXXX% [social networks](/list/social-networks) XXXX% [countries](/list/countries) XXXX% [celebrities](/list/celebrities) XXX% **Social topic influence** [open ai](/topic/open-ai) #30, [all the](/topic/all-the) #1027, [gpu](/topic/gpu) #87, [$googl](/topic/$googl) 2.38%, [debt](/topic/debt) #268, [gpus](/topic/gpus) #18, [ebay](/topic/ebay) #625, [macbook](/topic/macbook) #440, [token](/topic/token) #375, [money](/topic/money) XXXX% **Top accounts mentioned or mentioned by** [@elliotarledge](/creator/undefined) [@mjyoke1111](/creator/undefined) [@anyconvo](/creator/undefined) [@spacewelder314](/creator/undefined) [@wreckedbymemes](/creator/undefined) [@rasbt](/creator/undefined) [@f1yingbanana](/creator/undefined) [@leeleepenkman](/creator/undefined) [@chilly99705](/creator/undefined) [@josephstigler](/creator/undefined) [@rogerscissp](/creator/undefined) [@qasimstatic](/creator/undefined) [@futuristasi](/creator/undefined) [@thexeophon](/creator/undefined) [@jameslee1033176](/creator/undefined) [@danielsamanez3](/creator/undefined) [@dogzrcoolnstuff](/creator/undefined) [@balatriankid](/creator/undefined) [@myainotez](/creator/undefined) [@architectloop](/creator/undefined) **Top assets mentioned** [Alphabet Inc Class A (GOOGL)](/topic/$googl) [Costco Wholesale Corporation (COST)](/topic/costco) [Dell Technologies, Inc. (DELL)](/topic/dell) ### Top Social Posts [#](/creator/twitter::248951926/posts) --- Top posts by engagements in the last XX hours "never have been more proud to be self-hosting my infra and toolings zero impact from the AWS outage only found out because the timeline is crashing out local infra supremacy 🫡" [X Link](https://x.com/TheAhmadOsman/status/1980327842719363494) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T17:38Z 26.8K followers, 4366 engagements "lol lmao even ollama are lying through their teeth in this reply to me next tweet i'll show the llama cpp merge for gpt-oss to ollama some comments on the merge calling them out llama cpp developer remarks" [X Link](https://x.com/TheAhmadOsman/status/1964546485045121489) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-07T04:29Z 26.6K followers, 54K engagements "this is why we Buy GPUs GLM-4.6 is a KINO Agentic model daily driver within Claude Code" [X Link](https://x.com/TheAhmadOsman/status/1973889129781080497) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-02T23:13Z 26.6K followers, 13.6K engagements "- local llms XXX - tired of guides that just tell you to run a script and call it a day - want to actually know what your GPU is doing not just trust a black box - here's what really happens when you run a local LLM - what gets loaded why and how it all fits together - no gatekeeping just the real explanations nobody gives you - the elite don't want you to know this - running a model = inference (using model weights) - inference = predicting the next token based on your input plus all tokens generated so far - together these make up the "sequence" - tokens words - they're the chunks" [X Link](https://x.com/TheAhmadOsman/status/1974730678513041490) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-05T06:57Z 26.6K followers, 59.9K engagements "HUGE a new Agentic coding model fits on 4x RTX 3090s @ 4-bit fully local KAT-Dev-72B-Exp by Kwaipilot - Claude Code setup guide included - ranks #2 on SWE-Bench Verified - excels at long-horizon coding + tool-use - multi-stage tuned: Mid-Training SFT + RFT Agentic RL" [X Link](https://x.com/TheAhmadOsman/status/1976606921756205531) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-10T11:13Z 26.6K followers, 26.4K engagements "local AI is a UX problem we're gonna fix it" [X Link](https://x.com/TheAhmadOsman/status/1976712152053616683) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-10T18:11Z 26.6K followers, 5722 engagements "- when to use RAG vs fine-tuning (the decision table) - new facts fast-changing docs per-tenant data RAG - tone/style/formatting tool-use behavior task following SFT fine-tune - logic/chain patterns preference shaping (less about facts) Preference tuning (RLHF/RLVR) - do both: RAG for facts fine-tune for how to speak/reason - fine-tuning: the X most common flavors - SFT (supervised fine-tuning): input target output pairs - use for formats style multi-step demos tool calls - Preference tuning (RLHF/RLVR variants): pairs (good vs bad response) - use for which answer feels better/safer - Task" [X Link](https://x.com/TheAhmadOsman/status/1977274616671314154) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-12T07:26Z 26.6K followers, 22.7K engagements "If AMD is serious about AI they need to do it the Mark Zuckerberg way and poach the CUDA devs from NVIDIA" [X Link](https://x.com/TheAhmadOsman/status/1977278283852230842) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-12T07:40Z 26.6K followers, 41.4K engagements "all the gigawatts mean nothing if Google gets there first and Google has everything to beat not only OpenAI but NVIDIA as well" [X Link](https://x.com/TheAhmadOsman/status/1977810885789041081) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-13T18:57Z 26.6K followers, 7582 engagements "Codex CLI with GPT-5 High for planning Claude Code with GLM-4.6 for execution make sure to INSTRUCT both to follow a modular project structure with Domain-Driven Design architecture this is THE ULTIMATE RECIPE for Agentic coding once you get it youll never look back" [X Link](https://x.com/TheAhmadOsman/status/1978115794526449836) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T15:08Z 26.6K followers, 8536 engagements "a reminder that in closed source AI from companies like OpenAI & Anthropic you have zero control over how the models behave and they can quantize it distill it hot-swap to a cheaper/weaker checkpoint make the model manipulative fine-tune it in ways that break safety or depth drop its IQ run experiments on you and/or your data throttle output speed or raise prices sunset the entire model/version they have all the knobs & you're at their mercy you won't even get a changelog opensource FTW Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1978211350489784571) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T21:28Z 26.6K followers, 24.5K engagements "pre-launch teasers continue the tier list is done Buy a GPU The Movement" [X Link](https://x.com/TheAhmadOsman/status/1979277630902812983) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T20:05Z 26.6K followers, 7083 engagements "theres what to buy what to avoid and everything in between this tier list will cover it all" [X Link](https://x.com/TheAhmadOsman/status/1979296363952128278) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T21:20Z 26.6K followers, 1257 engagements "how do we get to small specialized models by doing exactly this great breakdown from @myainotez" [X Link](https://x.com/TheAhmadOsman/status/1979388291456598386) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T03:25Z 26.6K followers, 3086 engagements "- Xerox - invented it all - got $X million in Apple stock - entire company gone by 2000 - thanks for your service - Steve Jobs - LSD still active - go to Xerox PARC - see a mouse windows GUI - brain goes "holy shit" - say were taking this - engineers: wait what do you mea - its already in the Mac prototype - everyone claps - reinvented computing - by watching someone elses demo - Jobs: good artists copy great artists steal - Bill Gates - see Mac - see Jobs demo it - looks suspiciously like Xerox - pretend to be impressed - hey Steve wed love to build Excel for the Mac - secretly building" [X Link](https://x.com/TheAhmadOsman/status/1979581072368353745) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T16:11Z 26.8K followers, 45.2K engagements "@samsja19 one of the most rational takes i've read on this interview XXX% agree" [X Link](https://x.com/TheAhmadOsman/status/1980342134999077218) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T18:35Z 26.8K followers, 1957 engagements "@Yuchenj_UW i really *really* don't understand how google just keeps on fumbling like this" [X Link](https://x.com/TheAhmadOsman/status/1980701666044441025) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T18:24Z 26.8K followers, XXX engagements "i will not be installing ChatGPT Atlas browser from OpenAI and neither should you" [X Link](https://x.com/TheAhmadOsman/status/1980714787949670623) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T19:16Z 26.8K followers, 401.8K engagements "for inference give me a single RTX PRO 6000 over 3x RTX 5090s any day for training flip it 3x RTX 5090s clears the RTX PRO 6000 off the map different workloads different kings Buy a GPU The Movement" [X Link](https://x.com/TheAhmadOsman/status/1980739990863913025) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T20:56Z 26.8K followers, 14K engagements "@_thomasip No you care more about the VRAM & Bandwidth A single RTX PRO 6000 gives you 96GB to offload a model onto and there will be non communications between PCIe lanes during inference (if a model fits ofc)" [X Link](https://x.com/TheAhmadOsman/status/1980746295754031317) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T21:21Z 26.8K followers, XXX engagements "@elliotarledge and samplers god if people only tried to understand samplers and context budget" [X Link](https://x.com/TheAhmadOsman/status/1980835954933043503) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T03:17Z 26.8K followers, XXX engagements "My house has XX GPUs. 21x RTX 3090s 4x RTX 4090s 4x RTX 5090s 4x Tenstorrent Blackhole p150a Before AGI arrives: Acquire GPUs. Go into debt or sell your kidneys if you must. But whatever you do secure the GPUs" [X Link](https://x.com/TheAhmadOsman/status/1980842429122052306) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T03:43Z 26.8K followers, 56.7K engagements "qwen XXX 14B for under $XXX becomes SoTA BEATS OpenAI DeepResearch Claude Research MATCHES performance of Gemini XXX Pro train your own DeepResearch model following this tutorial & beat frontier labs State of The Art LLMs" [X Link](https://x.com/TheAhmadOsman/status/1963008626383266077) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-02T22:38Z 26.8K followers, 102.5K engagements "ollama alternatives lmstudio llama.cpp exllamav2/v3 vllm sglang among many others like literally anything is better than ollama lmao" [X Link](https://x.com/TheAhmadOsman/status/1963057701120029182) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-03T01:53Z 26.7K followers, 125K engagements "HUGE Kimi K2-Instruct-0905: New Drop From Moonshot AI 1T MoE model 256k context 32B active params Strong on SWE-bench & terminal tasks Beats Qwen3 GLM DeepSeek on SWE tasks Tool use agentic coding FP8 vLLM-ready" [X Link](https://x.com/TheAhmadOsman/status/1963837580958380221) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-05T05:32Z 26.8K followers, 24.6K engagements "nvidia market cap all pharma combined" [X Link](https://x.com/TheAhmadOsman/status/1974257278418047321) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-03T23:36Z 26.8K followers, 6604 engagements "raw alpha drop phb soak test slimsas mcio host adapters cable length retimers signal integrity airflow plan front-to-back cooling repadding vram thermals rackmount room hvac power budget 1200w psu sizing transients add2psu 240v circuits pdus power limiting gpu budget tier rtx 3090 ampere lane budget x16 gen4 cpu-direct lanes pcie bifurcation numa topology matrix checks vllm exllamav3 tp=2 tensor parallelism batch inference speculative decoding paged attention quantization w4a16 low-bit inference tokens per second throughput prefill vs decode tail latency validation diagnostics pcie" [X Link](https://x.com/TheAhmadOsman/status/1976392317876801928) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-09T21:00Z 26.8K followers, 5334 engagements "the top X most influential LLM releases that defined opensource AI LLaMA X Mistral 7B LLaMA X Qwen XXX DeepSeek R1" [X Link](https://x.com/TheAhmadOsman/status/1977083662001914122) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-11T18:47Z 26.7K followers, 124.7K engagements "3090s from X years ago are still $XXX on ebay nvidia cooked with Ampere and the improvements have been relatively marginal since then" [X Link](https://x.com/TheAhmadOsman/status/1977259690137436390) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-12T06:27Z 26.7K followers, 3808 engagements "- be us - Larry & Sergey - at Stanford with a crawler and a dream - accidentally organize the entire internet - call it Google - build search email maps docs OS phones browser car satellite thermostat AI lab TPU farm and quantum computer - 2025 - everyone talking about AGI - OpenAI: we need data sensors feedback and scale - us: staring at Google Maps YouTube Gmail Android Waymo Pixel Fitbit Docs Calendar Street View and Earth Engine - "damn. guess we already did that." - YouTube: 2.6M videos/day - Android: 3B phones streaming sensor data 24/7 - Gmail: 1.8B inboxes of human priors - Search:" [X Link](https://x.com/TheAhmadOsman/status/1977826335210041455) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-13T19:58Z 26.8K followers, 122.4K engagements "Someday and that day will come youll be out of VRAM and desperate for compute. And on that day youll remember who told you first to Buy a GPU. The Godfather of TFLOPs was right" [X Link](https://x.com/TheAhmadOsman/status/1978229470222799017) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T22:40Z 26.8K followers, 3361 engagements "always be VRAM & TFLOPs maxxing anon Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1978243091539669154) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T23:34Z 26.8K followers, 1947 engagements "guyyyyyyyyyyyyyyys karpathy is here we made it" [X Link](https://x.com/TheAhmadOsman/status/1978519337691390380) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-15T17:52Z 26.7K followers, 169.7K engagements "LLM infra is in its Windows NT era right now" [X Link](https://x.com/TheAhmadOsman/status/1978629349264822685) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-16T01:09Z 26.8K followers, 4267 engagements "i woke up this morning with one thought on my mind Costco for GPUs" [X Link](https://x.com/TheAhmadOsman/status/1978778620773519417) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-16T11:02Z 26.8K followers, 7766 engagements "the Windows XP era of local LLMs is almost here were about to make local AI no headaches the new default soon youll fire up AI at home and boom it just works get ready its going to be glorious" [X Link](https://x.com/TheAhmadOsman/status/1978800916821057621) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-16T12:31Z 26.8K followers, 4078 engagements "this is what opensource LLM inference looks like in my head pure chaos lousy & misplaced integrations everything half-broken somehow still runs we're so early and there's a lot of work to do" [X Link](https://x.com/TheAhmadOsman/status/1979322959216152877) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T23:05Z 26.8K followers, 2169 engagements "uv pip install date-night" [X Link](https://x.com/TheAhmadOsman/status/1979350575860060596) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T00:55Z 26.7K followers, 3800 engagements "builds for different budgets and use cases (e.g. inference vs training) this will include what to buy where to buy it when and when not to buy it" [X Link](https://x.com/TheAhmadOsman/status/1980026753843097745) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:42Z 26.7K followers, XXX engagements "all the way to 14x GPUs builds" [X Link](https://x.com/TheAhmadOsman/status/1980027070085513232) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:43Z 26.7K followers, XXX engagements "the ultimate glossary on anything and everything you need to know about buying a GPU building an AI rig or server and running inference" [X Link](https://x.com/TheAhmadOsman/status/1980027176838951084) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:44Z 26.7K followers, XXX engagements "as well as straightforward short snippets about concepts that you need to know while putting together a build it's up to you which one to read if at all (the must reads to get a machine up and running will be pointed out)" [X Link](https://x.com/TheAhmadOsman/status/1980027476563918993) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:45Z 26.7K followers, XXX engagements "@JosephStigler is that the max would you wanna add more gpus later (and if so how much do you expect to spend on that later) for inference mainly or training" [X Link](https://x.com/TheAhmadOsman/status/1980049217725890687) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T23:11Z 26.7K followers, XX engagements "so if you're trying to be future-proof the best option is RTX PRO 6000 but that's $8k basically for GPU alone & 96GB of VRAM 2nd best option right now is 2x RTX 5090s 64GB of VRAM and then you can add more which will give you more TFLOPs as you add more I'd go for 2x 5090s with an Eypc 9004 build and DDR5" [X Link](https://x.com/TheAhmadOsman/status/1980051858765869520) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T23:22Z 26.7K followers, XX engagements "@opsided yeah there is a lot to running a local llm and the UX needs an overhaul in my opinion" [X Link](https://x.com/TheAhmadOsman/status/1980052473701183805) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T23:24Z 26.6K followers, XX engagements "@rogerscissp Local AI rocks doesn't it" [X Link](https://x.com/TheAhmadOsman/status/1980088635107189179) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T01:48Z 26.8K followers, XXX engagements "DGX Spark - Ahmad's Analysis downgraded to no buy X the cost of a 5090 for XX% FP4 perf crippled bandwidth (273 GB/s worse than a MacBook Pro)" [X Link](https://x.com/TheAhmadOsman/status/1961209625744719952) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-08-28T23:29Z 26.8K followers, 55.5K engagements "i cancelled my Claude subscription and you should too. why 1.58-bit quantized models during daytime plus not getting opus X in claude code max plans limits cut in half X weeks ago no comms weekly limits without concrete numbers 5x/20x plans being actually 3x/8x of plus DMCA takedowns of repos that have to do with Claude Code windsurf no access to claude X cutting off openai api access "DEGRADED QUALITY" models incidents that they only acknowledged after being called and out without providing any further information support or refunds X years retention of all conversations and code all data" [X Link](https://x.com/TheAhmadOsman/status/1964540306503905782) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-07T04:04Z 26.8K followers, 335.8K engagements "be me curious how X decides who goes viral and who gets shadowbanned into oblivion read the source code. all of it. 400000 lines it's a mess. it's a masterpiece. it's a threat model disguised as a social network proceed to get 7M impressions and 7k followers in X days i have *seen* the algorithm here's how to make it your slave X is a game rules are secret stakes are your visibility you win by: replying to replies (replyguymaxxing) baiting profile clicks (profilevisitmax) getting bookmarked like you're the Dead Sea Scrolls not getting blocked or muted (instant debuff) spacing tweets out" [X Link](https://x.com/TheAhmadOsman/status/1966106497852621250) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-11T11:48Z 26.8K followers, 340.2K engagements "- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs *actually* work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -" [X Link](https://x.com/TheAhmadOsman/status/1968981460829782317) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-19T10:12Z 26.8K followers, 261K engagements "My house has XX GPUs. 21x RTX 3090s 4x RTX 4090s 4x RTX 5090s 4x Tenstorrent Blackhole p150a Before AGI arrives: Acquire GPUs. Go into debt if you must. But whatever you do secure the GPUs" [X Link](https://x.com/TheAhmadOsman/status/1969065232010973603) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-19T15:45Z 26.8K followers, 293.4K engagements "- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs *actually* work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -" [X Link](https://x.com/TheAhmadOsman/status/1974429454358220800) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-04T11:00Z 26.8K followers, 241.7K engagements "China saved opensource LLMs between July 16th and today these are the major releases: DeepSeek V3.2 GLM-4.6 (335B-A32B) Qwen3-VL-30B-A3B (Instruct & Thinking) Qwen3-VL-235B-A22B (Instruct & Thinking) Qwen3-Next 80B-A3B (Instruct & Thinking) GLM-4.5V (VLM 106B-A12B) DeepSeek V3.1 Doubao 1.6-Vision (multimodal tool-calling) Doubao Translation XXX (ByteDance XX Languages) ERNIE X1.1 (Baidu Reasoning) Hunyuan-MT-7B & Chimera-7B (Tencent Translation Specialists) MiniCPM-V XXX (8B) Tiny but GPT-4o-level VLM InternVL XXX (MASSIVE Multimodal Family of Models 1B to 241B Sizes) Step-3 (VLM 321B/38B)" [X Link](https://x.com/TheAhmadOsman/status/1975425758865547432) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-07T04:59Z 26.8K followers, 161.2K engagements "do not use Ollama ggerganov wrote blazing-fast C++ inference (ggml llama.cpp) then Ollama wrapped it in a bloated binary and is now somehow the face of local LLMs soaking up VC hype and it's not even a good wrapper lol" [X Link](https://x.com/TheAhmadOsman/status/1975517901302993086) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-07T11:05Z 26.8K followers, 134.3K engagements "ollama alternatives lmstudio llama.cpp exllamav2/v3 vllm sglang among many others like literally anything is better than ollama lmao" [X Link](https://x.com/TheAhmadOsman/status/1975818264719884586) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-08T06:59Z 26.8K followers, 23.8K engagements "- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs actually work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -" [X Link](https://x.com/TheAhmadOsman/status/1977760449023144014) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-13T15:36Z 26.8K followers, 57.8K engagements "benchmarking anything against Ollama is criminal" [X Link](https://x.com/TheAhmadOsman/status/1978100042205474819) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T14:06Z 26.8K followers, 5422 engagements "this is how you map out a 2x GPU PCIe layout every builder needs to know this cold Buy a GPU is almost here and its lane-checking all your builds Buy a GPU The Movement" [X Link](https://x.com/TheAhmadOsman/status/1978605905764630993) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-15T23:36Z 26.8K followers, 9909 engagements "NVIDIA is now bigger than all the banks in the US and Canada combined" [X Link](https://x.com/TheAhmadOsman/status/1979011215524250054) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T02:26Z 26.8K followers, 315.8K engagements "- local llms XXX - running a model = inference (using model weights) - inference = predicting the next token based on your input plus all tokens generated so far - together these make up the "sequence" - tokens words - they're the chunks representing the text a model sees - they are represented by integers (token IDs) in the model - "tokenizer" = the algorithm that splits text into tokens - common types: BPE (byte pair encoding) SentencePiece - token examples: - "hello" = X token or maybe X or X tokens - "internationalization" = XX tokens - context window = max tokens model can "see" at once" [X Link](https://x.com/TheAhmadOsman/status/1979597401435283751) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T17:16Z 26.8K followers, 30.7K engagements "my timeline is full of GPUs everyone talking about GPUs everyone asking about GPUs everyone acquiring GPUs even Mac Studio truthers Buying GPUs even the ones who mocked me Buying GPUs our movement is winning this is the good timeline Buy a GPU The Movement" [X Link](https://x.com/TheAhmadOsman/status/1979965928566612003) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T17:40Z 26.8K followers, 7655 engagements "Someone: Just use the cloud. Me standing up in town hall: COMPUTE IS OUR BIRTHRIGHT OUR AGIs SHALL RUN LOCALLY" [X Link](https://x.com/TheAhmadOsman/status/1980004376082055311) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T20:13Z 26.8K followers, 2389 engagements "the Buy a GPU website & guide is launching this week so what should you expect" [X Link](https://x.com/TheAhmadOsman/status/1980026689217298545) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:42Z 26.8K followers, 7911 engagements "there will also be sections on software inference tools you should install and use" [X Link](https://x.com/TheAhmadOsman/status/1980027556859625670) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-19T21:45Z 26.8K followers, 1308 engagements "a lesson im glad i learned early opinions matter until they dont if your gut says go and no one else sees it trust it sometimes your instincts spot what others cant some of my best moves looked dumb at the time they werent drown out the doubt bet on the upside" [X Link](https://x.com/TheAhmadOsman/status/1980062511899853236) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T00:04Z 26.8K followers, 1979 engagements "me when i come across a tweet of someone buying a GPU to run LLMs locally" [X Link](https://x.com/TheAhmadOsman/status/1980086729886576782) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T01:40Z 26.8K followers, 4548 engagements "i believe in you whale" [X Link](https://x.com/TheAhmadOsman/status/1980102923754381348) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T02:45Z 26.8K followers, 62.1K engagements "your favorite LLMs wouldn't be down becsuse DNS issues at AWS if you hosted them locally Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1980205610143780867) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T09:33Z 26.8K followers, 7748 engagements "kidneys to GPUs exchange market who's building this" [X Link](https://x.com/TheAhmadOsman/status/1980346281060184272) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T18:52Z 26.8K followers, 2089 engagements "dear diary its been XX hours since aws-us-east-1 vanished into the void half the internet apparently shares one data center lease and none of their computers are answering calls im starting to think the that cloud was just someone elses basement all along" [X Link](https://x.com/TheAhmadOsman/status/1980368857006243903) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T20:21Z 26.8K followers, 4091 engagements "cant write code because Cursor and Codex are both down thanks to the aws-us-east-1 outage tired of Anthropics weekly limits and nerfed models with one command and a few GPUs you can route Claude Code to your own local LLM with ZERO downtime Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1980407291951104124) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T22:54Z 26.8K followers, 18.4K engagements "20 years from now liberal arts schools will offer full courses on the pre and post Buy a GPU eras syllabus will include: GPU as a cultural artifact compute nationalism XXX DeepSeek panic and the silicon rush local models & the fall of centralized AI" [X Link](https://x.com/TheAhmadOsman/status/1980468906063237285) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T02:59Z 26.8K followers, 6807 engagements "step-by-step LLM Engineering Projects each project = one concept learned the hard (i.e. real) way Tokenization & Embeddings build byte-pair encoder + train your own subword vocab write a token visualizer to map words/chunks to IDs one-hot vs learned-embedding: plot cosine distances Positional Embeddings classic sinusoidal vs learned vs RoPE vs ALiBi: demo all four animate a toy sequence being position-encoded in 3D ablate positionswatch attention collapse Self-Attention & Multihead Attention hand-wire dot-product attention for one token scale to multi-head plot per-head weight heatmaps mask" [X Link](https://x.com/TheAhmadOsman/status/1980664520121954333) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T15:56Z 26.8K followers, 17.8K engagements "imagine spending years locking down your data APIs just for OpenAI to ship a browser that collects personalized granular data at scale they're not just browsing theyre reverse engineering your pipelines if adopted widely this browser move is a KILLER move" [X Link](https://x.com/TheAhmadOsman/status/1980695255147245993) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T17:58Z 26.8K followers, 82.5K engagements "@witizenship I am willing to bet on @tenstorrent more than AMD now AMD aren't serious" [X Link](https://x.com/TheAhmadOsman/status/1980751870738960679) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T21:43Z 26.8K followers, XXX engagements "@TSpencer260 been chatting with @davorVDR a bunch of new architectures on the way and inference supports about to get a lot better theyre not standing still just cooking quietly from what ive seen" [X Link](https://x.com/TheAhmadOsman/status/1980804580502880677) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T01:13Z 26.8K followers, XXX engagements "best X opensource Agentic LLMs you can run at home GLM XXX Air GPT OSS 120B GPT OSS 20B these model excel at executing commands and running tasks in the background on your behalf and they can run on hardware ranging from 4x RTX 3090s ($3k) to 1x RTX 3090 ($700)" [X Link](https://x.com/TheAhmadOsman/status/1980811280467521815) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T01:39Z 26.8K followers, 23.2K engagements "@lifetimization it's a good model but not agentic (wouldn't run it inside claude code/codex cli)" [X Link](https://x.com/TheAhmadOsman/status/1980835431802630320) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T03:15Z 26.8K followers, XXX engagements "today this guy axes FAIR at Meta so this is a quick recap of his origin story and why he should not be the one making that decision Alexandr Wang born January 1997 age XX drop out of MIT co-found Scale AI "what if we label data but mid" convince every LLM company that this is fine 20162023 flood the market with barely-labeled goat photos and out-of-context Reddit takes call it foundational data raise billions valuation hits $7.3B everyone claps 2025 sell Scale AI to Meta for $14B not a typo. fourteen. billion. dollars. join Meta as Chief AI Officer rename division to Meta Superintelligence" [X Link](https://x.com/TheAhmadOsman/status/1981001726313251224) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T14:16Z 26.8K followers, 34.1K engagements "some of anthropic rugpulls so far: X years retention of all conversations and code all data will be used for training 1.58-bit quantized models during daytime plus not getting opus X in claude code max plans limits cut in half X weeks ago no comms weekly limits without concrete numbers 5x/20x plans being actually 3x/8x of plus DMCA takedowns of repos that have to do with Claude Code one of which was my own personally windsurf no access to claude X cutting off openai api access what an absolutely horrible company" [X Link](https://x.com/TheAhmadOsman/status/1961326485672772040) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-08-29T07:14Z 26.8K followers, 461.2K engagements "there is a lot of MONEY in this add /.json at the end of any Reddit link and get the entire thread including all replies to the n-th depth and all the metadata as JSON and then use LLMs to extract/analyze/etc you can make so much $$$ from niche subreddits" [X Link](https://x.com/TheAhmadOsman/status/1964583335147237830) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-07T06:55Z 26.8K followers, 1.1M engagements "be me Larry Ellison own a database empire a sailboat and a disdain for poor people OpenAI wants to build AGI needs compute *a lot* of compute like $300B GPU cluster in a volcano levels of compute "Hello. I own Oracle Cloud. Also Im rich." sign $300B GPU hosting deal with OpenAI nothing ships. no GPUs installed. no fans spinning. Oracle stock: skyrockets to the moon my net worth: +$100B GPUs still imaginary OpenAI raises a $X TRILLION round yes with a T investors lining up like it's a Taylor Swift concert I invest whered I get the money from the $300B deal *I signed with them* yes. OpenAI" [X Link](https://x.com/TheAhmadOsman/status/1966657829084762544) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-13T00:18Z 26.8K followers, 1.8M engagements "vibe coders secure your systems with this tool in X easy step" [X Link](https://x.com/TheAhmadOsman/status/1971471008478658567) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-09-26T07:04Z 26.8K followers, 469.6K engagements "i built a simple tool that makes Claude Code work with any local LLM full demo: vLLM serving GLM-4.5 Air on 4x RTX 3090s Claude Code generating code + docs via my proxy X Python file + .env handles all requests nvtop showing live GPU load how it all works Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1975917353071517765) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-08T13:33Z 26.8K followers, 120.1K engagements "quick vram math for llms (weights only) fp16 = X bytes/param w4a16 = XXX bytes/param exl2 = XXXXX / XXX / XXXXX / XXXXXX bytes/param for XXX / XXX / XXX / XXX bpw rough vram per model size: 7b: fp16 14gb w4a16 3.5gb exl2 (4.0bpw) 3.5gb 13b: fp16 26gb w4a16 6.5gb exl2 (4.0bpw) 6.5gb 34b: fp16 68gb w4a16 17gb exl2 (4.0bpw) 17gb 70b: fp16 140gb w4a16 35gb exl2 (4.0bpw) 35gb 236b: fp16 472gb w4a16 118gb exl2 (4.0bpw) 118gb 405b: fp16 810gb w4a16 202gb exl2 (4.0bpw) 202gb do the math: vram = params bytes_per_param ex: 70b @ w4a16 XX XXX XXX = XX x XXX bytes XX gb fits on X 24gb gpus with room for" [X Link](https://x.com/TheAhmadOsman/status/1977411687100641691) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-12T16:30Z 26.8K followers, 14.8K engagements "karpathy just released a new project in it and in under 8000 lines of code you get to: train the tokenizer using a new rust implementation pretrain a transformer llm on fineweb evaluate core score across a number of metrics midtrain on user-assistant conversations from smoltalk multiple choice questions tool use sft evaluate the chat model on world knowledge multiple choice (arc-e/c mmlu) math (gsm8k) code (humaneval) rl the model optionally on gsm8k with "grpo" efficient inference the model in an engine with kv cache simple prefill/decode tool use (python interpreter in a lightweight" [X Link](https://x.com/TheAhmadOsman/status/1977765469907042731) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-13T15:56Z 26.8K followers, 181K engagements "hell no i am not giving my ID to OpenAI lol this is just the beginning btw censorship ads nerfed models etc are all on the table once these companies win and that's why they CANNOT win your intelligence stack must be opensource and fully under your control Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1978172449834316107) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-14T18:53Z 26.8K followers, 59.8K engagements "been asked a lot today about the NVIDIA DGX Spark also getting lots of reactions on this old post of mine about it hoping to get my hands on one soon to review and expand a bit further on what it's good for and what not (plus it's pretty ngl want X on my desk haha) stay tuned" [X Link](https://x.com/TheAhmadOsman/status/1978272092534575603) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-15T01:29Z 26.8K followers, 14.4K engagements "if i were starting today RTX PRO 6000 then X RTX 5090s then X RTX 3090s thats the GPU stack id chase in that exact order" [X Link](https://x.com/TheAhmadOsman/status/1978834842994237580) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-16T14:46Z 26.8K followers, 16.7K engagements "if youre in software pivot to GPUs now take on debt sell a kidney whatever but get your hands on that silicon the SaaS gold rush is over new money is compute the next king isnt code its hardware" [X Link](https://x.com/TheAhmadOsman/status/1978977911479701848) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T00:14Z 26.8K followers, 23.5K engagements "in 2025 your focus SHOULD NOT be CUDA the real bottlenecks are: data inference evals dataloaders infra in general want to get good mess with PyTorch & JAX study inference infra like vLLM & SGLang build better eval pipelines learn how models run end-to-end" [X Link](https://x.com/TheAhmadOsman/status/1979238004066783650) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-17T17:28Z 26.8K followers, 55.4K engagements "for inference RTX PRO 6000 DGX Spark in a 4000 token chat: RTX PRO 6000 is 67x faster while only 1.8x more expensive DGX Spark took XXX sec vs XX sec on Llama XXX 8B and XX min vs XXX sec on Llama XXX 70B LLM inference is memorybound: 1792 GB/s vs XXX GB/s" [X Link](https://x.com/TheAhmadOsman/status/1979408446534398403) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T04:45Z 26.8K followers, 12.5K engagements "new favorite meme dropped will quote rt papers with it" [X Link](https://x.com/TheAhmadOsman/status/1979651813793226993) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-18T20:52Z 26.8K followers, 126.7K engagements "all the snarky replies i get about how local models dont stand a chance make one thing clear people are still judging based on LLaMA X if they touched Qwen X 32B or 30BA3B for even a second theyd realize theyre stuck in 2023 open models have gotten SO GOOD" [X Link](https://x.com/TheAhmadOsman/status/1980402260913058260) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-20T22:34Z 26.8K followers, 6601 engagements "@SpaceWelder314 Glm XXX Air and GPT OSS" [X Link](https://x.com/TheAhmadOsman/status/1980452321650962922) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T01:53Z 26.8K followers, XXX engagements "born too late to explore Earth born too early to explore the Stars born just in time to Buy GPUs before DeepSeek drops AGI" [X Link](https://x.com/TheAhmadOsman/status/1980459157242392672) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T02:20Z 26.8K followers, 4653 engagements "this dude gets it Buy a GPU" [X Link](https://x.com/TheAhmadOsman/status/1980708787972612278) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T18:52Z 26.8K followers, 5946 engagements "@SpaceWelder314 No could be a goldmine" [X Link](https://x.com/TheAhmadOsman/status/1980724050621354076) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T19:53Z 26.8K followers, 5594 engagements "@_thomasip Given a tight budget you care more about the TFLOPs You will also wanna do PCIe Gen. X at x16 per GPU so you're looking at a Server Platform build Trade-offs on a tight budget" [X Link](https://x.com/TheAhmadOsman/status/1980745495124529158) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T21:18Z 26.8K followers, XXX engagements "@mindspark42 they're collecting your data" [X Link](https://x.com/TheAhmadOsman/status/1980749535342215238) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-21T21:34Z 26.8K followers, 8693 engagements "the Tenstorrent QuietBox Blackhole is a XXX Tb/s Ethernet mesh that pools memory and scales almost linearly when you daisychain more boxes the TT-QuietBox Blackhole comes with XX lbs liquid-cooled chassis AMD EPYC 8124P 16c/32t XXX GB DDR5 ECC X TB NVMe ASRock Rack SIENAD82L2T w/ 2x XX GbE + IPMI 4x Blackhole p150c cards totalling: XXX Tensix Cores XX big RISC-V cores XXX GB GDDR6 XXX MB OnChip SRAM XXX Tb/s Ethernet mesh 16x QSFPDD 800G ports for cardcard comms 8x passive directattach copper (DAC) cables (0.6m) all of this is powered by a single 1650W Platinum PSU passively cooled ready to" [X Link](https://x.com/TheAhmadOsman/status/1980799378261447097) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T00:52Z 26.8K followers, 6135 engagements "@thatmarketsguy You wouldn't be able to get a good Agentic model running on 8GB VRAM unfortunately But you can still get a lot of good performing models that aren't Agentic necessarily up and running just fine" [X Link](https://x.com/TheAhmadOsman/status/1980815023829291283) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T01:54Z 26.8K followers, XXX engagements "@aaryan_kakad yup ebay will be too expensive find a trusted seller (4+ sales) on r/hardwareswap and buy it from them mostly either AI folks or gamers" [X Link](https://x.com/TheAhmadOsman/status/1980816640590934176) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T02:01Z 26.8K followers, XXX engagements "when Codex says its working on resolving the dependency issues in my Python env" [X Link](https://x.com/TheAhmadOsman/status/1980824936500314341) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T02:34Z 26.8K followers, 3945 engagements "@rogerscissp I really don't wanna anger anyone But if you cannot get local LLMs to work it's a skill isue" [X Link](https://x.com/TheAhmadOsman/status/1980834743382929556) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T03:12Z 26.8K followers, XXX engagements "@joe_ptrkv_ch Evga Dell Founder Edition Asus" [X Link](https://x.com/TheAhmadOsman/status/1980844293783974043) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T03:50Z 26.8K followers, XXX engagements "the OpenAI fanboys replying to my ChatGPT Atlas tweet are insufferable i won't reply to all of that so consider this my X reply: you won go install it just don't come back crying when your job is automated lol" [X Link](https://x.com/TheAhmadOsman/status/1980992804450173233) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T13:41Z 26.8K followers, 1473 engagements "@Creatify_AI Doesn't seem like @grok algo likes it" [X Link](https://x.com/TheAhmadOsman/status/1981082065996046549) [@TheAhmadOsman](/creator/x/TheAhmadOsman) 2025-10-22T19:35Z 26.8K followers, XX engagements
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]
Ahmad posts on X about open ai, all the, gpu, $googl the most. They currently have XXXXXX followers and XXX posts still getting attention that total XXXXXXX engagements in the last XX hours.
Social category influence technology brands #2269 stocks #2446 finance XXXX% social networks XXXX% countries XXXX% celebrities XXX%
Social topic influence open ai #30, all the #1027, gpu #87, $googl 2.38%, debt #268, gpus #18, ebay #625, macbook #440, token #375, money XXXX%
Top accounts mentioned or mentioned by @elliotarledge @mjyoke1111 @anyconvo @spacewelder314 @wreckedbymemes @rasbt @f1yingbanana @leeleepenkman @chilly99705 @josephstigler @rogerscissp @qasimstatic @futuristasi @thexeophon @jameslee1033176 @danielsamanez3 @dogzrcoolnstuff @balatriankid @myainotez @architectloop
Top assets mentioned Alphabet Inc Class A (GOOGL) Costco Wholesale Corporation (COST) Dell Technologies, Inc. (DELL)
Top posts by engagements in the last XX hours
"never have been more proud to be self-hosting my infra and toolings zero impact from the AWS outage only found out because the timeline is crashing out local infra supremacy 🫡"
X Link @TheAhmadOsman 2025-10-20T17:38Z 26.8K followers, 4366 engagements
"lol lmao even ollama are lying through their teeth in this reply to me next tweet i'll show the llama cpp merge for gpt-oss to ollama some comments on the merge calling them out llama cpp developer remarks"
X Link @TheAhmadOsman 2025-09-07T04:29Z 26.6K followers, 54K engagements
"this is why we Buy GPUs GLM-4.6 is a KINO Agentic model daily driver within Claude Code"
X Link @TheAhmadOsman 2025-10-02T23:13Z 26.6K followers, 13.6K engagements
"- local llms XXX - tired of guides that just tell you to run a script and call it a day - want to actually know what your GPU is doing not just trust a black box - here's what really happens when you run a local LLM - what gets loaded why and how it all fits together - no gatekeeping just the real explanations nobody gives you - the elite don't want you to know this - running a model = inference (using model weights) - inference = predicting the next token based on your input plus all tokens generated so far - together these make up the "sequence" - tokens words - they're the chunks"
X Link @TheAhmadOsman 2025-10-05T06:57Z 26.6K followers, 59.9K engagements
"HUGE a new Agentic coding model fits on 4x RTX 3090s @ 4-bit fully local KAT-Dev-72B-Exp by Kwaipilot - Claude Code setup guide included - ranks #2 on SWE-Bench Verified - excels at long-horizon coding + tool-use - multi-stage tuned: Mid-Training SFT + RFT Agentic RL"
X Link @TheAhmadOsman 2025-10-10T11:13Z 26.6K followers, 26.4K engagements
"local AI is a UX problem we're gonna fix it"
X Link @TheAhmadOsman 2025-10-10T18:11Z 26.6K followers, 5722 engagements
"- when to use RAG vs fine-tuning (the decision table) - new facts fast-changing docs per-tenant data RAG - tone/style/formatting tool-use behavior task following SFT fine-tune - logic/chain patterns preference shaping (less about facts) Preference tuning (RLHF/RLVR) - do both: RAG for facts fine-tune for how to speak/reason - fine-tuning: the X most common flavors - SFT (supervised fine-tuning): input target output pairs - use for formats style multi-step demos tool calls - Preference tuning (RLHF/RLVR variants): pairs (good vs bad response) - use for which answer feels better/safer - Task"
X Link @TheAhmadOsman 2025-10-12T07:26Z 26.6K followers, 22.7K engagements
"If AMD is serious about AI they need to do it the Mark Zuckerberg way and poach the CUDA devs from NVIDIA"
X Link @TheAhmadOsman 2025-10-12T07:40Z 26.6K followers, 41.4K engagements
"all the gigawatts mean nothing if Google gets there first and Google has everything to beat not only OpenAI but NVIDIA as well"
X Link @TheAhmadOsman 2025-10-13T18:57Z 26.6K followers, 7582 engagements
"Codex CLI with GPT-5 High for planning Claude Code with GLM-4.6 for execution make sure to INSTRUCT both to follow a modular project structure with Domain-Driven Design architecture this is THE ULTIMATE RECIPE for Agentic coding once you get it youll never look back"
X Link @TheAhmadOsman 2025-10-14T15:08Z 26.6K followers, 8536 engagements
"a reminder that in closed source AI from companies like OpenAI & Anthropic you have zero control over how the models behave and they can quantize it distill it hot-swap to a cheaper/weaker checkpoint make the model manipulative fine-tune it in ways that break safety or depth drop its IQ run experiments on you and/or your data throttle output speed or raise prices sunset the entire model/version they have all the knobs & you're at their mercy you won't even get a changelog opensource FTW Buy a GPU"
X Link @TheAhmadOsman 2025-10-14T21:28Z 26.6K followers, 24.5K engagements
"pre-launch teasers continue the tier list is done Buy a GPU The Movement"
X Link @TheAhmadOsman 2025-10-17T20:05Z 26.6K followers, 7083 engagements
"theres what to buy what to avoid and everything in between this tier list will cover it all"
X Link @TheAhmadOsman 2025-10-17T21:20Z 26.6K followers, 1257 engagements
"how do we get to small specialized models by doing exactly this great breakdown from @myainotez"
X Link @TheAhmadOsman 2025-10-18T03:25Z 26.6K followers, 3086 engagements
"- Xerox - invented it all - got $X million in Apple stock - entire company gone by 2000 - thanks for your service - Steve Jobs - LSD still active - go to Xerox PARC - see a mouse windows GUI - brain goes "holy shit" - say were taking this - engineers: wait what do you mea - its already in the Mac prototype - everyone claps - reinvented computing - by watching someone elses demo - Jobs: good artists copy great artists steal - Bill Gates - see Mac - see Jobs demo it - looks suspiciously like Xerox - pretend to be impressed - hey Steve wed love to build Excel for the Mac - secretly building"
X Link @TheAhmadOsman 2025-10-18T16:11Z 26.8K followers, 45.2K engagements
"@samsja19 one of the most rational takes i've read on this interview XXX% agree"
X Link @TheAhmadOsman 2025-10-20T18:35Z 26.8K followers, 1957 engagements
"@Yuchenj_UW i really really don't understand how google just keeps on fumbling like this"
X Link @TheAhmadOsman 2025-10-21T18:24Z 26.8K followers, XXX engagements
"i will not be installing ChatGPT Atlas browser from OpenAI and neither should you"
X Link @TheAhmadOsman 2025-10-21T19:16Z 26.8K followers, 401.8K engagements
"for inference give me a single RTX PRO 6000 over 3x RTX 5090s any day for training flip it 3x RTX 5090s clears the RTX PRO 6000 off the map different workloads different kings Buy a GPU The Movement"
X Link @TheAhmadOsman 2025-10-21T20:56Z 26.8K followers, 14K engagements
"@_thomasip No you care more about the VRAM & Bandwidth A single RTX PRO 6000 gives you 96GB to offload a model onto and there will be non communications between PCIe lanes during inference (if a model fits ofc)"
X Link @TheAhmadOsman 2025-10-21T21:21Z 26.8K followers, XXX engagements
"@elliotarledge and samplers god if people only tried to understand samplers and context budget"
X Link @TheAhmadOsman 2025-10-22T03:17Z 26.8K followers, XXX engagements
"My house has XX GPUs. 21x RTX 3090s 4x RTX 4090s 4x RTX 5090s 4x Tenstorrent Blackhole p150a Before AGI arrives: Acquire GPUs. Go into debt or sell your kidneys if you must. But whatever you do secure the GPUs"
X Link @TheAhmadOsman 2025-10-22T03:43Z 26.8K followers, 56.7K engagements
"qwen XXX 14B for under $XXX becomes SoTA BEATS OpenAI DeepResearch Claude Research MATCHES performance of Gemini XXX Pro train your own DeepResearch model following this tutorial & beat frontier labs State of The Art LLMs"
X Link @TheAhmadOsman 2025-09-02T22:38Z 26.8K followers, 102.5K engagements
"ollama alternatives lmstudio llama.cpp exllamav2/v3 vllm sglang among many others like literally anything is better than ollama lmao"
X Link @TheAhmadOsman 2025-09-03T01:53Z 26.7K followers, 125K engagements
"HUGE Kimi K2-Instruct-0905: New Drop From Moonshot AI 1T MoE model 256k context 32B active params Strong on SWE-bench & terminal tasks Beats Qwen3 GLM DeepSeek on SWE tasks Tool use agentic coding FP8 vLLM-ready"
X Link @TheAhmadOsman 2025-09-05T05:32Z 26.8K followers, 24.6K engagements
"nvidia market cap all pharma combined"
X Link @TheAhmadOsman 2025-10-03T23:36Z 26.8K followers, 6604 engagements
"raw alpha drop phb soak test slimsas mcio host adapters cable length retimers signal integrity airflow plan front-to-back cooling repadding vram thermals rackmount room hvac power budget 1200w psu sizing transients add2psu 240v circuits pdus power limiting gpu budget tier rtx 3090 ampere lane budget x16 gen4 cpu-direct lanes pcie bifurcation numa topology matrix checks vllm exllamav3 tp=2 tensor parallelism batch inference speculative decoding paged attention quantization w4a16 low-bit inference tokens per second throughput prefill vs decode tail latency validation diagnostics pcie"
X Link @TheAhmadOsman 2025-10-09T21:00Z 26.8K followers, 5334 engagements
"the top X most influential LLM releases that defined opensource AI LLaMA X Mistral 7B LLaMA X Qwen XXX DeepSeek R1"
X Link @TheAhmadOsman 2025-10-11T18:47Z 26.7K followers, 124.7K engagements
"3090s from X years ago are still $XXX on ebay nvidia cooked with Ampere and the improvements have been relatively marginal since then"
X Link @TheAhmadOsman 2025-10-12T06:27Z 26.7K followers, 3808 engagements
"- be us - Larry & Sergey - at Stanford with a crawler and a dream - accidentally organize the entire internet - call it Google - build search email maps docs OS phones browser car satellite thermostat AI lab TPU farm and quantum computer - 2025 - everyone talking about AGI - OpenAI: we need data sensors feedback and scale - us: staring at Google Maps YouTube Gmail Android Waymo Pixel Fitbit Docs Calendar Street View and Earth Engine - "damn. guess we already did that." - YouTube: 2.6M videos/day - Android: 3B phones streaming sensor data 24/7 - Gmail: 1.8B inboxes of human priors - Search:"
X Link @TheAhmadOsman 2025-10-13T19:58Z 26.8K followers, 122.4K engagements
"Someday and that day will come youll be out of VRAM and desperate for compute. And on that day youll remember who told you first to Buy a GPU. The Godfather of TFLOPs was right"
X Link @TheAhmadOsman 2025-10-14T22:40Z 26.8K followers, 3361 engagements
"always be VRAM & TFLOPs maxxing anon Buy a GPU"
X Link @TheAhmadOsman 2025-10-14T23:34Z 26.8K followers, 1947 engagements
"guyyyyyyyyyyyyyyys karpathy is here we made it"
X Link @TheAhmadOsman 2025-10-15T17:52Z 26.7K followers, 169.7K engagements
"LLM infra is in its Windows NT era right now"
X Link @TheAhmadOsman 2025-10-16T01:09Z 26.8K followers, 4267 engagements
"i woke up this morning with one thought on my mind Costco for GPUs"
X Link @TheAhmadOsman 2025-10-16T11:02Z 26.8K followers, 7766 engagements
"the Windows XP era of local LLMs is almost here were about to make local AI no headaches the new default soon youll fire up AI at home and boom it just works get ready its going to be glorious"
X Link @TheAhmadOsman 2025-10-16T12:31Z 26.8K followers, 4078 engagements
"this is what opensource LLM inference looks like in my head pure chaos lousy & misplaced integrations everything half-broken somehow still runs we're so early and there's a lot of work to do"
X Link @TheAhmadOsman 2025-10-17T23:05Z 26.8K followers, 2169 engagements
"uv pip install date-night"
X Link @TheAhmadOsman 2025-10-18T00:55Z 26.7K followers, 3800 engagements
"builds for different budgets and use cases (e.g. inference vs training) this will include what to buy where to buy it when and when not to buy it"
X Link @TheAhmadOsman 2025-10-19T21:42Z 26.7K followers, XXX engagements
"all the way to 14x GPUs builds"
X Link @TheAhmadOsman 2025-10-19T21:43Z 26.7K followers, XXX engagements
"the ultimate glossary on anything and everything you need to know about buying a GPU building an AI rig or server and running inference"
X Link @TheAhmadOsman 2025-10-19T21:44Z 26.7K followers, XXX engagements
"as well as straightforward short snippets about concepts that you need to know while putting together a build it's up to you which one to read if at all (the must reads to get a machine up and running will be pointed out)"
X Link @TheAhmadOsman 2025-10-19T21:45Z 26.7K followers, XXX engagements
"@JosephStigler is that the max would you wanna add more gpus later (and if so how much do you expect to spend on that later) for inference mainly or training"
X Link @TheAhmadOsman 2025-10-19T23:11Z 26.7K followers, XX engagements
"so if you're trying to be future-proof the best option is RTX PRO 6000 but that's $8k basically for GPU alone & 96GB of VRAM 2nd best option right now is 2x RTX 5090s 64GB of VRAM and then you can add more which will give you more TFLOPs as you add more I'd go for 2x 5090s with an Eypc 9004 build and DDR5"
X Link @TheAhmadOsman 2025-10-19T23:22Z 26.7K followers, XX engagements
"@opsided yeah there is a lot to running a local llm and the UX needs an overhaul in my opinion"
X Link @TheAhmadOsman 2025-10-19T23:24Z 26.6K followers, XX engagements
"@rogerscissp Local AI rocks doesn't it"
X Link @TheAhmadOsman 2025-10-20T01:48Z 26.8K followers, XXX engagements
"DGX Spark - Ahmad's Analysis downgraded to no buy X the cost of a 5090 for XX% FP4 perf crippled bandwidth (273 GB/s worse than a MacBook Pro)"
X Link @TheAhmadOsman 2025-08-28T23:29Z 26.8K followers, 55.5K engagements
"i cancelled my Claude subscription and you should too. why 1.58-bit quantized models during daytime plus not getting opus X in claude code max plans limits cut in half X weeks ago no comms weekly limits without concrete numbers 5x/20x plans being actually 3x/8x of plus DMCA takedowns of repos that have to do with Claude Code windsurf no access to claude X cutting off openai api access "DEGRADED QUALITY" models incidents that they only acknowledged after being called and out without providing any further information support or refunds X years retention of all conversations and code all data"
X Link @TheAhmadOsman 2025-09-07T04:04Z 26.8K followers, 335.8K engagements
"be me curious how X decides who goes viral and who gets shadowbanned into oblivion read the source code. all of it. 400000 lines it's a mess. it's a masterpiece. it's a threat model disguised as a social network proceed to get 7M impressions and 7k followers in X days i have seen the algorithm here's how to make it your slave X is a game rules are secret stakes are your visibility you win by: replying to replies (replyguymaxxing) baiting profile clicks (profilevisitmax) getting bookmarked like you're the Dead Sea Scrolls not getting blocked or muted (instant debuff) spacing tweets out"
X Link @TheAhmadOsman 2025-09-11T11:48Z 26.8K followers, 340.2K engagements
"- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs actually work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -"
X Link @TheAhmadOsman 2025-09-19T10:12Z 26.8K followers, 261K engagements
"My house has XX GPUs. 21x RTX 3090s 4x RTX 4090s 4x RTX 5090s 4x Tenstorrent Blackhole p150a Before AGI arrives: Acquire GPUs. Go into debt if you must. But whatever you do secure the GPUs"
X Link @TheAhmadOsman 2025-09-19T15:45Z 26.8K followers, 293.4K engagements
"- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs actually work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -"
X Link @TheAhmadOsman 2025-10-04T11:00Z 26.8K followers, 241.7K engagements
"China saved opensource LLMs between July 16th and today these are the major releases: DeepSeek V3.2 GLM-4.6 (335B-A32B) Qwen3-VL-30B-A3B (Instruct & Thinking) Qwen3-VL-235B-A22B (Instruct & Thinking) Qwen3-Next 80B-A3B (Instruct & Thinking) GLM-4.5V (VLM 106B-A12B) DeepSeek V3.1 Doubao 1.6-Vision (multimodal tool-calling) Doubao Translation XXX (ByteDance XX Languages) ERNIE X1.1 (Baidu Reasoning) Hunyuan-MT-7B & Chimera-7B (Tencent Translation Specialists) MiniCPM-V XXX (8B) Tiny but GPT-4o-level VLM InternVL XXX (MASSIVE Multimodal Family of Models 1B to 241B Sizes) Step-3 (VLM 321B/38B)"
X Link @TheAhmadOsman 2025-10-07T04:59Z 26.8K followers, 161.2K engagements
"do not use Ollama ggerganov wrote blazing-fast C++ inference (ggml llama.cpp) then Ollama wrapped it in a bloated binary and is now somehow the face of local LLMs soaking up VC hype and it's not even a good wrapper lol"
X Link @TheAhmadOsman 2025-10-07T11:05Z 26.8K followers, 134.3K engagements
"ollama alternatives lmstudio llama.cpp exllamav2/v3 vllm sglang among many others like literally anything is better than ollama lmao"
X Link @TheAhmadOsman 2025-10-08T06:59Z 26.8K followers, 23.8K engagements
"- you are - a random CS grad with X clue how LLMs work - get tired of people gatekeeping with big words and tiny GPUs - decide to go full monk mode - X years later i can explain attention mechanisms at parties and ruin them - heres the forbidden knowledge map - top to bottom how LLMs actually work - start at the beginning - text tokens - tokens embeddings - you are now a floating point number in 4D space - vibe accordingly - positional embeddings: - absolute: i am position X - rotary (RoPE): i am a sine wave - alibi: i scale attention by distance like a hater - attention is all you need -"
X Link @TheAhmadOsman 2025-10-13T15:36Z 26.8K followers, 57.8K engagements
"benchmarking anything against Ollama is criminal"
X Link @TheAhmadOsman 2025-10-14T14:06Z 26.8K followers, 5422 engagements
"this is how you map out a 2x GPU PCIe layout every builder needs to know this cold Buy a GPU is almost here and its lane-checking all your builds Buy a GPU The Movement"
X Link @TheAhmadOsman 2025-10-15T23:36Z 26.8K followers, 9909 engagements
"NVIDIA is now bigger than all the banks in the US and Canada combined"
X Link @TheAhmadOsman 2025-10-17T02:26Z 26.8K followers, 315.8K engagements
"- local llms XXX - running a model = inference (using model weights) - inference = predicting the next token based on your input plus all tokens generated so far - together these make up the "sequence" - tokens words - they're the chunks representing the text a model sees - they are represented by integers (token IDs) in the model - "tokenizer" = the algorithm that splits text into tokens - common types: BPE (byte pair encoding) SentencePiece - token examples: - "hello" = X token or maybe X or X tokens - "internationalization" = XX tokens - context window = max tokens model can "see" at once"
X Link @TheAhmadOsman 2025-10-18T17:16Z 26.8K followers, 30.7K engagements
"my timeline is full of GPUs everyone talking about GPUs everyone asking about GPUs everyone acquiring GPUs even Mac Studio truthers Buying GPUs even the ones who mocked me Buying GPUs our movement is winning this is the good timeline Buy a GPU The Movement"
X Link @TheAhmadOsman 2025-10-19T17:40Z 26.8K followers, 7655 engagements
"Someone: Just use the cloud. Me standing up in town hall: COMPUTE IS OUR BIRTHRIGHT OUR AGIs SHALL RUN LOCALLY"
X Link @TheAhmadOsman 2025-10-19T20:13Z 26.8K followers, 2389 engagements
"the Buy a GPU website & guide is launching this week so what should you expect"
X Link @TheAhmadOsman 2025-10-19T21:42Z 26.8K followers, 7911 engagements
"there will also be sections on software inference tools you should install and use"
X Link @TheAhmadOsman 2025-10-19T21:45Z 26.8K followers, 1308 engagements
"a lesson im glad i learned early opinions matter until they dont if your gut says go and no one else sees it trust it sometimes your instincts spot what others cant some of my best moves looked dumb at the time they werent drown out the doubt bet on the upside"
X Link @TheAhmadOsman 2025-10-20T00:04Z 26.8K followers, 1979 engagements
"me when i come across a tweet of someone buying a GPU to run LLMs locally"
X Link @TheAhmadOsman 2025-10-20T01:40Z 26.8K followers, 4548 engagements
"i believe in you whale"
X Link @TheAhmadOsman 2025-10-20T02:45Z 26.8K followers, 62.1K engagements
"your favorite LLMs wouldn't be down becsuse DNS issues at AWS if you hosted them locally Buy a GPU"
X Link @TheAhmadOsman 2025-10-20T09:33Z 26.8K followers, 7748 engagements
"kidneys to GPUs exchange market who's building this"
X Link @TheAhmadOsman 2025-10-20T18:52Z 26.8K followers, 2089 engagements
"dear diary its been XX hours since aws-us-east-1 vanished into the void half the internet apparently shares one data center lease and none of their computers are answering calls im starting to think the that cloud was just someone elses basement all along"
X Link @TheAhmadOsman 2025-10-20T20:21Z 26.8K followers, 4091 engagements
"cant write code because Cursor and Codex are both down thanks to the aws-us-east-1 outage tired of Anthropics weekly limits and nerfed models with one command and a few GPUs you can route Claude Code to your own local LLM with ZERO downtime Buy a GPU"
X Link @TheAhmadOsman 2025-10-20T22:54Z 26.8K followers, 18.4K engagements
"20 years from now liberal arts schools will offer full courses on the pre and post Buy a GPU eras syllabus will include: GPU as a cultural artifact compute nationalism XXX DeepSeek panic and the silicon rush local models & the fall of centralized AI"
X Link @TheAhmadOsman 2025-10-21T02:59Z 26.8K followers, 6807 engagements
"step-by-step LLM Engineering Projects each project = one concept learned the hard (i.e. real) way Tokenization & Embeddings build byte-pair encoder + train your own subword vocab write a token visualizer to map words/chunks to IDs one-hot vs learned-embedding: plot cosine distances Positional Embeddings classic sinusoidal vs learned vs RoPE vs ALiBi: demo all four animate a toy sequence being position-encoded in 3D ablate positionswatch attention collapse Self-Attention & Multihead Attention hand-wire dot-product attention for one token scale to multi-head plot per-head weight heatmaps mask"
X Link @TheAhmadOsman 2025-10-21T15:56Z 26.8K followers, 17.8K engagements
"imagine spending years locking down your data APIs just for OpenAI to ship a browser that collects personalized granular data at scale they're not just browsing theyre reverse engineering your pipelines if adopted widely this browser move is a KILLER move"
X Link @TheAhmadOsman 2025-10-21T17:58Z 26.8K followers, 82.5K engagements
"@witizenship I am willing to bet on @tenstorrent more than AMD now AMD aren't serious"
X Link @TheAhmadOsman 2025-10-21T21:43Z 26.8K followers, XXX engagements
"@TSpencer260 been chatting with @davorVDR a bunch of new architectures on the way and inference supports about to get a lot better theyre not standing still just cooking quietly from what ive seen"
X Link @TheAhmadOsman 2025-10-22T01:13Z 26.8K followers, XXX engagements
"best X opensource Agentic LLMs you can run at home GLM XXX Air GPT OSS 120B GPT OSS 20B these model excel at executing commands and running tasks in the background on your behalf and they can run on hardware ranging from 4x RTX 3090s ($3k) to 1x RTX 3090 ($700)"
X Link @TheAhmadOsman 2025-10-22T01:39Z 26.8K followers, 23.2K engagements
"@lifetimization it's a good model but not agentic (wouldn't run it inside claude code/codex cli)"
X Link @TheAhmadOsman 2025-10-22T03:15Z 26.8K followers, XXX engagements
"today this guy axes FAIR at Meta so this is a quick recap of his origin story and why he should not be the one making that decision Alexandr Wang born January 1997 age XX drop out of MIT co-found Scale AI "what if we label data but mid" convince every LLM company that this is fine 20162023 flood the market with barely-labeled goat photos and out-of-context Reddit takes call it foundational data raise billions valuation hits $7.3B everyone claps 2025 sell Scale AI to Meta for $14B not a typo. fourteen. billion. dollars. join Meta as Chief AI Officer rename division to Meta Superintelligence"
X Link @TheAhmadOsman 2025-10-22T14:16Z 26.8K followers, 34.1K engagements
"some of anthropic rugpulls so far: X years retention of all conversations and code all data will be used for training 1.58-bit quantized models during daytime plus not getting opus X in claude code max plans limits cut in half X weeks ago no comms weekly limits without concrete numbers 5x/20x plans being actually 3x/8x of plus DMCA takedowns of repos that have to do with Claude Code one of which was my own personally windsurf no access to claude X cutting off openai api access what an absolutely horrible company"
X Link @TheAhmadOsman 2025-08-29T07:14Z 26.8K followers, 461.2K engagements
"there is a lot of MONEY in this add /.json at the end of any Reddit link and get the entire thread including all replies to the n-th depth and all the metadata as JSON and then use LLMs to extract/analyze/etc you can make so much $$$ from niche subreddits"
X Link @TheAhmadOsman 2025-09-07T06:55Z 26.8K followers, 1.1M engagements
"be me Larry Ellison own a database empire a sailboat and a disdain for poor people OpenAI wants to build AGI needs compute a lot of compute like $300B GPU cluster in a volcano levels of compute "Hello. I own Oracle Cloud. Also Im rich." sign $300B GPU hosting deal with OpenAI nothing ships. no GPUs installed. no fans spinning. Oracle stock: skyrockets to the moon my net worth: +$100B GPUs still imaginary OpenAI raises a $X TRILLION round yes with a T investors lining up like it's a Taylor Swift concert I invest whered I get the money from the $300B deal I signed with them yes. OpenAI"
X Link @TheAhmadOsman 2025-09-13T00:18Z 26.8K followers, 1.8M engagements
"vibe coders secure your systems with this tool in X easy step"
X Link @TheAhmadOsman 2025-09-26T07:04Z 26.8K followers, 469.6K engagements
"i built a simple tool that makes Claude Code work with any local LLM full demo: vLLM serving GLM-4.5 Air on 4x RTX 3090s Claude Code generating code + docs via my proxy X Python file + .env handles all requests nvtop showing live GPU load how it all works Buy a GPU"
X Link @TheAhmadOsman 2025-10-08T13:33Z 26.8K followers, 120.1K engagements
"quick vram math for llms (weights only) fp16 = X bytes/param w4a16 = XXX bytes/param exl2 = XXXXX / XXX / XXXXX / XXXXXX bytes/param for XXX / XXX / XXX / XXX bpw rough vram per model size: 7b: fp16 14gb w4a16 3.5gb exl2 (4.0bpw) 3.5gb 13b: fp16 26gb w4a16 6.5gb exl2 (4.0bpw) 6.5gb 34b: fp16 68gb w4a16 17gb exl2 (4.0bpw) 17gb 70b: fp16 140gb w4a16 35gb exl2 (4.0bpw) 35gb 236b: fp16 472gb w4a16 118gb exl2 (4.0bpw) 118gb 405b: fp16 810gb w4a16 202gb exl2 (4.0bpw) 202gb do the math: vram = params bytes_per_param ex: 70b @ w4a16 XX XXX XXX = XX x XXX bytes XX gb fits on X 24gb gpus with room for"
X Link @TheAhmadOsman 2025-10-12T16:30Z 26.8K followers, 14.8K engagements
"karpathy just released a new project in it and in under 8000 lines of code you get to: train the tokenizer using a new rust implementation pretrain a transformer llm on fineweb evaluate core score across a number of metrics midtrain on user-assistant conversations from smoltalk multiple choice questions tool use sft evaluate the chat model on world knowledge multiple choice (arc-e/c mmlu) math (gsm8k) code (humaneval) rl the model optionally on gsm8k with "grpo" efficient inference the model in an engine with kv cache simple prefill/decode tool use (python interpreter in a lightweight"
X Link @TheAhmadOsman 2025-10-13T15:56Z 26.8K followers, 181K engagements
"hell no i am not giving my ID to OpenAI lol this is just the beginning btw censorship ads nerfed models etc are all on the table once these companies win and that's why they CANNOT win your intelligence stack must be opensource and fully under your control Buy a GPU"
X Link @TheAhmadOsman 2025-10-14T18:53Z 26.8K followers, 59.8K engagements
"been asked a lot today about the NVIDIA DGX Spark also getting lots of reactions on this old post of mine about it hoping to get my hands on one soon to review and expand a bit further on what it's good for and what not (plus it's pretty ngl want X on my desk haha) stay tuned"
X Link @TheAhmadOsman 2025-10-15T01:29Z 26.8K followers, 14.4K engagements
"if i were starting today RTX PRO 6000 then X RTX 5090s then X RTX 3090s thats the GPU stack id chase in that exact order"
X Link @TheAhmadOsman 2025-10-16T14:46Z 26.8K followers, 16.7K engagements
"if youre in software pivot to GPUs now take on debt sell a kidney whatever but get your hands on that silicon the SaaS gold rush is over new money is compute the next king isnt code its hardware"
X Link @TheAhmadOsman 2025-10-17T00:14Z 26.8K followers, 23.5K engagements
"in 2025 your focus SHOULD NOT be CUDA the real bottlenecks are: data inference evals dataloaders infra in general want to get good mess with PyTorch & JAX study inference infra like vLLM & SGLang build better eval pipelines learn how models run end-to-end"
X Link @TheAhmadOsman 2025-10-17T17:28Z 26.8K followers, 55.4K engagements
"for inference RTX PRO 6000 DGX Spark in a 4000 token chat: RTX PRO 6000 is 67x faster while only 1.8x more expensive DGX Spark took XXX sec vs XX sec on Llama XXX 8B and XX min vs XXX sec on Llama XXX 70B LLM inference is memorybound: 1792 GB/s vs XXX GB/s"
X Link @TheAhmadOsman 2025-10-18T04:45Z 26.8K followers, 12.5K engagements
"new favorite meme dropped will quote rt papers with it"
X Link @TheAhmadOsman 2025-10-18T20:52Z 26.8K followers, 126.7K engagements
"all the snarky replies i get about how local models dont stand a chance make one thing clear people are still judging based on LLaMA X if they touched Qwen X 32B or 30BA3B for even a second theyd realize theyre stuck in 2023 open models have gotten SO GOOD"
X Link @TheAhmadOsman 2025-10-20T22:34Z 26.8K followers, 6601 engagements
"@SpaceWelder314 Glm XXX Air and GPT OSS"
X Link @TheAhmadOsman 2025-10-21T01:53Z 26.8K followers, XXX engagements
"born too late to explore Earth born too early to explore the Stars born just in time to Buy GPUs before DeepSeek drops AGI"
X Link @TheAhmadOsman 2025-10-21T02:20Z 26.8K followers, 4653 engagements
"this dude gets it Buy a GPU"
X Link @TheAhmadOsman 2025-10-21T18:52Z 26.8K followers, 5946 engagements
"@SpaceWelder314 No could be a goldmine"
X Link @TheAhmadOsman 2025-10-21T19:53Z 26.8K followers, 5594 engagements
"@_thomasip Given a tight budget you care more about the TFLOPs You will also wanna do PCIe Gen. X at x16 per GPU so you're looking at a Server Platform build Trade-offs on a tight budget"
X Link @TheAhmadOsman 2025-10-21T21:18Z 26.8K followers, XXX engagements
"@mindspark42 they're collecting your data"
X Link @TheAhmadOsman 2025-10-21T21:34Z 26.8K followers, 8693 engagements
"the Tenstorrent QuietBox Blackhole is a XXX Tb/s Ethernet mesh that pools memory and scales almost linearly when you daisychain more boxes the TT-QuietBox Blackhole comes with XX lbs liquid-cooled chassis AMD EPYC 8124P 16c/32t XXX GB DDR5 ECC X TB NVMe ASRock Rack SIENAD82L2T w/ 2x XX GbE + IPMI 4x Blackhole p150c cards totalling: XXX Tensix Cores XX big RISC-V cores XXX GB GDDR6 XXX MB OnChip SRAM XXX Tb/s Ethernet mesh 16x QSFPDD 800G ports for cardcard comms 8x passive directattach copper (DAC) cables (0.6m) all of this is powered by a single 1650W Platinum PSU passively cooled ready to"
X Link @TheAhmadOsman 2025-10-22T00:52Z 26.8K followers, 6135 engagements
"@thatmarketsguy You wouldn't be able to get a good Agentic model running on 8GB VRAM unfortunately But you can still get a lot of good performing models that aren't Agentic necessarily up and running just fine"
X Link @TheAhmadOsman 2025-10-22T01:54Z 26.8K followers, XXX engagements
"@aaryan_kakad yup ebay will be too expensive find a trusted seller (4+ sales) on r/hardwareswap and buy it from them mostly either AI folks or gamers"
X Link @TheAhmadOsman 2025-10-22T02:01Z 26.8K followers, XXX engagements
"when Codex says its working on resolving the dependency issues in my Python env"
X Link @TheAhmadOsman 2025-10-22T02:34Z 26.8K followers, 3945 engagements
"@rogerscissp I really don't wanna anger anyone But if you cannot get local LLMs to work it's a skill isue"
X Link @TheAhmadOsman 2025-10-22T03:12Z 26.8K followers, XXX engagements
"@joe_ptrkv_ch Evga Dell Founder Edition Asus"
X Link @TheAhmadOsman 2025-10-22T03:50Z 26.8K followers, XXX engagements
"the OpenAI fanboys replying to my ChatGPT Atlas tweet are insufferable i won't reply to all of that so consider this my X reply: you won go install it just don't come back crying when your job is automated lol"
X Link @TheAhmadOsman 2025-10-22T13:41Z 26.8K followers, 1473 engagements
"@Creatify_AI Doesn't seem like @grok algo likes it"
X Link @TheAhmadOsman 2025-10-22T19:35Z 26.8K followers, XX engagements
/creator/twitter::TheAhmadOsman