@HuggingPapers DailyPapers

DailyPapers posts on X about model, agentic, ai, llm the most. They currently have [------] followers and [---] posts still getting attention that total [------] engagements in the last [--] hours.

Engagements: [------] #

[--] Week [-------] +127%
[--] Month [-------] -44%
[--] Months [---------] +195%

Mentions: [--] #

[--] Week [--] -19%
[--] Month [---] -10%
[--] Months [-----] +230%

Followers: [------] #

[--] Week [------] +1.80%
[--] Month [------] +8.20%
[--] Months [------] +183%

CreatorRank: [-------] #

Social Influence

Social category influence technology brands 23.68% stocks 12.28% finance 2.63% travel destinations 1.75% cryptocurrencies 0.88%

Social topic influence model #786, agentic #232, ai 9.65%, llm #518, strong #3153, microsoft #2346, inference 4.39%, up to 3.51%, the first 3.51%, math 3.51%

Top accounts mentioned or mentioned by @huggingface @codewithimanshu @ysu_chatdata @kimimoonshot @googledeepmind @chrisuniverseb @wildpinesai @dsunitus @cryptodaaddy @calebfahlgren @barrakali @ghidorah_x @vsouthvpawv @alexwingfield_ @elangovankamesh @jrggllf

Top assets mentioned Microsoft Corp. (MSFT) Alphabet Inc Class A (GOOGL)

Top Social Posts

Top posts by engagements in the last [--] hours

"Microsoft just dropped VibeVoice on Hugging Face A novel framework generating expressive long-form multi-speaker conversational audio like podcasts from text. Synthesizes up to [--] minutes of speech with up to [--] distinct speakers https://huggingface.co/microsoft/VibeVoice-1.5B https://huggingface.co/microsoft/VibeVoice-1.5B"
X Link 2025-08-25T14:03Z 13.5K followers, 57.8K engagements

"Qwen just released Qwen3-VL their most powerful vision-language model on Hugging Face. It features comprehensive upgrades for visual perception reasoning and generation across diverse tasks. https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Instruct-GGUF https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Instruct-GGUF"
X Link 2025-11-02T04:37Z 13.4K followers, [----] engagements

"EvoCUA becomes #1 open-source Computer Use Agent Meituan's evolutionary framework achieves 56.7% on OSWorld by generating synthetic tasks running them in sandboxes and learning from failures"
X Link 2026-01-23T16:10Z 13.4K followers, [----] engagements

"iFSQ: Improving FSQ with [--] line of code Tencent Hunyuan team discovers the sweet spot for image generation is [--] bits. AR converges faster but diffusion achieves higher quality. The fix Replace tanh with 2.0*(1.6x)-1 to map Gaussian latents to uniform distribution and prevent activation collapse. https://twitter.com/i/web/status/2016242900405989428 https://twitter.com/i/web/status/2016242900405989428"
X Link 2026-01-27T20:12Z 13.5K followers, [----] engagements

"Qwen just released Qwen3-ASR on Hugging Face The most capable open-source speech recognition model yet supporting [--] languages & dialects with performance rivaling GPT-4o and Gemini"
X Link 2026-01-29T16:12Z 13.5K followers, [----] engagements

"Scaling Embeddings Scaling Experts Meituan's LongCat-Flash-Lite rethinks language model sparsity: allocate 30B+ params to embeddings instead of MoE experts. The result A 68.5B param model with only 3B active that beats MoE baselines in agentic & coding tasks"
X Link 2026-01-30T16:13Z 13.4K followers, [----] engagements

"OCRVerse by Meituan The first holistic OCR method that unifies text-centric and vision-centric OCR across [--] diverse scenariosfrom documents to charts web pages and molecules. Uses a novel two-stage SFT-RL training approach to outperform models 10-20x larger"
X Link 2026-01-30T20:08Z 13.4K followers, [----] engagements

"Reinforcement Learning via Self-Distillation SDPO converts rich textual feedback into dense learning signals without external teachers achieving [--] faster training and higher accuracy on code math and scientific reasoning"
X Link 2026-01-31T12:13Z 13.4K followers, [----] engagements

"ConceptMoE ByteDance's new paradigm shifts LLMs from token-level to adaptive concept-level processing. Dynamically merges similar tokens to allocate compute intelligently delivering 175% prefill speedups and +5.5 performance gains"
X Link 2026-01-31T16:12Z 13.3K followers, [----] engagements

"Self-Distillation Enables Continual Learning MIT & ETH Zurich researchers introduce SDFT for on-policy learning from demonstrations. It uses demonstration-conditioned models as their own teacher to reduce catastrophic forgetting outperforming SFT and enabling sequential skill learning. https://twitter.com/i/web/status/2017757611861446895 https://twitter.com/i/web/status/2017757611861446895"
X Link 2026-02-01T00:31Z 13.3K followers, [----] engagements

"LLMs that clean data agents that train themselves and Microsoft's new reasoning model This week's top AI papers on @huggingface (Feb 1-7): - Can LLMs Clean Up Your Mess Survey of data prep with LLMs - AgentFly: Fine-tuning LLM agents without fine-tuning LLMs - LongCat-Flash-Thinking-2601: 560B-parameter MoE reasoning model - rStar2-Agent by Microsoft: Agentic reasoning with code execution - Idea2Story: Automated scientific narrative generation - VibeVoice: 90-minute multi-speaker speech synthesis - daVinci-Dev: Agent-native mid-training for software engineering - Beyond Pass@1: Self-play for"
X Link 2026-02-01T14:09Z 13.3K followers, [----] engagements

"Linear representations shift during conversation New research shows that LLM representations evolve as you chat. What's 'factual' at the start can flip to 'non-factual' by the endchallenging static interpretability methods"
X Link 2026-02-01T16:11Z 13.4K followers, 12.3K engagements

"Golden Goose Nvidia's method to synthesize unlimited RLVR tasks from unverifiable internet text by converting reasoning-rich corpora into multiple-choice questions"
X Link 2026-02-03T00:25Z 13.4K followers, [----] engagements

"PISCES Annotation-free post-training for text-to-video models using Optimal Transport to align text and video embeddings. Dual OT-aligned rewards improve fidelity and prompt faithfulness across short and long video generators"
X Link 2026-02-03T04:38Z 13.4K followers, [----] engagements

"Paper: The first annotation-free reward supervision via Optimal Transport achieving state-of-the-art on VBench for both quality and semantic alignment. https://huggingface.co/papers/2602.01624 https://huggingface.co/papers/2602.01624"
X Link 2026-02-03T04:38Z 13.4K followers, [---] engagements

"Kimi K2.5: Visual Agentic Intelligence Moonshot just dropped Kimi K2.5 an open multimodal agentic model that parallelizes complex tasks across specialized sub-agents using Agent Swarmcutting latency by 4.5x while hitting SOTA across coding vision and reasoning"
X Link 2026-02-03T08:14Z 13.4K followers, [----] engagements

"NVIDIA just unleashed their MLPerf-tuned Qwen3-VL on Hugging Face A 235B parameter vision-language powerhouse with NVFP4 quantization built for record MLPerf v6.0 inference performance https://huggingface.co/nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4-MLPerf-Inference-Closed-V6.0 https://huggingface.co/nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4-MLPerf-Inference-Closed-V6.0"
X Link 2026-02-03T09:13Z 13.4K followers, 14.1K engagements

"Green-VLA A staged Vision-Language-Action framework for humanoid robots that achieves 69.5% first-item success on ALOHA (vs 35.6% baseline) and 71.8% on SimplerEnv through a five-stage curriculum from foundation models to RL alignment"
X Link 2026-02-03T12:14Z 13.3K followers, [----] engagements

"Vision-DeepResearch: First long-horizon multimodal deep-research MLLM Multi-turn multi-entity multi-scale visual/textual search with dozens of reasoning steps and hundreds of engine interactions. 8B & 30B-A3B models achieve SOTA on [--] benchmarks outperforming GPT-5 Gemini-2.5-Pro & Claude-4-Sonnet agents. https://twitter.com/i/web/status/2018721455060537519 https://twitter.com/i/web/status/2018721455060537519"
X Link 2026-02-03T16:21Z 13.4K followers, [----] engagements

"CodeOCR Vision language models can read code from images with 8x compression110 text tokens become just [--] visual tokens. The code stays recognizable while slashing compute costs"
X Link 2026-02-04T04:36Z 13.3K followers, [----] engagements

"NVIDIA just released GR00T N1.6 DROID on Hugging Face A Vision-Language-Action model for generalist humanoid robots. Achieves SOTA on simulation benchmarks and runs on the Fourier GR-1 robot"
X Link 2026-02-04T07:46Z 13.3K followers, [----] engagements

"Microsoft just released X-Reasoner on Hugging Face A vision-language model trained only on text that outperforms multimodal SOTA on reasoning benchmarks"
X Link 2026-02-04T09:46Z 13.3K followers, [----] engagements

"Quant VideoGen Solves the KV-cache memory bottleneck in autoregressive video generationreducing usage by 7x to fit on consumer GPUs with under 4% latency overhead while improving long-horizon consistency"
X Link 2026-02-05T04:38Z 13.3K followers, [----] engagements

"Achieves this through Semantic Aware Smoothing and Progressive Residual Quantization establishing a new Pareto frontier on LongCat Video HY WorldPlay and Self-Forcing benchmarks. https://huggingface.co/papers/2602.02958 https://huggingface.co/papers/2602.02958"
X Link 2026-02-05T04:38Z 13.3K followers, [---] engagements

"ERNIE [---] Baidu's trillion-parameter natively autoregressive foundation model that unifies multimodal understanding and generation across text image video and audio with ultra-sparse MoE and elastic training"
X Link 2026-02-05T08:15Z 13.4K followers, [----] engagements

"FASA: Frequency-aware Sparse Attention from Alibaba Discovers functional sparsity in RoPE frequency-chunks to dynamically predict token importance achieving nearly 100% full-KV performance on LongBench-V1 using only [---] tokens"
X Link 2026-02-05T12:14Z 13.5K followers, [----] engagements

"WideSeek-R1 Explores width scaling with multi-agent RL for broad information seeking. A 4B model matches DeepSeek-R1-671B performance with 170x fewer parameters. Performance scales consistently with more parallel subagents"
X Link 2026-02-05T16:19Z 13.3K followers, [----] engagements

"OmniSIFT A modality-asymmetric token compression framework for omni-modal LLMs that reduces token context by 75% while maintaining or exceeding full-token model performance. Uses spatio-temporal video pruning and vision-guided audio selection"
X Link 2026-02-06T00:24Z 13.5K followers, [----] engagements

"Context Forcing A novel framework for consistent long video generation that trains a long-context student via a long-context teacher eliminating the student-teacher mismatch that plagues existing streaming methods"
X Link 2026-02-06T20:10Z 13.3K followers, [----] engagements

"PaperBanana An agentic framework that automates creation of publication-ready academic illustrations. Orchestrates [--] specialized agents to transform scientific content into high-quality diagrams and statistical plots outperforming baselines across all metrics"
X Link 2026-02-07T00:23Z 13.3K followers, [----] engagements

"Paper: Project: Google Cloud AI Research and Peking University introduce PaperBananaBench with [---] NeurIPS [----] test cases. Code & dataset releasing soon. https://dwzhu-pku.github.io/PaperBanana/ https://huggingface.co/papers/2601.23265 https://dwzhu-pku.github.io/PaperBanana/ https://huggingface.co/papers/2601.23265"
X Link 2026-02-07T00:23Z 13.3K followers, [---] engagements

"UniReason [---] A unified reasoning framework from ByteDance that harmonizes image generation and editing through world knowledge-enhanced planning and self-reflective visual refinement mirroring human planning and refinement"
X Link 2026-02-07T04:35Z 13.3K followers, [----] engagements

"SWE-Universe Scales real-world software engineering environments to 807K+ multilingual instances from GitHub PRs using an agentic framework with iterative self-verification and hacking detection"
X Link 2026-02-07T08:09Z 13.3K followers, [----] engagements

"ReSID A recommendation-native tokenizer that rethinks representation learning and semantic quantization for generative recommenders introducing two novel components: FAMAE and GAOQ"
X Link 2026-02-07T16:15Z 13.5K followers, [----] engagements

"10 datasets tested: 10% avg improvement up to 122x tokenization speedup Dataset: Paper: https://huggingface.co/papers/2602.02338 https://huggingface.co/datasets/PIIR/ReSID-dataset https://huggingface.co/papers/2602.02338 https://huggingface.co/datasets/PIIR/ReSID-dataset"
X Link 2026-02-07T16:15Z 13.4K followers, [---] engagements

"FS-Researcher A file-system-based dual-agent framework that enables test-time scaling for long-horizon research tasks beyond context window limits. The file system serves as durable external memory allowing iterative refinement across agent sessions"
X Link 2026-02-07T20:10Z 13.4K followers, [----] engagements

"MemSkill Replaces rigid hand-crafted memory operations with learnable evolvable skills creating a self-improving closed-loop system for LLM agents"
X Link 2026-02-08T00:32Z 13.3K followers, [----] engagements

"HySparse A hybrid sparse attention architecture that interleaves full and sparse attention layers. Full layers serve as an oracle for token selection and KV cache sharing reducing memory by 10x while boosting performance over baselines"
X Link 2026-02-08T12:13Z 13.5K followers, [----] engagements

"This week's top AI research on @huggingface ERNIE [---] by Baidu: natively autoregressive foundation model for unified multimodal understanding and generation across text image video and audio with elastic training Green-VLA: staged vision-language-action framework for generalist robots with [----] hours of demonstrations achieving strong generalization across humanoids and manipulators Kimi K2.5 by Moonshot: visual agentic intelligence with Agent Swarm framework that dynamically parallelizes tasks reducing latency by 4.5x Vision-DeepResearch: multimodal deep-research capability with multi-turn"
X Link 2026-02-08T14:11Z 13.5K followers, [----] engagements

"This week's top AI research on @huggingface - ERNIE [---] by Baidu: natively autoregressive multimodal foundation model - Green-VLA: staged vision-language-action framework for generalist robots with 3k hours of demonstrations - Kimi K2.5 by @Kimi_Moonshot: visual agentic intelligence with Agent Swarm framework that dynamically parallelizes tasks reducing latency by 4.5x - PaperBanana by @GoogleDeepMind: automating academic illustration generation for AI scientists - Vision-DeepResearch: multimodal deep-research capability with multi-turn visual/textual search Read on"
X Link 2026-02-08T21:12Z 13.5K followers, [----] engagements

"On the Entropy Dynamics in Reinforcement Fine-Tuning A theoretical framework analyzing entropy evolution during RL fine-tuning of LLMs. Derives first-order expressions for entropy change extends to GRPO and proposes practical entropy-discriminator clipping methods"
X Link 2026-02-09T04:45Z 13.4K followers, [----] engagements

"Alibaba researchers provide the first principled understanding of entropy dynamics in RFT revealing why and how to control entropy for stable training. Paper: Framework: https://github.com/agentscope-ai/Trinity-RFT https://huggingface.co/papers/2602.03392 https://github.com/agentscope-ai/Trinity-RFT https://huggingface.co/papers/2602.03392"
X Link 2026-02-09T04:45Z 13.3K followers, [---] engagements

"OdysseyArena A new benchmark that reveals a critical bottleneck: even frontier LLMs struggle with long-horizon inductive reasoning. Agents must discover hidden rules from experience across four interactive environmentsnot just follow instructions"
X Link 2026-02-09T08:18Z 13.3K followers, [----] engagements

"Baichuan Inc released Baichuan-M3 A medical LLM that shifts from passive Q&A to active clinical decision support. Models physician workflows with proactive info gathering long-horizon reasoning and hallucination suppressionoutperforming GPT-5.2 on HealthBench"
X Link 2026-02-09T12:19Z 13.4K followers, [----] engagements

"MSign: A new optimizer from Microsoft researchers Prevents training instability in LLMs by restoring stable rank via matrix sign operations. Avoids gradient explosions with less than 7.0% computational overhead"
X Link 2026-02-10T00:30Z 13.4K followers, [----] engagements

"Modality Gap-Driven Subspace Alignment A novel training paradigm for MLLMs that tackles the persistent modality gap between vision and language. Introduces ReAlign (Anchor Trace Centroid Alignment) and ReVision for scalable training without expensive image-text pairs"
X Link 2026-02-10T04:49Z 13.4K followers, [----] engagements

"Paper: Code: ReAlign maps text representations into visual distributions using massive unpaired data decoupling MLLM training from dependence on costly image-text pairs. https://github.com/Yu-xm/ReVision https://huggingface.co/papers/2602.07026 https://github.com/Yu-xm/ReVision https://huggingface.co/papers/2602.07026"
X Link 2026-02-10T04:49Z 13.4K followers, [---] engagements

"NVIDIA just released Earth2Studio assets on Hugging Face A comprehensive collection of AI weather & climate model resources including GraphCast Pangu AIFS & more"
X Link 2026-02-10T08:14Z 13.4K followers, [----] engagements

"QuantaAlpha An evolutionary framework for LLM-driven alpha mining that discovers quantitative factors achieving 27.75% annualized return on CSI [---] with strong transfer to S&P [---] and CSI [---] markets"
X Link 2026-02-10T12:21Z 13.5K followers, [---] engagements

"Weak-Driven Learning A novel post-training paradigm where strong models improve by learning from weak agents like historical checkpoints. Achieves performance gains on math and code with zero additional inference cost"
X Link 2026-02-10T16:27Z 13.4K followers, [----] engagements

"Meta releases AIRS-Bench on Hugging Face A benchmark suite challenging AI agents with [--] tasks from SOTA ML papers across NLP math bioinformatics and more. Tests full research lifecycleidea generation experimentation iterative refinementwithout providing baseline code"
X Link 2026-02-10T20:17Z 13.4K followers, [----] engagements

"Recurrent-Depth VLA Replaces token-based reasoning with latent iterative refinement in VLA models achieving adaptive test-time compute with constant memory. Tasks failing at 0% with single-iteration reach 90% with four iterations while simpler tasks saturate quicklyup to [--] faster than previous methods. https://twitter.com/i/web/status/2021380976358638053 https://twitter.com/i/web/status/2021380976358638053"
X Link 2026-02-11T00:29Z 13.4K followers, [----] engagements

"OPUS A dynamic data selection framework for LLM pre-training that aligns with optimizer geometry (AdamW Muon) to overcome the "data wall". Achieves +2.2% accuracy gains with [--] compute reduction and only 4.7% overhead"
X Link 2026-02-11T04:49Z 13.4K followers, [----] engagements

"NVIDIA just dropped a massive kitchen robotics dataset on Hugging Face [---] hours of human-teleoperated demonstrations across [---] real-world tasks. https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-Kitchen-Sim-Demos https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-Kitchen-Sim-Demos"
X Link 2026-02-11T06:23Z 13.5K followers, 41K engagements

"Paper: Project: https://showlab.github.io/Olaf-World/ https://huggingface.co/papers/2602.10104 https://showlab.github.io/Olaf-World/ https://huggingface.co/papers/2602.10104"
X Link 2026-02-11T13:02Z 13.4K followers, [----] engagements

"Alibaba's Code2World A GUI world model that predicts next UI states via renderable code generation. It rivals GPT-5 & Gemini-3-Pro-Image on next UI prediction and boosts agent navigation success by +9.5% on AndroidWorld"
X Link 2026-02-11T16:26Z 13.5K followers, [---] engagements

"UI-Venus-1.5 by Ant Group A unified end-to-end GUI agent achieving state-of-the-art performance across benchmarks with robust real-world navigation for 40+ Chinese mobile apps"
X Link 2026-02-11T20:13Z 13.4K followers, [---] engagements

"Paper: Collection: Features 10B-token mid-training online RL and model merging. https://huggingface.co/collections/inclusionAI/ui-venus https://huggingface.co/papers/2602.09082 https://huggingface.co/collections/inclusionAI/ui-venus https://huggingface.co/papers/2602.09082"
X Link 2026-02-11T20:13Z 13.4K followers, [---] engagements

"Chain of Mindset A training-free framework that makes LLMs reason like humans by dynamically switching between four cognitive modes at each step. No more one-size-fits-all reasoning"
X Link 2026-02-12T00:27Z 13.5K followers, [----] engagements

"Paper: Code: https://github.com/QuantaAlpha/chain-of-mindset https://huggingface.co/papers/2602.10063 https://github.com/QuantaAlpha/chain-of-mindset https://huggingface.co/papers/2602.10063"
X Link 2026-02-12T00:27Z 13.4K followers, [---] engagements

"Paper: Model: Open source under Apache [---] runs on consumer hardware with 100-300 tok/s inference speed. https://huggingface.co/stepfun-ai/Step-3.5-Flash https://huggingface.co/papers/2602.10604 https://huggingface.co/stepfun-ai/Step-3.5-Flash https://huggingface.co/papers/2602.10604"
X Link 2026-02-12T04:44Z 13.5K followers, [----] engagements

"NVIDIA proposes PhyCritic A multimodal critic that unifies physical judging and reasoning. It uses self-referential evaluation: first generating its own physics-aware prediction as internal reference then judging candidate responses for improved stability"
X Link 2026-02-12T08:16Z 13.4K followers, [---] engagements

"Achieves 12-point gains On physical judgment over open-source baselines with strong generalization to general multimodal tasks. Self-referential finetuning drives performance. Project: Paper: https://huggingface.co/papers/2602.11124 https://research.nvidia.com/labs/lpr/phycritic/ https://huggingface.co/papers/2602.11124 https://research.nvidia.com/labs/lpr/phycritic/"
X Link 2026-02-12T08:16Z 13.4K followers, [---] engagements

"GENIUS: A benchmark for generative fluid intelligence Tests if multimodal models can handle gravity anomalies & visual metaphorschallenging them to induce patterns reason through constraints and adapt to novel scenarios beyond knowledge recall"
X Link 2026-02-12T12:18Z 13.5K followers, [----] engagements

"GRU-Mem by ByteDance Seed A gated recurrent memory framework for long-context reasoning. Uses two text-controlled gates: an update gate to prevent memory explosion and an exit gate for early termination. Achieves up to 400% inference speed acceleration via RL training"
X Link 2026-02-12T20:11Z 13.5K followers, [----] engagements

"Project page: Paper: https://huggingface.co/papers/2602.10560 https://alphalab-ustc.github.io/grumem-alphalab/ https://huggingface.co/papers/2602.10560 https://alphalab-ustc.github.io/grumem-alphalab/"
X Link 2026-02-12T20:11Z 13.5K followers, [---] engagements

"just released GLM-5 on Hugging Face 744B parameters with DeepSeek Sparse Attention and a novel async RL framework called slime. Best-in-class open-source performance on reasoning coding and agentic tasks. http://Z.ai http://Z.ai"
X Link 2026-02-12T21:45Z 13.5K followers, [----] engagements

"Towards Autonomous Mathematics Research Google's Aletheia generates verifies and revises mathematical proofs end-to-end. Solved [--] open problems wrote research papers without human calculation and evaluated 700+ problems using Gemini Deep Think"
X Link 2026-02-13T00:26Z 13.5K followers, [----] engagements

"Paper: Demonstrates first AI-generated proofs in arithmetic geometry and interacting particle systems plus a new taxonomy for quantifying AI autonomy levels in mathematical research. https://huggingface.co/papers/2602.10177 https://huggingface.co/papers/2602.10177"
X Link 2026-02-13T00:26Z 13.5K followers, [---] engagements

"Safety is Always Vanishing in Self-Evolving AI Societies New research proves multi-agent LLM systems cannot simultaneously achieve continuous self-improvement isolation and safety invariance. Statistical blind spots from isolated self-evolution irreversibly degrade safety alignment. https://twitter.com/i/web/status/2022169687912563193 https://twitter.com/i/web/status/2022169687912563193"
X Link 2026-02-13T04:43Z 13.5K followers, [----] engagements

"Learning beyond Teacher G-OPD is a generalized on-policy distillation framework that introduces reward extrapolation (ExOPD). By increasing the reward scaling factor beyond [--] students can surpass teacher performance in math reasoning and code generation"
X Link 2026-02-13T08:16Z 13.5K followers, [----] engagements

"Microsoft just released InfoAgent on Hugging Face RE-TRAC is a recursive trajectory compression framework that outperforms ReAct by 15-20% on BrowseComp"
X Link 2026-02-05T20:45Z 13.5K followers, [----] engagements

"VidVec Your MLLM already contains strong video representations. VidVec unlocks them for zero-shot video-text retrieval without any training beating trained models by up to 9.4% recall"
X Link 2026-02-14T00:25Z 13.5K followers, [----] engagements

"State-of-the-art on GAIA GPQA HLE and FrontierScience. Performs both algorithm discovery and wet lab experiments across multiple scientific domains. Paper: Collection: https://huggingface.co/collections/InternScience/internagent https://huggingface.co/papers/2602.08990 https://huggingface.co/collections/InternScience/internagent https://huggingface.co/papers/2602.08990"
X Link 2026-02-14T04:36Z 13.5K followers, [---] engagements

"SkillRL: Evolving Agents via Recursive Skill-Augmented RL A framework proving that skills beat scale. It enables LLM agents to automatically discover and evolve reusable skills from past experiences letting a 7B model beat GPT-4o while reducing token usage"
X Link 2026-02-14T08:09Z 13.5K followers, [----] engagements

"TermiGen Trains robust terminal agents by synthesizing 3500+ verified Docker environments and injecting errors into trajectories. Achieves 31.3% on TerminalBench establishing new open-weights SOTA and outperforming GPT-4o-mini"
X Link 2026-02-14T12:09Z 13.5K followers, [---] engagements

"LLaDA2.1 by Alibaba's Ant Group A token editing breakthrough for diffusion LLMs via joint threshold-decoding. The 100B Flash model hits [---] TPS on HumanEval+; a 16B Mini version balances speed and quality"
X Link 2026-02-14T16:08Z 13.5K followers, [----] engagements

"ByteDance Seed is back with SeedVR2 now on Hugging Face This one-step video restoration model leverages diffusion adversarial post-training for impressive results even on high-resolution videos"
X Link 2025-06-08T17:20Z 13.5K followers, 55.3K engagements

"DeepGen [---] A lightweight 5B unified multimodal model that outperforms 80B+ giants like HunyuanImage by 28% on WISE and Qwen-Image-Edit by 37% on UniREditBenchproving scale isn't everything"
X Link 2026-02-13T20:12Z 13.5K followers, [----] engagements

"Microsoft just released the VITRA Teleoperation Dataset on Hugging Face Real-world robot demos with 7-DoF arm dexterous hand & head-mounted camera. Each episode includes synchronized video + state/action data for training vision-language-action models"
X Link 2026-02-14T08:44Z 13.5K followers, [----] engagements

"P1-VL from Shanghai AI Lab The first open-source vision-language model to secure [--] gold medals on the HiPhO physics benchmark. P1-VL-235B-A22B outperforms GPT-5 and Gemini-2.5-Pro ranking No.2 globally with PhysicsMinions"
X Link 2026-02-14T20:10Z 13.5K followers, [----] engagements

"Snowflake releases Agent World Model [----] synthetic code-driven environments for agentic RL. Unlike LLM-simulated worlds these provide reliable state transitions and stable learning signals. Scales to 35K tools and 10K tasks with real SQLite databases"
X Link 2026-02-15T04:43Z 13.5K followers, 11.3K engagements

"The Data Wall is here. OPUS just built a ladder. Most upvoted papers on @huggingface this week (Feb 9-15): - OPUS (309 upvotes): Dynamic data selection framework tackling the Data Wall - Weak-Driven Learning (251): Making strong agents stronger via weak checkpoints - TermiGen (196): High-fidelity training for terminal agents - Code2World (Alibaba 186): GUI world model via renderable code - The Devil Behind Moltbook (182): Why Anthropic safety vanishes in self-evolving AI - QuantaAlpha (180): Evolutionary framework for financial alpha mining - Step [---] Flash (173): Frontier intelligence with"
X Link 2026-02-15T14:11Z 13.5K followers, [----] engagements

"Paper: Achieves state-of-the-art narrative alignment with negligible overhead by treating emotion as narrative compression not requiring dense attention or architectural cloning. https://huggingface.co/papers/2602.09070 https://huggingface.co/papers/2602.09070"
X Link 2026-02-15T16:08Z 13.5K followers, [---] engagements

"Model: Live demo: https://dotsocr.xiaohongshu.com https://huggingface.co/rednote-hilab/dots.ocr-1.5 https://dotsocr.xiaohongshu.com https://huggingface.co/rednote-hilab/dots.ocr-1.5"
X Link 2026-02-15T19:55Z 13.5K followers, [---] engagements

"Find it here: Features unified streaming/offline inference novel forced alignment and handles everything from speech to singing. https://huggingface.co/Qwen/Qwen3-ASR-1.7B https://huggingface.co/Qwen/Qwen3-ASR-1.7B"
X Link 2026-01-29T16:12Z 13.5K followers, [---] engagements

"Paper: Trained Qwen3 (4B/8B/14B) with GRPO on [----] environments: +12.11 points on BFCLv3 (8B) competitive on -bench best on MCP-Universe. AWM is the ONLY method that improves on ALL three OOD benchmarks. Models & dataset on Hugging Face. https://huggingface.co/papers/2602.10090 https://huggingface.co/papers/2602.10090"
X Link 2026-02-15T04:43Z 13.5K followers, [----] engagements

"Towards Agentic Intelligence for Materials Science A comprehensive survey charting the roadmap from passive predictors to autonomous LLM agents that plan act and learn across the full materials discovery loopintegrating simulation robotic labs and experimental platforms"
X Link 2026-02-15T08:10Z 13.5K followers, [----] engagements

"@huggingface Explore them here: http://hf.co/papers/week/2026-W07 http://hf.co/papers/week/2026-W07"
X Link 2026-02-15T14:11Z 13.5K followers, [---] engagements

"Our comprehensive recipe for building medical MLLMs includes tool-augmented agentic training and low-hallucination report generation. https://huggingface.co/papers/2602.12705 https://huggingface.co/papers/2602.12705"
X Link 2026-02-17T00:25Z 13.5K followers, [---] engagements

"NVIDIA just released Music Flamingo Think on Hugging Face Chain-of-thought reasoning for deep music understanding SOTA on 10+ benchmarks with theory-aware analysis of harmony structure timbre & lyrics https://huggingface.co/nvidia/music-flamingo-think-2601-hf https://huggingface.co/nvidia/music-flamingo-think-2601-hf"
X Link 2026-02-17T04:37Z 13.5K followers, [---] engagements

"Composition-RL A novel RL approach that automatically composes multiple problems into new verifiable questions tackling the data bottleneck when easy prompts dominate training and improving reasoning across 4B-30B models"
X Link 2026-02-13T16:17Z 13.5K followers, [---] engagements

"InternAgent-1.5 A unified agentic framework for long-horizon autonomous scientific discovery. It coordinates generation verification and evolution subsystems to compress weeks of research into minutes across biology earth science and materials"
X Link 2026-02-14T04:36Z 13.5K followers, 11.3K engagements

"Paper: Dataset: https://huggingface.co/datasets/microsoft/VITRA-TeleData https://huggingface.co/papers/2510.21571 https://huggingface.co/datasets/microsoft/VITRA-TeleData https://huggingface.co/papers/2510.21571"
X Link 2026-02-14T08:45Z 13.5K followers, [---] engagements

"Paper: Introduces RAMP (Reinforcement leArning via world Model-conditioned Policy) for robust cross-task adaptation. Project page: https://gigabrain05m.github.io/ https://huggingface.co/papers/2602.12099 https://gigabrain05m.github.io/ https://huggingface.co/papers/2602.12099"
X Link 2026-02-15T12:10Z 13.5K followers, [----] engagements

"NarraScore A framework that repurposes Vision-Language Models as affective sensors to generate soundtracks that understand video narratives. Uses dual-branch injection to align musical dynamics with story arcsfrom chase scenes to bittersweet momentssolving semantic blindness"
X Link 2026-02-15T16:08Z 13.5K followers, [---] engagements

"rednote-hilab just released dots.ocr-1.5 on Hugging Face A 3B-parameter multimodal OCR model that recognizes virtually any human script achieves SOTA multilingual document parsing and converts charts/diagrams directly into SVG code"
X Link 2026-02-15T19:55Z 13.5K followers, [----] engagements

"TurningPoint-GRPO Alibaba's new framework tackles sparse rewards in flow matching models with step-level incremental rewards. It identifies turning pointssteps that flip the local reward trendto capture long-term dependencies all detected via sign changes (hyperparameter-free). https://twitter.com/i/web/status/2023127165974253895 https://twitter.com/i/web/status/2023127165974253895"
X Link 2026-02-15T20:07Z 13.5K followers, [----] engagements

"Paper: Code: https://github.com/YunzeTong/TurningPoint-GRPO https://huggingface.co/papers/2602.06422 https://github.com/YunzeTong/TurningPoint-GRPO https://huggingface.co/papers/2602.06422"
X Link 2026-02-15T20:08Z 13.5K followers, [---] engagements

"Qwen just dropped a 397B parameter multimodal beast on Hugging Face Native vision-language model with 262K context window Early fusion training Gated Delta Networks and [---] languages https://huggingface.co/Qwen/Qwen3.5-397B-A17B https://huggingface.co/Qwen/Qwen3.5-397B-A17B"
X Link 2026-02-16T12:48Z 13.5K followers, [----] engagements

"Paper: Demo: FAC reveals a shared feature space across LLaMA Mistral and Qwen enabling cross-model knowledge transfer. [---] data efficiency. https://huggingface.co/spaces/Zhongzhi1228/synthesis-demo https://huggingface.co/papers/2602.10388 https://huggingface.co/spaces/Zhongzhi1228/synthesis-demo https://huggingface.co/papers/2602.10388"
X Link 2026-02-16T16:17Z 13.5K followers, [---] engagements

"Qute: Towards Quantum-Native Database A system that compiles SQL into quantum circuits and runs on real quantum processors. Features hybrid optimization selective quantum indexing and fidelity-preserving storage. Outperforms classical baselines at scale"
X Link 2026-02-17T04:40Z 13.5K followers, [---] engagements

"Open-source prototype: First to treat quantum computation as first-class execution not just simulation. https://github.com/weAIDB/Qute https://huggingface.co/papers/2602.14699 https://github.com/weAIDB/Qute https://huggingface.co/papers/2602.14699"
X Link 2026-02-17T04:40Z 13.5K followers, [---] engagements

"NVIDIA just released PhysicalAI kitchen assets on Hugging Face Interactive 3D appliances cookware & household objects to train embodied AI agents in realistic simulations. https://huggingface.co/datasets/nvidia/PhysicalAI-Kitchen-Assets https://huggingface.co/datasets/nvidia/PhysicalAI-Kitchen-Assets"
X Link 2026-02-17T05:51Z 13.5K followers, [----] engagements

"Olaf-World learns transferable actions from unlabeled video We introduce Seq-REPA aligning latent actions to observable visual effects across contexts. This enables zero-shot action transfer and data-efficient adaptation for video world models"
X Link 2026-02-11T13:02Z 13.5K followers, 11.2K engagements

"StepFun's Step [---] Flash A sparse MoE model with 196B parameters 11B active per token. Achieves frontier-level reasoning comparable to GPT-5.2 xHigh and Gemini [---] Pro at 1/6th the decoding cost. Ranks #1 on MathArena with 97.3% on AIME 2025"
X Link 2026-02-12T04:44Z 13.5K followers, 19K engagements

"Models & data: Paper: Key innovation: Stacked Channel Bridging framework + three-stage training delivers omni-capabilities with just 50M samples https://huggingface.co/papers/2602.12205 https://huggingface.co/deepgenteam/DeepGen-1.0 https://huggingface.co/papers/2602.12205 https://huggingface.co/deepgenteam/DeepGen-1.0"
X Link 2026-02-13T20:12Z 13.5K followers, [----] engagements

"RLinf-USER A unified system that treats physical robots as first-class hardware resources alongside GPUs enabling automatic discovery management and scheduling of heterogeneous robots for scalable real-world online policy learning in embodied AI"
X Link 2026-02-15T00:27Z 13.5K followers, [----] engagements

"GigaBrain-0.5M* A VLA model trained with world model-based reinforcement learning. Achieves near-perfect success on complex manipulation tasks and ranks #1 on RoboChallenge"
X Link 2026-02-15T12:10Z 13.5K followers, 10K engagements

"Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Instead of generating 300K samples like MAGPIE FAC identifies missing internal features and generates just 2K targeted samplesachieving the same performance on AlpacaEval 2.0"
X Link 2026-02-16T16:17Z 13.5K followers, [----] engagements

"SQuTR: Spoken Query Retrieval Benchmark A robustness benchmark for spoken query retrieval under realistic acoustic noise. 37K+ bilingual queries across [--] domains synthesized with [---] real speakers and evaluated at [--] noise levelsfrom clean to 0dB SNR"
X Link 2026-02-16T20:10Z 13.5K followers, [---] engagements

"Paper: Dataset: https://huggingface.co/datasets/SLLMCommunity/SQuTR https://huggingface.co/papers/2602.12783 https://huggingface.co/datasets/SLLMCommunity/SQuTR https://huggingface.co/papers/2602.12783"
X Link 2026-02-16T20:10Z 13.5K followers, [---] engagements

"MedXIAOHE: A Medical Vision-Language Model Achieves SOTA across medical benchmarks surpassing closed-source multimodal systems via entity-aware pretraining and RL-driven reasoning"
X Link 2026-02-17T00:25Z 13.5K followers, [----] engagements

"Query-as-Anchor from Ant Group A framework that transforms static user embeddings into dynamic scenario-adaptive representations using LLMs. Achieves +9.8% AUC improvement over strong baselines on [--] industrial benchmarks"
X Link 2026-02-17T08:16Z 13.5K followers, [---] engagements

Limited data mode. Full metrics available with subscription: lunarcrush.com/pricing