๐Ÿ”ด High Significance

Developer Tools

๐Ÿ”ด Heterogeneous Scientific Foundation Model Collaboration โ€” score 95 Sources: huggingface

Agentic large language model systems have demonstrated strong capabilities. However, their reliance on language as the universal interface fundamentally limits their applicability to many real-world problems, especially in scientific domains where domain-specific foundation models have been develope

๐Ÿ”ด Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling โ€” score 85 Sources: huggingface

Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, long-horizon consistency, and causal understanding. We argue that the field should move beyond appea

๐Ÿ”ด Co-Evolving Policy Distillation โ€” score 75 Sources: huggingface

RLVR and OPD have become standard paradigms for post-training. We provide a unified analysis of these two paradigms in consolidating multiple expert capabilities into a single model, identifying capability loss in different ways: mixed RLVR suffers from inter-capability divergence cost, while the pi

๐ŸŸก Notable

Model Releases

๐ŸŸก Efficient Training on Multiple Consumer GPUs with RoundPipe โ€” score 55 Sources: huggingface

Fine-tuning Large Language Models (LLMs) on consumer-grade GPUs is highly cost-effective, yet constrained by limited GPU memory and slow PCIe interconnects. Pipeline parallelism combined with CPU offloading mitigates these hardware bottlenecks by reducing communication overhead. However, existing PP

Developer Tools

๐ŸŸก ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control โ€” score 65 Sources: huggingface

Humanoid control systems have made significant progress in recent years, yet modeling fluent interaction-rich behavior between a robot, its surrounding environment, and task-relevant objects remains a fundamental challenge. This difficulty arises from the need to jointly capture spatial context, tem

๐ŸŸก Leveraging Verifier-Based Reinforcement Learning in Image Editing โ€” score 45 Sources: huggingface

While Reinforcement Learning from Human Feedback (RLHF) has become a pivotal paradigm for text-to-image generation, its application to image editing remains largely unexplored. A key bottleneck is the lack of a robust general reward model for all editing tasks. Existing edit reward models usually gi

๐ŸŸข Incremental

Model Releases

๐ŸŸข Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows โ€” score 35 Sources: huggingface

LLM agents are expected to complete end-to-end units of work across software tools, business services, and local workspaces. Yet many agent benchmarks freeze a curated task set at release time and grade mainly the final response, making it difficult to evaluate agents against evolving workflow deman

Developer Tools

๐ŸŸข Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling โ€” score 25 Sources: huggingface

Token serves as the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack fine-grained length modeling, operating primarily at the coarse-grained seque

๐ŸŸข Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists โ€” score 15 Sources: huggingface

Existing research infrastructure is fundamentally document-centric, providing citation links between papers but lacking explicit representations of methodological evolution. In particular, it does not capture the structured relationships that explain how and why research methods emerge, adapt, and b

๐ŸŸข Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence โ€” score 5 Sources: huggingface

We introduce Nemotron 3 Nano Omni, the latest model in the Nemotron multimodal series and the first to natively support audio inputs alongside text, images, and video. Nemotron 3 Nano Omni delivers consistent accuracy improvements over its predecessor, Nemotron Nano V2 VL, across all modalities, ena

๐Ÿ“„ New Papers

TitleCategoryScoreLink
Heterogeneous Scientific Foundation Model Collaborationdeveloper_tool187Open
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modelingdeveloper_tool77Open
Co-Evolving Policy Distillationdeveloper_tool39Open
ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Controldeveloper_tool38Open
Efficient Training on Multiple Consumer GPUs with RoundPipemodel_release30Open
Leveraging Verifier-Based Reinforcement Learning in Image Editingdeveloper_tool28Open
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflowsmodel_release26Open
Length Value Model: Scalable Value Pretraining for Token-Level Length Modelingdeveloper_tool18Open
Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientistsdeveloper_tool17Open
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligencedeveloper_tool15Open