๐ด High Significance
Developer Tools
๐ด Heterogeneous Scientific Foundation Model Collaboration โ score 95
Sources: huggingface
Agentic large language model systems have demonstrated strong capabilities. However, their reliance on language as the universal interface fundamentally limits their applicability to many real-world problems, especially in scientific domains where domain-specific foundation models have been develope
๐ด Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling โ score 85
Sources: huggingface
Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, long-horizon consistency, and causal understanding. We argue that the field should move beyond appea
๐ด Co-Evolving Policy Distillation โ score 75
Sources: huggingface
RLVR and OPD have become standard paradigms for post-training. We provide a unified analysis of these two paradigms in consolidating multiple expert capabilities into a single model, identifying capability loss in different ways: mixed RLVR suffers from inter-capability divergence cost, while the pi
๐ก Notable
Model Releases
๐ก Efficient Training on Multiple Consumer GPUs with RoundPipe โ score 55
Sources: huggingface
Fine-tuning Large Language Models (LLMs) on consumer-grade GPUs is highly cost-effective, yet constrained by limited GPU memory and slow PCIe interconnects. Pipeline parallelism combined with CPU offloading mitigates these hardware bottlenecks by reducing communication overhead. However, existing PP
Developer Tools
๐ก ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control โ score 65
Sources: huggingface
Humanoid control systems have made significant progress in recent years, yet modeling fluent interaction-rich behavior between a robot, its surrounding environment, and task-relevant objects remains a fundamental challenge. This difficulty arises from the need to jointly capture spatial context, tem
๐ก Leveraging Verifier-Based Reinforcement Learning in Image Editing โ score 45
Sources: huggingface
While Reinforcement Learning from Human Feedback (RLHF) has become a pivotal paradigm for text-to-image generation, its application to image editing remains largely unexplored. A key bottleneck is the lack of a robust general reward model for all editing tasks. Existing edit reward models usually gi
๐ข Incremental
Model Releases
๐ข Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows โ score 35
Sources: huggingface
LLM agents are expected to complete end-to-end units of work across software tools, business services, and local workspaces. Yet many agent benchmarks freeze a curated task set at release time and grade mainly the final response, making it difficult to evaluate agents against evolving workflow deman
Developer Tools
๐ข Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling โ score 25
Sources: huggingface
Token serves as the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack fine-grained length modeling, operating primarily at the coarse-grained seque
๐ข Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists โ score 15
Sources: huggingface
Existing research infrastructure is fundamentally document-centric, providing citation links between papers but lacking explicit representations of methodological evolution. In particular, it does not capture the structured relationships that explain how and why research methods emerge, adapt, and b
๐ข Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence โ score 5
Sources: huggingface
We introduce Nemotron 3 Nano Omni, the latest model in the Nemotron multimodal series and the first to natively support audio inputs alongside text, images, and video. Nemotron 3 Nano Omni delivers consistent accuracy improvements over its predecessor, Nemotron Nano V2 VL, across all modalities, ena
๐ New Papers
| Title | Category | Score | Link |
|---|---|---|---|
| Heterogeneous Scientific Foundation Model Collaboration | developer_tool | 187 | Open |
| Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling | developer_tool | 77 | Open |
| Co-Evolving Policy Distillation | developer_tool | 39 | Open |
| ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control | developer_tool | 38 | Open |
| Efficient Training on Multiple Consumer GPUs with RoundPipe | model_release | 30 | Open |
| Leveraging Verifier-Based Reinforcement Learning in Image Editing | developer_tool | 28 | Open |
| Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows | model_release | 26 | Open |
| Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling | developer_tool | 18 | Open |
| Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists | developer_tool | 17 | Open |
| Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence | developer_tool | 15 | Open |