๐Ÿ”ด High Significance

Model Releases

๐Ÿ”ด Large Language Models Explore by Latent Distilling โ€” score 75 Sources: huggingface

Generating diverse responses is crucial for test-time scaling of large language models (LLMs), yet standard stochastic sampling mostly yields surface-level lexical variation, limiting semantic exploration. In this paper, we propose Exploratory Sampling (ESamp), a decoding approach that explicitly en

Developer Tools

๐Ÿ”ด GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents โ€” score 95 Sources: huggingface

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability to perceive, interpret, and act over heterogeneous contexts s

๐Ÿ”ด RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments โ€” score 85 Sources: huggingface

We present RADIO-ViPE (Reduce All Domains Into One -- Video Pose Engine), an online semantic SLAM system that enables geometry-aware open-vocabulary grounding, associating arbitrary natural language queries with localized 3D regions and objects in dynamic environments. Unlike existing approaches tha

๐ŸŸก Notable

Model Releases

๐ŸŸก ClawGym: A Scalable Framework for Building Effective Claw Agents โ€” score 65 Sources: huggingface

Claw-style environments support multi-step workflows over local files, tools, and persistent workspace states. However, scalable development around these environments remains constrained by the absence of a systematic framework, especially one for synthesizing verifiable training data and integratin

๐ŸŸก Introducing Advanced Account Security โ€” score 50 Sources: lab_blog/OpenAI

Introducing Advanced Account Security: phishing-resistant login, stronger recovery, and enhanced protections to safeguard sensitive data and prevent account takeover.

๐ŸŸก Enabling a new model for healthcare with AI co-clinician โ€” score 50 Sources: lab_blog/DeepMind

Researching the path to AI-augmented care and development of an AI co-clinician.

Developer Tools

๐ŸŸก Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models โ€” score 55 Sources: huggingface

Diffusion large language models (dLLMs) offer parallel decoding and bidirectional context, but state-of-the-art dLLMs require billions of parameters for competitive performance. While existing distillation methods for dLLMs reduce inference steps within a single architecture, none address cross-arch

๐ŸŸก Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion โ€” score 45 Sources: huggingface

Controllable diffusion methods have substantially expanded the practical utility of diffusion models, but they are typically developed as isolated, backbone-specific systems with incompatible training pipelines, parameter formats, and runtime hooks. This fragmentation makes it difficult to reuse inf

๐ŸŸข Incremental

Model Releases

๐ŸŸข Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital โ€” score 10 Sources: huggingface

We study reliability in autonomous language-model agents that translate user mandates into validated tool actions under real capital. The setting is DX Terminal Pro, a 21-day deployment in which 3,505 user-funded agents traded real ETH in a bounded onchain market. Users configured vaults through str

Developer Tools

๐ŸŸข FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments โ€” score 35 Sources: huggingface

Large Language Models are being increasingly deployed as the decision-making core of autonomous agents capable of effecting change in external environments. Yet, in conversational benchmarks, which simulate real-world customer-centric issue resolution scenarios, these agents frequently fail due to t

๐ŸŸข Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding โ€” score 25 Sources: huggingface

RL post-training of frontier language models is increasingly bottlenecked by autoregressive rollout generation, making rollout acceleration a central systems challenge. Many existing efficiency methods improve throughput by changing the rollout or optimization regime, for example, through off-policy

๐ŸŸข Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising โ€” score 10 Sources: huggingface

We propose X-WAM, a Unified 4D World Model that unifies real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critical limitations of prior unified world models (e.g., UWM) that only model 2D pixel-space and fail to

๐Ÿ“„ New Papers

TitleCategoryScoreLink
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agentsdeveloper_tool89Open
RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environmentsdeveloper_tool67Open
Large Language Models Explore by Latent Distillingmodel_release65Open
ClawGym: A Scalable Framework for Building Effective Claw Agentsmodel_release48Open
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Modelsdeveloper_tool44Open
The Inverse-Wisdom Law: Architectural Tribalism and the Consensus Paradox in Agentic Swarmscs.AI0Open
Evaluating Epistemic Guardrails in AI Reading Assistants: A Behavioral Audit of a Minimal Prototypecs.AI0Open
BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learningcs.AI0Open
Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agentscs.AI0Open
Mechanized Foundations of Structural Governance: Machine-Checked Proofs for Governed Intelligencecs.AI0Open
The Two Boundaries: Why Behavioral AI Governance Fails Structurallycs.AI0Open
Learning Rate Engineering: From Coarse Single Parameter to Layered Evolutioncs.AI0Open
Machine Collective Intelligence for Explainable Scientific Discoverycs.AI0Open
METASYMBO: Multi-Agent Language-Guided Metamaterial Discovery via Symbolic Latent Evolutioncs.AI0Open
BoostLoRA: Growing Effective Rank by Boosting Adapterscs.AI0Open

๐Ÿข Lab Blog Posts