π΄ High Significance
Model Releases
π΄ HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds β score 95
Sources: huggingface
We introduce HY-World 2.0, a multi-modal world model framework that advances our prior project HY-World 1.0. HY-World 2.0 accommodates diverse input modalities, including text prompts, single-view images, multi-view images, and videos, and produces 3D world representations. With text or single-view
π΄ How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data β score 75
Sources: huggingface
A widely adopted strategy for model enhancement is to use synthetic data generated by a stronger model for supervised fine-tuning (SFT). However, for emerging reasoning models like Qwen3-8B, this approach often fails to improve reasoning capabilities and can even lead to a substantial drop in perfor
Developer Tools
π΄ DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation β score 85
Sources: huggingface
Deep Research Agents (DRAs) aim to solve complex, long-horizon research tasks involving planning, retrieval, multimodal understanding, and report generation, yet their evaluation remains challenging due to dynamic web environments and ambiguous task definitions. We propose DR^{3}-Eval, a realistic a
π‘ Notable
Model Releases
π‘ Product Apr 17, 2026 Introducing Claude Design by Anthropic Labs Today, weβre launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers, and more. β score 50
Sources: lab_blog/Anthropic
Announcements Apr 7, 2026 Project Glasswing A new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks in an effort to secure the world's most critical software. Mar
π‘ Announcements Feb 4, 2026 Claude is a space to think Weβve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access without compromising user trust. β score 50
Sources: lab_blog/Anthropic
Product Apr 17, 2026 Introducing Claude Design by Anthropic Labs Today, weβre launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers, and more. Announcements Apr 7, 2026 Project Glasswing
π‘ Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems β score 45
Sources: huggingface
Claude Code is an agentic coding tool that can run shell commands, edit files, and call external services on behalf of the user. This study describes its comprehensive architecture by analyzing the publicly available TypeScript source code and further comparing it with OpenClaw, an independent open-
Developer Tools
π‘ RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework β score 65
Sources: huggingface
High-level autonomous driving requires motion planners capable of modeling multimodal future uncertainties while remaining robust in closed-loop interactions. Although diffusion-based planners are effective at modeling complex trajectory distributions, they often suffer from stochastic instabilities
π‘ GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens β score 55
Sources: huggingface
The efficient spatial allocation of primitives serves as the foundation of 3D Gaussian Splatting, as it directly dictates the synergy between representation compactness, reconstruction speed, and rendering fidelity. Previous solutions, whether based on iterative optimization or feed-forward inferenc
π’ Incremental
Developer Tools
π’ HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System β score 35
Sources: huggingface
While end-to-end Vision-Language-Action (VLA) models offer a promising paradigm for robotic manipulation, fine-tuning them on narrow control data often compromises the profound reasoning capabilities inherited from their base Vision-Language Models (VLMs). To resolve this fundamental trade-off, we p
π’ ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack β score 25
Sources: huggingface
Large language models (LLMs), despite being safety-aligned, exhibit brittle refusal behaviors that can be circumvented by simple linguistic changes. As tense jailbreaking demonstrates that models refusing harmful requests often comply when rephrased in past tense, a critical generalization gap is re
π’ UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards β score 15
Sources: huggingface
Retrieval-Augmented Generation (RAG) extends Large Vision-Language Models (LVLMs) with external visual knowledge. However, existing visual RAG systems typically rely on generic retrieval signals that overlook the fine-grained visual semantics essential for complex reasoning. To address this limitati
Infrastructure & Compute
π’ LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories β score 5
Sources: huggingface
This paper focuses on the alignment of flow matching models with human preferences. A promising way is fine-tuning by directly backpropagating reward gradients through the differentiable generation process of flow matching. However, backpropagating through long trajectories results in prohibitive me
π New Papers
| Title | Category | Score | Link |
|---|---|---|---|
| HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds | model_release | 122 | Open |
| DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation | developer_tool | 37 | Open |
| How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data | model_release | 36 | Open |
| RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework | developer_tool | 33 | Open |
| GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens | developer_tool | 27 | Open |
| BioHiCL: Hierarchical Multi-Label Contrastive Learning for Biomedical Retrieval with MeSH Labels | cs.AI | 0 | Open |
| DALM: A Domain-Algebraic Language Model via Three-Phase Structured Generation | cs.AI | 0 | Open |
| DataCenterGym: A Physics-Grounded Simulator for Multi-Objective Data Center Scheduling | cs.AI | 0 | Open |
| PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems | cs.AI | 0 | Open |
| Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies | cs.AI | 0 | Open |
| CLIMB: Controllable Longitudinal Brain Image Generation using Mamba-based Latent Diffusion Model and Gaussian-aligned Autoencoder | cs.AI | 0 | Open |
| VoodooNet: Achieving Analytic Ground States via High-Dimensional Random Projections | cs.AI | 0 | Open |
| Rethinking the Necessity of Adaptive Retrieval-Augmented Generation through the Lens of Adaptive Listwise Ranking | cs.AI | 0 | Open |
| Conjunctive Prompt Attacks in Multi-Agent LLM Systems | cs.AI | 0 | Open |
| HYPERHEURIST: A Simulated Annealing-Based Control Framework for LLM-Driven Code Generation in Optimized Hardware Design | cs.AI | 0 | Open |
π’ Lab Blog Posts
- Anthropic: Product Apr 17, 2026 Introducing Claude Design by Anthropic Labs Today, weβre launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers, and more.
- Anthropic: Announcements Feb 4, 2026 Claude is a space to think Weβve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access without compromising user trust.