๐Ÿ”ด High Significance

Model Releases

๐Ÿ”ด TAPS: Task Aware Proposal Distributions for Speculative Sampling โ€” score 95 Sources: huggingface

Speculative decoding accelerates autoregressive generation by letting a lightweight draft model propose future tokens that a larger target model then verifies in parallel. In practice, however, draft models are usually trained on broad generic corpora, which leaves it unclear how much speculative de

๐Ÿ”ด Gen-Searcher: Reinforcing Agentic Search for Image Generation โ€” score 75 Sources: huggingface

Recent image generation models have shown strong capabilities in generating high-fidelity and photorealistic images. However, they are fundamentally constrained by frozen internal knowledge, thus often failing on real-world scenarios that are knowledge-intensive or require up-to-date information. In

Developer Tools

๐Ÿ”ด Towards a Medical AI Scientist โ€” score 85 Sources: huggingface

Autonomous systems that generate scientific hypotheses, conduct experiments, and draft manuscripts have recently emerged as a promising paradigm for accelerating discovery. However, existing AI Scientists remain largely domain-agnostic, limiting their applicability to clinical medicine, where resear

๐ŸŸก Notable

Model Releases

๐ŸŸก Accelerating the next phase of AI โ€” score 50 Sources: lab_blog/OpenAI

OpenAI raises $122 billion in new funding to expand frontier AI globally, invest in next-generation compute, and meet growing demand for ChatGPT, Codex, and enterprise AI.

๐ŸŸก HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention โ€” score 45 Sources: huggingface

Token-level sparse attention mechanisms, exemplified by DeepSeek Sparse Attention (DSA), achieve fine-grained key selection by scoring every historical token for each query using a lightweight indexer, and then computing attention only over the selected subset. While the downstream sparse attention

Developer Tools

๐ŸŸก Emergent Social Intelligence Risks in Generative Multi-Agent Systems โ€” score 65 Sources: huggingface

Multi-agent systems composed of large generative models are rapidly moving from laboratory prototypes to real-world deployments, where they jointly plan, negotiate, and allocate shared resources to solve complex tasks. While such systems promise unprecedented scalability and autonomy, their collecti

๐ŸŸก EpochX: Building the Infrastructure for an Emergent Agent Civilization โ€” score 55 Sources: huggingface

General-purpose technologies reshape economies less by improving individual tools than by enabling new ways to organize production and coordination. We believe AI agents are approaching a similar inflection point: as foundation models make broad task execution and tool use increasingly accessible, t

๐ŸŸข Incremental

Model Releases

๐ŸŸข GEditBench v2: A Human-Aligned Benchmark for General Image Editing โ€” score 20 Sources: huggingface

Recent advances in image editing have enabled models to handle complex instructions with impressive realism. However, existing evaluation frameworks lag behind: current benchmarks suffer from narrow task coverage, while standard metrics fail to adequately capture visual consistency, i.e., the preser

๐ŸŸข ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks โ€” score 5 Sources: huggingface

Advances in diffusion, autoregressive, and hybrid models have enabled high-quality image synthesis for tasks such as text-to-image, editing, and reference-guided composition. Yet, existing benchmarks remain limited, either focus on isolated tasks, cover only narrow domains, or provide opaque scores

Developer Tools

๐ŸŸข On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models โ€” score 35 Sources: huggingface

Multimodal Continual Instruction Tuning aims to continually enhance Large Vision Language Models (LVLMs) by learning from new data without forgetting previously acquired knowledge. Mixture of Experts (MoE) architectures naturally facilitate this by incrementally adding new experts and expanding rout

๐ŸŸข Make Geometry Matter for Spatial Reasoning โ€” score 20 Sources: huggingface

Empowered by large-scale training, vision-language models (VLMs) achieve strong image and video understanding, yet their ability to perform spatial reasoning in both static scenes and dynamic videos remains limited. Recent advances try to handle this limitation by injecting geometry tokens from pret

๐Ÿ“„ New Papers

TitleCategoryScoreLink
TAPS: Task Aware Proposal Distributions for Speculative Samplingmodel_release147Open
Towards a Medical AI Scientistdeveloper_tool94Open
Gen-Searcher: Reinforcing Agentic Search for Image Generationmodel_release61Open
Emergent Social Intelligence Risks in Generative Multi-Agent Systemsdeveloper_tool57Open
EpochX: Building the Infrastructure for an Emergent Agent Civilizationdeveloper_tool51Open
WybeCoder: Verified Imperative Code Generationcs.AI0Open
WorldFlow3D: Flowing Through 3D Distributions for Unbounded World Generationcs.AI0Open
The Energy Footprint of LLM-Based Environmental Analysis: LLMs and Domain Productscs.AI0Open
APEX-EM: Non-Parametric Online Learning for Autonomous Agents via Structured Procedural-Episodic Experience Replaycs.AI0Open
Evaluating a Data-Driven Redesign Process for Intelligent Tutoring Systemscs.AI0Open
SemLoc: Structured Grounding of Free-Form LLM Reasoning for Fault Localizationcs.AI0Open
GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verificationcs.AI0Open
Towards Explainable Stakeholder-Aware Requirements Prioritisation in Aged-Care Digital Healthcs.AI0Open
"I Just Need GPT to Refine My Prompts": Rethinking Onboarding and Help-Seeking with Generative 3D Modeling Toolscs.AI0Open
Economics of Human and AI Collaboration: When is Partial Automation More Attractive than Full Automation?cs.AI0Open

๐Ÿข Lab Blog Posts