๐Ÿ”ด High Significance

Model Releases

๐Ÿ”ด Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips โ€” score 80 Sources: huggingface

Deep Neural Networks (DNNs) can be catastrophically disrupted by flipping only a handful of parameter bits. We introduce Deep Neural Lesion (DNL), a data-free and optimizationfree method that locates critical parameters, and an enhanced single-pass variant, 1P-DNL, that refines this selection with o

Developer Tools

๐Ÿ”ด Elucidating the SNR-t Bias of Diffusion Probabilistic Models โ€” score 95 Sources: huggingface

Diffusion Probabilistic Models have demonstrated remarkable performance across a wide range of generative tasks. However, we have observed that these models often suffer from a Signal-to-Noise Ratio-timestep (SNR-t) bias. This bias refers to the misalignment between the SNR of the denoising sample a

๐Ÿ”ด DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off โ€” score 80 Sources: huggingface

Reinforcement Learning with Verifiable Rewards (RLVR) has catalyzed significant advances in the reasoning capabilities of Large Language Models (LLMs). However, effectively managing the exploration and exploitation trade-off remains a critical challenge. In this paper, we fully analyze the explorati

๐ŸŸก Notable

Model Releases

๐ŸŸก Qwen3.5-Omni Technical Report โ€” score 65 Sources: huggingface

In this work, we present Qwen3.5-Omni, the latest advancement in the Qwen-Omni model family. Representing a significant evolution over its predecessor, Qwen3.5-Omni scales to hundreds of billions of parameters and supports a 256k context length. By leveraging a massive dataset comprising heterogeneo

๐ŸŸก PersonaVLM: Long-Term Personalized Multimodal LLMs โ€” score 55 Sources: huggingface

Multimodal Large Language Models (MLLMs) serve as daily assistants for millions. However, their ability to generate responses aligned with individual preferences remains limited. Prior approaches enable only static, single-turn personalization through input augmentation or output alignment, and thus

๐ŸŸก OpenAI helps Hyatt advance AI among colleagues โ€” score 50 Sources: lab_blog/OpenAI

Hyatt deploys ChatGPT Enterprise across its global workforce, using GPT-5.4 and Codex to improve productivity, operations, and guest experiences.

Developer Tools

๐ŸŸก Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems โ€” score 45 Sources: huggingface

Retrieval-Augmented Generation (RAG) systems critically depend on effective document chunking strategies to balance retrieval quality, latency, and operational cost. Traditional chunking approaches, such as fixed-size, rule-based, or fully agentic chunking, often suffer from high token consumption,

๐ŸŸข Incremental

Model Releases

๐ŸŸข Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning โ€” score 30 Sources: huggingface

Parallel reasoning enhances Large Reasoning Models (LRMs) but incurs prohibitive costs due to futile paths caused by early errors. To mitigate this, path pruning at the prefix level is essential, yet existing research remains fragmented without a standardized framework. In this work, we propose the

Developer Tools

๐ŸŸข Mind DeepResearch Technical Report โ€” score 30 Sources: huggingface

We present Mind DeepResearch (MindDR), an efficient multi-agent deep research framework that achieves leading performance with only ~30B-parameter models through a meticulously designed data synthesis and multi-stage training pipeline. The core innovation of MindDR lies in a collaborative three-agen

๐ŸŸข Motif-Video 2B: Technical Report โ€” score 5 Sources: huggingface

Training strong video generation models usually requires massive datasets, large parameter counts, and substantial compute. In this work, we ask whether strong text-to-video quality is possible at a much smaller budget: fewer than 10M clips and less than 100,000 H200 GPU hours. Our core claim is tha

Infrastructure & Compute

๐ŸŸข Where does output diversity collapse in post-training? โ€” score 15 Sources: huggingface

Post-trained language models produce less varied outputs than their base counterparts. This output diversity collapse undermines inference-time scaling methods that rely on varied samples, and risks homogenizing model outputs on creative and value-laden tasks. Prior work attributes collapse to speci

๐Ÿ“„ New Papers

TitleCategoryScoreLink
Elucidating the SNR-t Bias of Diffusion Probabilistic Modelsdeveloper_tool77Open
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Offdeveloper_tool64Open
Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flipsmodel_release64Open
Qwen3.5-Omni Technical Reportmodel_release59Open
PersonaVLM: Long-Term Personalized Multimodal LLMsmodel_release48Open
Towards Intelligent Legal Document Analysis: CNN-Driven Classification of Case Law Textscs.AI0Open
Semantic Entanglement in Vector-Based Retrieval: A Formal Framework and Context-Conditioned Disentanglement Pipeline for Agentic RAG Systemscs.AI0Open
SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Modelscs.AI0Open
CAPO: Counterfactual Credit Assignment in Sequential Cooperative Teamscs.AI0Open
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Playcs.AI0Open
WISV: Wireless-Informed Semantic Verification for Distributed Speculative Decoding in Device-Edge LLM Inferencecs.AI0Open
Before You Interpret the Profile: Validity Scaling for LLM Metacognitive Self-Reportcs.AI0Open
Co-evolving Agent Architectures and Interpretable Reasoning for Automated Optimizationcs.AI0Open
Screen Before You Interpret: A Portable Validity Protocol for Benchmark-Based LLM Confidence Signalscs.AI0Open
Concurrent Criterion Validation of a Validity Screen for LLM Confidence Signals via Selective Predictioncs.AI0Open

๐Ÿข Lab Blog Posts