| Post-Training with Policy Gradients: Optimality and the Base Model Barrier | cs.AI | 0 | Open |
| Learning Quadruped Walking from Seconds of Demonstration | cs.AI | 0 | Open |
| Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context | cs.AI | 0 | Open |
| Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialogues | cs.AI | 0 | Open |
| A Systematic Investigation of Document Chunking Strategies and Embedding Sensitivity | cs.AI | 0 | Open |
| NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning | cs.AI | 0 | Open |
| Discovering the Hidden Role of Gini Index In Prompt-based Classification | cs.AI | 0 | Open |
| Diffusion Controller: Framework, Algorithms and Parameterization | cs.AI | 0 | Open |
| Masked Unfairness: Hiding Causality within Zero ATE | cs.AI | 0 | Open |
| Foundational World Models Accurately Detect Bimanual Manipulator Failures | cs.AI | 0 | Open |
| Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language | cs.AI | 0 | Open |
| SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education | cs.AI | 0 | Open |
| Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models | cs.AI | 0 | Open |
| RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States | cs.AI | 0 | Open |
| Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning | cs.AI | 0 | Open |