AW · AI Watchtower

📄 New Papers

Title	Category	Link
Post-Training with Policy Gradients: Optimality and the Base Model Barrier	cs.AI	Open
Learning Quadruped Walking from Seconds of Demonstration	cs.AI	Open
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context	cs.AI	Open
Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialogues	cs.AI	Open
A Systematic Investigation of Document Chunking Strategies and Embedding Sensitivity	cs.AI	Open
NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning	cs.AI	Open
Discovering the Hidden Role of Gini Index In Prompt-based Classification	cs.AI	Open
Diffusion Controller: Framework, Algorithms and Parameterization	cs.AI	Open
Masked Unfairness: Hiding Causality within Zero ATE	cs.AI	Open
Foundational World Models Accurately Detect Bimanual Manipulator Failures	cs.AI	Open
Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language	cs.AI	Open
SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education	cs.AI	Open
Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models	cs.AI	Open
RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States	cs.AI	Open
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning	cs.AI	Open