๐Ÿ“„ New Papers

TitleCategoryScoreLink
Post-Training with Policy Gradients: Optimality and the Base Model Barriercs.AI0Open
Learning Quadruped Walking from Seconds of Demonstrationcs.AI0Open
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Contextcs.AI0Open
Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialoguescs.AI0Open
A Systematic Investigation of Document Chunking Strategies and Embedding Sensitivitycs.AI0Open
NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learningcs.AI0Open
Discovering the Hidden Role of Gini Index In Prompt-based Classificationcs.AI0Open
Diffusion Controller: Framework, Algorithms and Parameterizationcs.AI0Open
Masked Unfairness: Hiding Causality within Zero ATEcs.AI0Open
Foundational World Models Accurately Detect Bimanual Manipulator Failurescs.AI0Open
Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Languagecs.AI0Open
SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Educationcs.AI0Open
Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Modelscs.AI0Open
RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified Statescs.AI0Open
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learningcs.AI0Open