| Reproduction Beyond Benchmarks: ConstBERT and ColBERT-v2 Across Backends and Query Distributions | cs.CL | 0 | Open |
| Demographic and Linguistic Bias Evaluation in Omnimodal Language Models | cs.CL | 0 | Open |
| FinTrace: Holistic Trajectory-Level Evaluation of LLM Tool Calling for Long-Horizon Financial Tasks | cs.CL | 0 | Open |
| Weird Generalization is Weirdly Brittle | cs.CL | 0 | Open |
| CoSToM: Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models | cs.CL | 0 | Open |
| Computational Implementation of a Model of Category-Theoretic Metaphor Comprehension | cs.CL | 0 | Open |
| Mirroring Minds: Asymmetric Linguistic Accommodation and Diagnostic Identity in ADHD and Autism Reddit Communities | cs.CL | 0 | Open |
| ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models | cs.CL | 0 | Open |
| Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty | cs.CL | 0 | Open |
| Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models | cs.CL | 0 | Open |
| SEPTQ: A Simple and Effective Post-Training Quantization Paradigm for Large Language Models | cs.CL | 0 | Open |
| Who Wrote This Line? Evaluating the Detection of LLM-Generated Classical Chinese Poetry | cs.CL | 0 | Open |
| CircuitSynth: Reliable Synthetic Data Generation | cs.CL | 0 | Open |
| Training-Free Cross-Lingual Dysarthria Severity Assessment via Phonological Subspace Analysis in Self-Supervised Speech Representations | cs.CL | 0 | Open |
| AITP: Traffic Accident Responsibility Allocation via Multimodal Large Language Models | cs.CL | 0 | Open |