Chronos-2: From Univariate to Universal Forecasting Paper • 2510.15821 • Published Oct 17, 2025 • 19
AutoPR: Let's Automate Your Academic Promotion! Paper • 2510.09558 • Published Oct 10, 2025 • 51
TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning Paper • 2505.23719 • Published May 29, 2025 • 3
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26, 2025 • 37
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
This Time is Different: An Observability Perspective on Time Series Foundation Models Paper • 2505.14766 • Published May 20, 2025 • 40
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors Paper • 2505.11770 • Published May 17, 2025 • 2
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5, 2025 • 133
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning Paper • 2506.10521 • Published Jun 12, 2025 • 73
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1, 2025 • 79
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models Paper • 2402.14848 • Published Feb 19, 2024 • 19
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? Paper • 2503.12349 • Published Mar 16, 2025 • 44