DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 15 days ago • 213 • 5
EntroPE: Entropy-Guided Dynamic Patch Encoder for Time Series Forecasting Paper • 2509.26157 • Published Sep 30 • 2 • 3
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity Paper • 2101.03961 • Published Jan 11, 2021 • 13 • 1
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 13 days ago • 147 • 5
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published 21 days ago • 107 • 3
Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics Paper • 2510.05137 • Published Oct 1 • 5 • 3
RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation Paper • 2509.16198 • Published Sep 19 • 127 • 21
MiMo-Embodied: X-Embodied Foundation Model Technical Report Paper • 2511.16518 • Published 27 days ago • 23 • 3
Multimodal Evaluation of Russian-language Architectures Paper • 2511.15552 • Published 28 days ago • 78 • 4
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published May 29 • 22 • 3
ROOT: Robust Orthogonalized Optimizer for Neural Network Training Paper • 2511.20626 • Published 22 days ago • 42 • 5
RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media Paper • 2002.12683 • Published Feb 28, 2020 • 1
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms Paper • 2511.17592 • Published 30 days ago • 118 • 3
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published 23 days ago • 49 • 3
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper • 2506.18898 • Published Jun 23 • 33 • 2