Data-Efficient RLVR via Off-Policy Influence Guidance Paper β’ 2510.26491 β’ Published Oct 30 β’ 10
Running on CPU Upgrade Featured 2.6k The Smol Training Playbook π 2.6k The secrets to building world-class LLMs
Running on Zero Featured 772 Qwen Image Edit β 772 Edit and enhance images based on descriptive instructions
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper β’ 2508.06471 β’ Published Aug 8 β’ 194
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper β’ 2508.06471 β’ Published Aug 8 β’ 194
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation β’ 480B β’ Updated Aug 21 β’ 113k β’ β’ 1.26k