Running 153 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 153 Building and scaling RL environments for LLM training
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand qgallouedec • Dec 4, 2025 • 69
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • 20 days ago • 44
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 151
Running on CPU Upgrade Featured 3.17k The Smol Training Playbook 📚 3.17k The secrets to building world-class LLMs
view article Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism ariG23498 • Feb 12 • 20
view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ huggingface • Feb 3 • 53