Running Featured 74 Distilling 100B+ Models 40x Faster with TRL 📝 74 TRL distillation for 100B+ teachers, 40x faster
Running on CPU Upgrade 231 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 231 Explore synthetic data experiments on a virtual bookshelf
Running on CPU Upgrade Featured 3.13k The Smol Training Playbook 📚 3.13k The secrets to building world-class LLMs
Running 3.82k The Ultra-Scale Playbook 🌌 3.82k The ultimate guide to training LLM on large GPU Clusters
Running on L40S Agents 589 MinerU Document Extraction Tools 📚 589 Easy converting PDF and Office docs into Markdown and JSON
Running 596 Scaling test-time compute 📈 596 Run advanced search strategies to boost LLM problem solving
Running 5 PL-MTEB: Polish Massive Text Embedding Benchmark 📈 5 Display evaluation results in a leaderboard
Running Featured 1.05k Can You Run It? LLM version 🚀 1.05k Calculate GPU needs for running LLMs on your hardware