Running 16 Defeating the trainer-generator precision mismatch in TRL 🎯 16 Download research PDF (Pro access required)
view article Article ⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch Jun 28, 2025 • 41
jinaai/jina-embeddings-v5-text-small-text-matching Sentence Similarity • 0.6B • Updated 17 days ago • 10.9k • 10
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 97
Running on CPU Upgrade Featured 3.14k The Smol Training Playbook 📚 3.14k The secrets to building world-class LLMs
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published Sep 29, 2025 • 14
Running 98 Unlocking On-Policy Distillation for Any Model Family 📝 98 Visualize on-policy distillation for any model family
Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning Paper • 2508.09726 • Published Aug 13, 2025 • 15
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published Oct 7, 2025 • 111
view article Article Training and Finetuning Reranker Models with Sentence Transformers Mar 26, 2025 • 193