view article Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 18 days ago • 71
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 25 days ago • 50
view article Article The Open Source Community is backing OpenEnv for Agentic RL +18 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua, unseenmars • 28 days ago • 103
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • May 29 • 132
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • May 14 • 61
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50
view article Article Announcing the Hugging Face Fellowship Program merve, espejelomar • May 17, 2022 • 16
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation