view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
view article Article Occam’s Sheath: A Simpler Approach to AI Safety Guardrails daniel-de-leon • Oct 18, 2024 • 8
view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 danf, mber, moshew • Dec 4, 2025 • 40
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13, 2025 • 29
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30, 2025 • 282
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face +5 burtenshaw, reach-vb, pcuenq, clem, rajatarya, jsulz, lysandre • Apr 5, 2025 • 149
view article Article Building Your Own AI Document Dream Team: A Generic Multi-Agent System ifahim • Apr 8, 2025 • 9
view article Article Fine-Tune Meta Llama 3.2-Vision-Instruct Multimodal LLM on Intel Accelerators bconsolvo • Jan 28, 2025 • 8
view article Article Model Card Generator Interface: Crafting Clear Insights into AI Models mitalipo • Sep 27, 2024 • 4
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Gaudi® Accelerator omarkhleif • Sep 9, 2024 • 8
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging akjindal53244 • Aug 19, 2024 • 79