DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 24 days ago • 354
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 28 days ago • 123
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 19 days ago • 867
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 4 days ago • 149
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 339
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 21 days ago • 49