Molmo2 Data Collection Artifacts for the Molmo2 data release • 16 items • Updated about 17 hours ago • 11
Datasets Wrapped 2025: Reasoning Collection The reasoning datasets that defined 2025. Part 1 of Datasets Wrapped 2025. #DatasetsWrapped2025 • 20 items • Updated about 18 hours ago • 1
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated about 12 hours ago • 27
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated about 12 hours ago • 33
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated about 12 hours ago • 73
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models 2 days ago • 72
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 5 days ago • 32
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 4 days ago • 24
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 16 days ago • 240
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 15 days ago • 126
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated 19 days ago • 11