-
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation • 32B • Updated • 315 • 52 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation • 32B • Updated • 10.5k • 269 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation • 32B • Updated • 12.7k • 107 -
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation • 235B • Updated • 48 • 7
Collections
Discover the best community collections!
Collections trending this week
-
nvidia/Nemotron-3-Nano-RL-Training-Blend
Preview • Updated • 122 • 6 -
nvidia/Nemotron-Science-v1
Viewer • Updated • 226k • 225 • 6 -
nvidia/Nemotron-Instruction-Following-Chat-v1
Viewer • Updated • 288k • 276 • 10 -
nvidia/Nemotron-Math-Proofs-v1
Viewer • Updated • 925k • 193 • 9
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 22.6k • 69 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 58.1k • • 384 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 436k • 138 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 113k • • 732
-
nvidia/Nemotron-RL-knowledge-web_search-mcqa
Viewer • Updated • 2.93k • 643 • 3 -
nvidia/Nemotron-RL-agent-workplace_assistant
Viewer • Updated • 1.8k • 621 • 7 -
nvidia/Nemotron-RL-instruction_following
Preview • Updated • 282 • 6 -
nvidia/Nemotron-RL-instruction_following-structured_outputs
Viewer • Updated • 9.95k • 742 • 23
-
MiniMaxAI/VTP-Small-f16d64
Image Feature Extraction • 0.2B • Updated • 8 -
MiniMaxAI/VTP-Base-f16d64
Image Feature Extraction • 0.3B • Updated • 13 -
MiniMaxAI/VTP-Large-f16d64
Image Feature Extraction • 0.7B • Updated • 10 -
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 74
-
allenai/Olmo-3.1-32B-Think
Text Generation • 32B • Updated • 960 • • 43 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B • Updated • 560 • 5 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation • 32B • Updated • 816 • 3 -
allenai/Olmo-3.1-32B-Instruct
Text Generation • 32B • Updated • 561 • • 20
-
nvidia/Nemotron-Pretraining-Dataset-sample
Viewer • Updated • 27.7k • 653 • 28 -
nvidia/Nemotron-CC-Code-v1
Viewer • Updated • 216M • 14 • 7 -
nvidia/Nemotron-CC-v2.1
Viewer • Updated • 3.8B • 1.28k • 6 -
nvidia/Nemotron-Pretraining-Code-v2
Viewer • Updated • 836M • 30 • 10
-
Qwen3 VL Demo
😻320Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 10k • • 345 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 156k • • 335 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 8.46k • 24
-
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation • 32B • Updated • 315 • 52 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation • 32B • Updated • 10.5k • 269 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation • 32B • Updated • 12.7k • 107 -
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation • 235B • Updated • 48 • 7
-
MiniMaxAI/VTP-Small-f16d64
Image Feature Extraction • 0.2B • Updated • 8 -
MiniMaxAI/VTP-Base-f16d64
Image Feature Extraction • 0.3B • Updated • 13 -
MiniMaxAI/VTP-Large-f16d64
Image Feature Extraction • 0.7B • Updated • 10 -
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 74
-
nvidia/Nemotron-3-Nano-RL-Training-Blend
Preview • Updated • 122 • 6 -
nvidia/Nemotron-Science-v1
Viewer • Updated • 226k • 225 • 6 -
nvidia/Nemotron-Instruction-Following-Chat-v1
Viewer • Updated • 288k • 276 • 10 -
nvidia/Nemotron-Math-Proofs-v1
Viewer • Updated • 925k • 193 • 9
-
allenai/Olmo-3.1-32B-Think
Text Generation • 32B • Updated • 960 • • 43 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B • Updated • 560 • 5 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation • 32B • Updated • 816 • 3 -
allenai/Olmo-3.1-32B-Instruct
Text Generation • 32B • Updated • 561 • • 20
-
nvidia/Nemotron-Pretraining-Dataset-sample
Viewer • Updated • 27.7k • 653 • 28 -
nvidia/Nemotron-CC-Code-v1
Viewer • Updated • 216M • 14 • 7 -
nvidia/Nemotron-CC-v2.1
Viewer • Updated • 3.8B • 1.28k • 6 -
nvidia/Nemotron-Pretraining-Code-v2
Viewer • Updated • 836M • 30 • 10
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 22.6k • 69 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 58.1k • • 384 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 436k • 138 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 113k • • 732
-
Qwen3 VL Demo
😻320Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 10k • • 345 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 156k • • 335 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 8.46k • 24
-
nvidia/Nemotron-RL-knowledge-web_search-mcqa
Viewer • Updated • 2.93k • 643 • 3 -
nvidia/Nemotron-RL-agent-workplace_assistant
Viewer • Updated • 1.8k • 621 • 7 -
nvidia/Nemotron-RL-instruction_following
Preview • Updated • 282 • 6 -
nvidia/Nemotron-RL-instruction_following-structured_outputs
Viewer • Updated • 9.95k • 742 • 23