Inference Providers
Active filters: GRPO
Any-to-Any
• Updated • 59
• 28
Text Generation
• 4B • Updated • 482
• • 3
mradermacher/Luna-Ethos-GGUF
4B • Updated • 49
• 2
Text Generation
• 4B • Updated • 48
• 2
Kimhi/AWARES-Qwen2.5-VL-7B
Image-Text-to-Text
• 8B • Updated • 57
• 2
mradermacher/SocialR1-8B-GGUF
Reinforcement Learning
• 4B • Updated • 835
• 1
mradermacher/SocialR1-8B-i1-GGUF
Reinforcement Learning
• 4B • Updated • 3.65k
• 1
mradermacher/AWARES-Qwen2.5-VL-7B-GGUF
8B • Updated • 516
• 1
mradermacher/AWARES-Qwen2.5-VL-7B-i1-GGUF
8B • Updated • 766
• 1
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated • 14
• • 24
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
• 1B • Updated • 30
• • 1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
1B • Updated • 100
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
1B • Updated • 157
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
• 1B • Updated • 20
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
• 1B • Updated • 19
Reinforcement Learning
• Updated mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated • 112
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated • 258
• 1
alpha-ai/Deep-Reason-SMALL-V0-GGUF
3B • Updated • 32
• 1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
• 3B • Updated • 8
• 2
mradermacher/Deep-Reason-SMALL-V0-GGUF
3B • Updated • 84
• 2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
3B • Updated • 228
• 1
alpha-ai/qwen2.5-reason-thought-lite-GGUF
3B • Updated • 43
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
• 3B • Updated • 5
• alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
3B • Updated • 39
• 2
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
• 3B • Updated • 7
mradermacher/Cogito-R1-GGUF
33B • Updated • 164
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
• 1B • Updated • 4
• mradermacher/Cogito-R1-i1-GGUF
33B • Updated • 704
AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
3B • Updated • 140
• 1