NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28, 2025 • 73 • 18
YuvrajSingh9886/LFM2.5-350M-grpo-summarization-quality-bleu Summarization • 0.4B • Updated 8 days ago • 263 • 2