FlowRL: Matching Reward Distributions for LLM Reasoning Paper • 2509.15207 • Published Sep 18, 2025 • 115
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation Paper • 2504.07448 • Published Apr 10, 2025 • 1
deepseek-ai/DeepSeek-Coder-V2-Instruct-0724 Text Generation • 236B • Updated Oct 8, 2024 • 92.8k • 114
sunzx0810/gte-Qwen2-7B-instruct-Q5_K_M-GGUF Sentence Similarity • 8B • Updated Jun 25, 2024 • 412 • 5
lmstudio-community/Mistral-Nemo-Instruct-2407-GGUF Text Generation • 12B • Updated Nov 4, 2024 • 4.15k • 33