unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF Any-to-Any • 401B • Updated Jun 18, 2025 • 8.06k • 42
deepseek-ai/DeepSeek-V3.2-Speciale Text Generation • 685B • Updated Dec 1, 2025 • 30.4k • 630
HMoE: Heterogeneous Mixture of Experts for Language Modeling Paper • 2408.10681 • Published Aug 20, 2024 • 10