caiovicentino1/Qwopus3.5-9B-v3-PolarQuant-Q5
Text Generation • 9B • Updated • 2.37k • 8
Full-stack HLWQ: Q5 weights + torchao INT4 + Q3 KV cache · formerly PolarQuant Unified
Note HLWQ Q5 · 16.6 GB · 27B Gemma-4 26B-A4B MoE · per-expert, consumer GPU ready
Note Full-stack HLWQ on MiniMax-M2.7 229B MoE