These are UD quantizations of huihui-ai/Huihui-gemma-4-26B-A4B-it-abliterated, packaged for llama.cpp / GGUF inference.

Quick Start

  1. Download the latest release of llama.cpp.
  2. Download your preferred model variant from the files below.
  3. Use the corresponding mmproj file for multimodal inference.

Which version should I choose?

These variants are built using a Unsloths Amazing tensor distribution recipe to preserve as much quality as possible while reducing memory usage.

Notes

Downloads last month
17,101
GGUF
Model size
25B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for groxaxo/Huihui-gemma-4-26B-A4B-it-abliterated-GGUF

Quantized
(7)
this model