GLM-ASR-Nano-2512 โ€” GGUF

GGUF conversions and quantisations of zai-org/GLM-ASR-Nano-2512 for use with CrispStrobe/CrispASR.

Available variants

File Quant Size Notes
glm-asr-nano.gguf F16 4.3 GB Full precision
glm-asr-nano-q8_0.gguf Q8_0 2.3 GB High quality
glm-asr-nano-q4_k.gguf Q4_K 1.3 GB Best size/quality tradeoff

All variants produce correct transcription on test audio.

Model details

  • Architecture: Whisper encoder (1280d, 32L, partial RoPE) + 4-frame projector + Llama LLM (2048d, 28L, GQA 16/4)
  • Parameters: 1.5B
  • Languages: 17 (Mandarin, English, Cantonese, + 14 more)
  • License: MIT
  • Outperforms OpenAI Whisper V3 on benchmarks (lowest avg error rate 4.10)

Usage with CrispASR

git clone https://github.com/CrispStrobe/CrispASR && cd CrispASR
cmake -S . -B build && cmake --build build -j8

# Auto-detect backend from GGUF
./build/bin/crispasr -m glm-asr-nano-q4_k.gguf -f audio.wav

# Explicit backend
./build/bin/crispasr --backend glm-asr -m glm-asr-nano-q4_k.gguf -f audio.wav -osrt

Conversion

python models/convert-glm-asr-to-gguf.py --input zai-org/GLM-ASR-Nano-2512 --output glm-asr-nano.gguf
crispasr-quantize glm-asr-nano.gguf glm-asr-nano-q4_k.gguf q4_k
Downloads last month
441
GGUF
Model size
2B params
Architecture
glmasr
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/glm-asr-nano-GGUF

Quantized
(3)
this model