Update README.md
README.md
CHANGED
@@ -7,8 +7,8 @@ base_model:
 ---
 # Qwen3-Coder-30B-A3B-Instruct-nvfp4
 
-**Format:** NVFP4 — weights & activations quantized to FP4 with dual scaling.
-**Base model:** `Qwen/Qwen3-Coder-30B-A3B-Instruct`
+**Format:** NVFP4 — weights & activations quantized to FP4 with dual scaling.
+**Base model:** `Qwen/Qwen3-Coder-30B-A3B-Instruct`
 **How it was made:** One-shot calibration with LLM Compressor (NVFP4 recipe), long-seq calibration with nvidia/OpenCodeInstruct.
 
 > Notes: Keep `lm_head` in high precision; calibrate on long, domain-relevant sequences.
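
The README's phrase "FP4 with dual scaling" can be illustrated with a small simulation. This is a minimal sketch, not the LLM Compressor implementation: it assumes "dual scaling" means one per-tensor scale plus one scale per 16-element block, as NVFP4 is commonly described, and it keeps the block scale in ordinary floats instead of rounding it to FP8. The names `round_to_fp4` and `quantize_dequantize` are hypothetical helpers introduced only for this example.

```python
# Magnitudes representable by FP4 (E2M1); the sign is handled separately.
FP4_GRID = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
FP4_MAX = 6.0
FP8_E4M3_MAX = 448.0  # assumed range of the per-block scale format
BLOCK = 16            # assumed number of elements sharing one block scale


def round_to_fp4(x):
    """Snap x to the nearest signed value on the FP4 grid."""
    mag = min(FP4_GRID, key=lambda g: abs(abs(x) - g))
    return -mag if x < 0 else mag


def quantize_dequantize(values):
    """Simulate two-level scaling and return the dequantized values."""
    amax = max(abs(v) for v in values)
    if amax == 0.0:
        return list(values)
    # Level 1: per-tensor scale, chosen so block scales fit the FP8 range.
    global_scale = amax / (FP4_MAX * FP8_E4M3_MAX)
    out = []
    for start in range(0, len(values), BLOCK):
        block = values[start:start + BLOCK]
        block_amax = max(abs(v) for v in block)
        # Level 2: per-block scale (simplification: not rounded to FP8 here).
        block_scale = block_amax / FP4_MAX / global_scale
        step = block_scale * global_scale
        if step == 0.0:
            out.extend(0.0 for _ in block)
            continue
        out.extend(round_to_fp4(v / step) * step for v in block)
    return out
```

Calling `quantize_dequantize` on a list of floats returns the values a consumer would see after a quantize and dequantize round trip, which makes the rounding error of each block easy to inspect. Because each block gets its own scale, an outlier only coarsens the grid for the 16 values around it.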