Firworks commited on
Commit
0644434
·
verified ·
1 Parent(s): 8053f73

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -7,8 +7,8 @@ base_model:
7
  ---
8
  # Qwen3-Coder-30B-A3B-Instruct-nvfp4
9
 
10
- **Format:** NVFP4 — weights & activations quantized to FP4 with dual scaling.
11
- **Base model:** `Qwen/Qwen3-Coder-30B-A3B-Instruct`
12
  **How it was made:** One-shot calibration with LLM Compressor (NVFP4 recipe), long-seq calibration with nvidia/OpenCodeInstruct.
13
 
14
  > Notes: Keep `lm_head` in high precision; calibrate on long, domain-relevant sequences.
 
7
  ---
8
  # Qwen3-Coder-30B-A3B-Instruct-nvfp4
9
 
10
+ **Format:** NVFP4 — weights & activations quantized to FP4 with dual scaling.
11
+ **Base model:** `Qwen/Qwen3-Coder-30B-A3B-Instruct`
12
  **How it was made:** One-shot calibration with LLM Compressor (NVFP4 recipe), long-seq calibration with nvidia/OpenCodeInstruct.
13
 
14
  > Notes: Keep `lm_head` in high precision; calibrate on long, domain-relevant sequences.