Update README.md
Browse files
README.md
CHANGED
|
@@ -53,7 +53,6 @@ Below is a comparison between the base model and this IPO-trained version.
|
|
| 53 |
- **Base Model:** Qwen/Qwen3-0.6B-Base
|
| 54 |
- **SFT Model Used:** [AIPlans/Qwen3-0.6b-SFT-hs2](https://huggingface.co/AIPlans/Qwen3-0.6b-SFT-hs2)
|
| 55 |
- **Precision:** bfloat16 (Training), bfloat16 (Final Weights)
|
| 56 |
-
- **Optimizer:** AdamW
|
| 57 |
- **Learning Rate:** 5e-7
|
| 58 |
- **Beta:** 0.01
|
| 59 |
- **Epochs:** 3
|
|
|
|
| 53 |
- **Base Model:** Qwen/Qwen3-0.6B-Base
|
| 54 |
- **SFT Model Used:** [AIPlans/Qwen3-0.6b-SFT-hs2](https://huggingface.co/AIPlans/Qwen3-0.6b-SFT-hs2)
|
| 55 |
- **Precision:** bfloat16 (Training), bfloat16 (Final Weights)
|
|
|
|
| 56 |
- **Learning Rate:** 5e-7
|
| 57 |
- **Beta:** 0.01
|
| 58 |
- **Epochs:** 3
|