sorakritt committed on
Commit 72bda53 · verified · 1 Parent(s): d51430c

Update README.md

Files changed (1)
  1. README.md +0 -1
README.md CHANGED
@@ -53,7 +53,6 @@ Below is a comparison between the base model and this IPO-trained version.
  - **Base Model:** Qwen/Qwen3-0.6B-Base
  - **SFT Model Used:** [AIPlans/Qwen3-0.6b-SFT-hs2](https://huggingface.co/AIPlans/Qwen3-0.6b-SFT-hs2)
  - **Precision:** bfloat16 (Training), bfloat16 (Final Weights)
- - **Optimizer:** AdamW
  - **Learning Rate:** 5e-7
  - **Beta:** 0.01
  - **Epochs:** 3
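
For readers wondering what the `Beta` value in the list above controls, here is a minimal sketch of the per-pair IPO objective in plain Python. It assumes the standard squared-error form of IPO, where the log-ratio margin between the chosen and rejected responses is regressed toward `1 / (2 * beta)`. The function name `ipo_loss`, its argument names, and the `HPARAMS` dict are hypothetical and are not taken from this repository's training code, which the diff does not show.

```python
# Hyperparameters as listed in the README after this commit (illustrative dict, not the repo's config).
HPARAMS = {"learning_rate": 5e-7, "beta": 0.01, "epochs": 3, "precision": "bfloat16"}


def ipo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=HPARAMS["beta"]):
    """Squared-error IPO loss for a single preference pair (sketch)."""
    # Margin: how much more the policy prefers the chosen response over the
    # rejected one, measured relative to the reference (SFT) model.
    margin = (policy_chosen_logp - ref_chosen_logp) - (
        policy_rejected_logp - ref_rejected_logp
    )
    # IPO pulls the margin toward a fixed target instead of pushing it unboundedly.
    target = 1.0 / (2.0 * beta)
    return (margin - target) ** 2


if __name__ == "__main__":
    # With beta = 0.01 the target margin is 1 / (2 * 0.01) = 50.
    print(ipo_loss(-10.0, -60.0, 0.0, 0.0))
```

Under this form, a smaller `beta` means a larger target margin, so the policy is allowed to drift further from the reference model before the loss pushes back.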