sorakritt commited on
Commit
d51430c
·
verified ·
1 Parent(s): c313299

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -51,7 +51,7 @@ Below is a comparison between the base model and this IPO-trained version.
51
 
52
  - **Method:** IPO (Identity Preference Optimization)
53
  - **Base Model:** Qwen/Qwen3-0.6B-Base
54
- - **SFT Model Used:** [AIPlans/qwen3-0.6b-SFT-hs2](https://huggingface.co/AIPlans/qwen3-0.6b-SFT-hs2)
55
  - **Precision:** bfloat16 (Training), bfloat16 (Final Weights)
56
  - **Optimizer:** AdamW
57
  - **Learning Rate:** 5e-7
@@ -71,7 +71,7 @@ Below is a comparison between the base model and this IPO-trained version.
71
  import torch
72
  from transformers import AutoModelForCausalLM, AutoTokenizer
73
 
74
- model_id = "sorakritt/qwen3-0.6b-IPO-hs2"
75
 
76
  model = AutoModelForCausalLM.from_pretrained(
77
  model_id,
@@ -86,6 +86,7 @@ inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
86
 
87
  outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
88
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 
89
 
90
  ## Model Card Author
91
  Premanand Jena - AIPlans Research Intern,
 
51
 
52
  - **Method:** IPO (Identity Preference Optimization)
53
  - **Base Model:** Qwen/Qwen3-0.6B-Base
54
+ - **SFT Model Used:** [AIPlans/Qwen3-0.6b-SFT-hs2](https://huggingface.co/AIPlans/Qwen3-0.6b-SFT-hs2)
55
  - **Precision:** bfloat16 (Training), bfloat16 (Final Weights)
56
  - **Optimizer:** AdamW
57
  - **Learning Rate:** 5e-7
 
71
  import torch
72
  from transformers import AutoModelForCausalLM, AutoTokenizer
73
 
74
+ model_id = "AIPlans/Qwen3-0.6B-IPO"
75
 
76
  model = AutoModelForCausalLM.from_pretrained(
77
  model_id,
 
86
 
87
  outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
88
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
89
+ ```
90
 
91
  ## Model Card Author
92
  Premanand Jena - AIPlans Research Intern,