jodiox/olmo3-190m-zh-nano-sft

SFT(有监督微调)版本:基于 jodiox/olmo3-190m-zh-nano, 使用对话格式数据进行微调,学习指令遵循能力。

数据来源

训练配置

  • LR:5e-05(低 LR 避免灾难性遗忘)
  • Warmup:5.0%
  • Max Seq Length:2048

用法

from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("jodiox/olmo3-190m-zh-nano-sft")
tok = AutoTokenizer.from_pretrained("jodiox/olmo3-190m-zh-nano-sft")
Downloads last month
34
Safetensors
Model size
22M params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for jodiox/olmo3-190m-zh-nano-sft

Unable to build the model tree, the base model loops to the model itself. Learn more.

Space using jodiox/olmo3-190m-zh-nano-sft 1