Image-Text-to-Text
GGUF
llama.cpp
vision
vlm
conversational

Ollama error "key not found in model: deepseek2.expert_weights_scale"

#7
by fenixionsoul - opened

Tried to run on Ollama - hf.co/tencent/Youtu-VL-4B-Instruct-GGUF:Q8_0
and got:
Ollama error 500: {"error":"llama runner process has terminated: error loading model: error loading model hyperparameters: key not found in model: deepseek2.expert_weights_scale"}

Tencent org
edited Feb 10

@fenixionsoul The current released version of Ollama does not support direct inference with the Youtu-VL-4B model. You can use llama.cpp instead. For details, please refer to https://huggingface.co/tencent/Youtu-VL-4B-Instruct-GGUF#%F0%9F%9A%80-quickstart

Tencent org

Full support for this model is available starting from Ollama 0.30.4.
Please upgrade your Ollama to this version, after that run:
ollama run youtu/youtu-vl to start the model.

Yinsongliu changed discussion status to closed

Sign up or log in to comment