Qwen3-VL-8B-Instruct GGUF

Forkjoin.ai conversion of Qwen/Qwen3-VL-8B-Instruct to GGUF format for edge deployment.

Model Details

Source Model: Qwen/Qwen3-VL-8B-Instruct
Format: GGUF
Converted by: Forkjoin.ai

Usage

With llama.cpp

./llama-cli -m qwen3-vl-8b-instruct-gguf.gguf -p "Your prompt here" -n 256
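The same invocation can be scripted. The following is a minimal Python sketch, assuming the `llama-cli` binary and the GGUF file sit in the working directory as in the command above; the helper names `build_llama_cli_args` and `run_llama_cli` are hypothetical and exist only for illustration.

```python
import subprocess


def build_llama_cli_args(model_path, prompt, n_predict=256, binary="./llama-cli"):
    """Build the argument list for the llama-cli command shown above.

    Only the -m (model), -p (prompt), and -n (tokens to generate) flags
    from the command above are used.
    """
    return [binary, "-m", model_path, "-p", prompt, "-n", str(n_predict)]


def run_llama_cli(model_path, prompt, n_predict=256, binary="./llama-cli"):
    """Run llama-cli and return whatever it wrote to stdout.

    Assumes the binary exists at `binary`; raises CalledProcessError
    if the process exits with a non-zero status.
    """
    args = build_llama_cli_args(model_path, prompt, n_predict, binary)
    result = subprocess.run(args, capture_output=True, text=True, check=True)
    return result.stdout


# Example call (requires the binary and model file to be present):
# print(run_llama_cli("qwen3-vl-8b-instruct-gguf.gguf", "Your prompt here"))
```

Passing the arguments as a list avoids shell quoting problems when the prompt contains spaces or quotes.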

With Ollama

Create a Modelfile:

FROM ./qwen3-vl-8b-instruct-gguf.gguf

ollama create qwen3-vl-8b-instruct-gguf -f Modelfile
ollama run qwen3-vl-8b-instruct-gguf
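Once the model has been created, it can also be queried programmatically. The sketch below uses only the Python standard library and assumes an Ollama server is running locally on its default address (`http://localhost:11434`) and that the model was created under the name used above; the helper names `build_generate_request` and `generate` are hypothetical.

```python
import json
import urllib.request

MODEL_NAME = "qwen3-vl-8b-instruct-gguf"  # name passed to `ollama create` above
OLLAMA_HOST = "http://localhost:11434"    # assumption: default local Ollama address


def build_generate_request(prompt, model=MODEL_NAME, host=OLLAMA_HOST):
    """Build a POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        host + "/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )


def generate(prompt, model=MODEL_NAME, host=OLLAMA_HOST):
    """Send the prompt and return the generated text.

    Requires a running Ollama server; with "stream" set to False the
    server replies with a single JSON object whose "response" field
    holds the completion.
    """
    request = build_generate_request(prompt, model, host)
    with urllib.request.urlopen(request) as reply:
        return json.loads(reply.read())["response"]


# Example call (requires the server to be running):
# print(generate("Your prompt here"))
```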

About Forkjoin.ai

Forkjoin.ai runs AI models at the edge: in the browser and on-device, with zero cloud cost. These converted models power real-time inference, speech recognition, and natural language capabilities.

All conversions are optimized for edge deployment within browser and mobile memory constraints.
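To get a rough sense of what those memory constraints mean for this model, the weight storage can be estimated from the parameter count and the bits stored per weight. The sketch below is a lower-bound estimate only, assuming the 8B parameter count and 4-bit quantization listed for this model; real GGUF quantization formats store extra scale data, and runtime memory for the context cache is not included. The function name `estimate_weight_bytes` is hypothetical.

```python
GIB = 1024 ** 3  # bytes per gibibyte


def estimate_weight_bytes(n_params, bits_per_weight):
    """Lower-bound size of the weights alone, in bytes.

    Ignores per-block quantization scales, any tensors kept at higher
    precision, and all runtime buffers.
    """
    return n_params * bits_per_weight / 8


weights = estimate_weight_bytes(8e9, 4)  # 8B parameters at 4 bits each
print(f"approx. {weights / 1e9:.1f} GB ({weights / GIB:.2f} GiB) for weights alone")
```

The actual file size on disk is the more reliable figure when checking whether a device can hold the model.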

License

Apache 2.0 (follows upstream model license)

GGUF

Model size: 8B params
Architecture: qwen3vl
Hardware compatibility: 4-bit
