Qwen3 Vl 8B Instruct
Forkjoin.ai conversion of Qwen/Qwen3-VL-8B-Instruct to GGUF format for edge deployment.
Model Details
- Source Model: Qwen/Qwen3-VL-8B-Instruct
- Format: GGUF
- Converted by: Forkjoin.ai
Usage
With llama.cpp
./llama-cli -m qwen3-vl-8b-instruct-gguf.gguf -p "Your prompt here" -n 256
With Ollama
Create a Modelfile:
FROM ./qwen3-vl-8b-instruct-gguf.gguf
ollama create qwen3-vl-8b-instruct-gguf -f Modelfile
ollama run qwen3-vl-8b-instruct-gguf
About Forkjoin.ai
Forkjoin.ai runs AI models at the edge -- in-browser, on-device, zero cloud cost. These converted models power real-time inference, speech recognition, and natural language capabilities.
All conversions are optimized for edge deployment within browser and mobile memory constraints.
License
Apache 2.0 (follows upstream model license)
- Downloads last month
- 57
Hardware compatibility
Log In to add your hardware
4-bit
Model tree for forkjoin-ai/qwen3-vl-8b-instruct-gguf
Base model
Qwen/Qwen3-VL-8B-Instruct