NSFWVision qwen3 vl 8b V3 (GGUF)

GGUF quants of https://huggingface.co/GitMylo/nsfwvision-qwen3-vl-8b-v3-safetensors. This is an experimental model: the vision tower and the base model were both trained while the mmproj (translator) stayed frozen. After training, the trained base model was reset to its original weights, so it acted only as a sponge during training.

A finetune of the vision tower of Qwen3-VL-8B-Instruct-abliterated-v2.
During training, the base model was unfrozen; after training finished, it was reset to the original (abliterated) weights. Letting the base model train and resetting it afterwards allows it to "soak" up the training signal without biasing the final model's writing style or prompt-following capabilities. In effect, it served as a sponge that caught the loss related to sentence structure, so the vision tower could actually learn the concepts.
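The train-then-reset idea described above can be illustrated with a toy PyTorch sketch. This is not the author's training code: `ToyVLM`, `train_with_sponge`, the layer sizes, the optimizer, and the random data are all hypothetical stand-ins chosen only to show the mechanics (projector frozen, language model trained and then restored, vision tower keeping its updates).

```python
import copy

import torch
from torch import nn

torch.manual_seed(0)


class ToyVLM(nn.Module):
    """Hypothetical stand-in for a vision-language model with three parts."""

    def __init__(self):
        super().__init__()
        self.vision_tower = nn.Linear(8, 8)    # the part we want to finetune
        self.projector = nn.Linear(8, 8)       # plays the role of the mmproj
        self.language_model = nn.Linear(8, 4)  # plays the role of the base model

    def forward(self, x):
        features = torch.relu(self.vision_tower(x))
        return self.language_model(self.projector(features))


def train_with_sponge(model, steps=5, lr=0.1):
    # Keep the projector frozen for the whole run.
    for p in model.projector.parameters():
        p.requires_grad_(False)

    # Snapshot the language model so it can be restored after training.
    original_lm = copy.deepcopy(model.language_model.state_dict())

    trainable = [p for p in model.parameters() if p.requires_grad]
    optimizer = torch.optim.SGD(trainable, lr=lr)

    # Random toy data; a real run would use image/caption batches.
    x, y = torch.randn(16, 8), torch.randn(16, 4)
    for _ in range(steps):
        optimizer.zero_grad()
        loss = nn.functional.mse_loss(model(x), y)
        loss.backward()
        optimizer.step()

    # Reset the "sponge": discard the language model's updates,
    # keep the vision tower's updates.
    model.language_model.load_state_dict(original_lm)
    return model


model = ToyVLM()
before = copy.deepcopy(model.state_dict())
train_with_sponge(model)
after = model.state_dict()
```

After the call, only the vision tower's weights differ from their starting values; the projector was never updated and the language model is back to its snapshot.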


GGUF
Model size: 8B params
Architecture: qwen3vl
Hardware compatibility: 4-bit, 5-bit, 8-bit, 16-bit


Model tree for GitMylo/nsfwvision-qwen3-vl-8b-v3-gguf
Base model: Qwen/Qwen3-VL-8B-Instruct
Quantized (72): this model