ebircak/gemma-4-31B-it-4bit-NVFP4A16-GPTQ

Wonderful model

by clayboby - opened 3 days ago

Good quality.
Much better than LilaRest/gemma-4-31B-it-NVFP4-turbo and cyankiwi/gemma-4-31B-it-AWQ-4bit.
Any optimization tips?

PS: I patched vLLM and run it on a single 5090 (32 GB) with 200K context and 1 image input.

ebircak

Owner 3 days ago

Performance-wise, I am just trying to quantize what makes sense. Calibration-wise, the calibration dataset is a mix of coding and tool-call corpora, as those are my top use cases. The recipe and dataset composition are in the model card, so it should be easy to evaluate and replicate.

clayboby

44 minutes ago

Really good. I tested GSM8K-Platinum, MMLU-CoT, IFEval, and HumanEval, plus my own tests, and saw almost no quality loss compared to Google's original. Best model for agents like openclaw or hermes.
