DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters Updated Jul 27, 2025 • 181
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 Text Generation • Updated Aug 7, 2024 • 133k • 109
DavidAU/AI_Autocorrect__Auto-Creative-Enhancement__Auto-Low-Quant-Optimization__gguf-exl2-hqq-SOFTWARE Text Generation • Updated Jul 27, 2025 • 65
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 Text Generation • 410B • Updated Sep 13, 2024 • 1.19k • 36
hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4 Text Generation • 410B • Updated Aug 7, 2024 • 153 • 16
hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4 Text Generation • 423B • Updated Sep 16, 2024 • 26 • 5
hugging-quants/Meta-Llama-3.1-8B-Instruct-BNB-NF4 Text Generation • 8B • Updated Aug 8, 2024 • 249 • 8
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit Text Generation • 8B • Updated Jul 29, 2024 • 305 • 4
ModelCloud/Meta-Llama-3.1-70B-Instruct-gptq-4bit Text Generation • 71B • Updated Jul 27, 2024 • 123 • 4
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • 71B • Updated Aug 7, 2024 • 9.43k • 23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4 Text Generation • 8B • Updated Aug 7, 2024 • 24k • 42
sunnyyy/openbuddy-llama3.1-8b-v22.1-131k-Q4_K_M-GGUF Text Generation • 8B • Updated Jul 25, 2024 • 277
azhiboedova/Meta-Llama-3.1-8B-Instruct-AQLM-2Bit-1x16 Text Generation • 2B • Updated Aug 28, 2024 • 11 • 13