Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
Updated a model about 13 hours ago: inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-block
Updated a model about 13 hours ago: inference-optimization/Qwen3-Next-80B-A3B-Thinking-FP8-block
Updated a model about 13 hours ago: inference-optimization/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16