Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Efficient Intelligence and Systems
community
Efficient-ML
Activity Feed
Follow
33
AI & ML interests
Low-bit Quantization of Large Language Models (LLMs)
Recent Activity
AaronHuangWei
authored
a paper
about 14 hours ago
MC#: Mixture Compressor for Mixture-of-Experts Large Models
AaronHuangWei
authored
a paper
about 14 hours ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
AaronHuangWei
submitted
a paper
1 day ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
View all activity
Team members
9
Efficient-ML
's models
52
Sort: Recently updated
Efficient-ML/GPTQ-for-Qwen3
Updated
May 12
Efficient-ML/Qwen3-awq
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w4-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w4-128
Updated
May 7
•
1
Efficient-ML/Qwen3-14B-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-14B-base-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-base-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w4-perchannel
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w4-128
Updated
May 7
Efficient-ML/Qwen3-4B-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-4B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-4B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-4B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w8-perchannel
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w8-128
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w8-perchannel
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w8-128
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-14B-base-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-14B-base-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-8B-base-gptq-w8-perchannel
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w8-128
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w4-perchannel
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w4-128
Updated
May 5
Previous
1
2
Next