Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
ElliotGao
tclf90
65
1
18
Follow
sdreddyjabra's profile picture
JunHowie's profile picture
Salvadori's profile picture
21 followers
·
18 following
AI & ML interests
None yet
Recent Activity
new
activity
3 days ago
QuantTrio/GLM-5.2-Int4-Int8Mix:
GLM-5.2-Int4-Int8Mix generates only "!!!!!" on H100 8×80GB with vLLM 0.23.0/0.24.0 — IndexShare sparse indexer K-cache never populated during prefill
liked
a model
9 days ago
festr2/GLM-5.2-Int8Mix-NVFP4
new
activity
10 days ago
QuantTrio/GLM-5.2-Int4-Int8Mix:
AWQ 4bit
View all activity
Organizations
tclf90
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
QuantTrio/GLM-5.2-Int4-Int8Mix
3 days ago
GLM-5.2-Int4-Int8Mix generates only "!!!!!" on H100 8×80GB with vLLM 0.23.0/0.24.0 — IndexShare sparse indexer K-cache never populated during prefill
3
#3 opened 4 days ago by
kinggenguo
New activity in
QuantTrio/GLM-5.2-Int4-Int8Mix
10 days ago
AWQ 4bit
2
#2 opened 10 days ago by
MatthieuZ
New activity in
QuantTrio/GLM-4.7-AWQ
about 2 months ago
Revert remplate
#8 opened about 2 months ago by
s-yanev
Support for structured output
#7 opened about 2 months ago by
s-yanev
New activity in
QuantTrio/DeepSeek-V3.1-AWQ-Lite
about 2 months ago
Fix chat_template crash when assistant message omits the `content` key
#5 opened about 2 months ago by
qgallouedec
New activity in
QuantTrio/DeepSeek-V3.2-Exp-AWQ
about 2 months ago
Fix chat_template crash when assistant message omits the `content` key
#4 opened about 2 months ago by
qgallouedec
New activity in
QuantTrio/DeepSeek-V3.1-AWQ
about 2 months ago
Fix chat_template crash when assistant message omits the `content` key
#5 opened about 2 months ago by
qgallouedec
New activity in
QuantTrio/Qwen3.6-35B-A3B-AWQ
2 months ago
Any plans for a calibration-based AWQ build for better long-context stability?
2
#6 opened 2 months ago by
hyunw55
PPLX or KLD, or other benchmark
1
#4 opened 2 months ago by
HenkTenk
New activity in
QuantTrio/GLM-5-AWQ
3 months ago
[Request] Great work! Do you have plans to also create GLM-5.1-AWQ?
🤗
1
10
#6 opened 3 months ago by
ag1988
New activity in
QuantTrio/Qwen3.5-122B-A10B-AWQ
3 months ago
CUDA version 13?
1
#1 opened 3 months ago by
pathosethoslogos
New activity in
QuantTrio/gemma-4-31B-it-AWQ
3 months ago
Request for awq of the gemma 4 26B A4B MoE
6
#1 opened 3 months ago by
rks2302
New activity in
QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ
3 months ago
AWQ 4/5/6-bit request for Qwopus3.5-27B-v3
🚀
❤️
3
3
#2 opened 3 months ago by
celikburak
New activity in
QuantTrio/Qwen3.5-27B-AWQ
3 months ago
AWQ 4-bit version of this Opus-Distilled-v2 model?
9
#5 opened 3 months ago by
celikburak
New activity in
QuantTrio/Qwen3.5-27B-AWQ
4 months ago
--max-model-len 32768 seems a bit too small for agent use cases ?
3
#3 opened 4 months ago by
edwarddukewu
My personal vLLM launch cmd on my old personal 2x3090 workstation
7
#1 opened 4 months ago by
tclf90
New activity in
QuantTrio/Qwen3.5-35B-A3B-AWQ
4 months ago
Can't get vLLM running on 1xRTX 4090
3
#1 opened 4 months ago by
slyfox1186
New activity in
cyankiwi/Qwen3.5-27B-AWQ-4bit
4 months ago
Easy to fall into infinite loop
👍
1
7
#2 opened 4 months ago by
dwaynedu
New activity in
QuantTrio/GLM-5-AWQ
4 months ago
GLM-5-AWQ vLLM 部署指南
👍
1
2
#2 opened 4 months ago by
CharlesChen2023
Great work
5
#1 opened 4 months ago by
JoeyHwong
Load more