arxiv:2411.02355
Eldar Kurtić
ekurtic
AI & ML interests
Efficient inference
Recent Activity
updated a model about 1 hour ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-INT8-fake-quant
published a model about 1 hour ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-INT8-fake-quant
updated a model about 1 hour ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-FP8-fake-quant