
Georgi Gerganov

ggerganov

https://ggerganov.com

AI & ML interests

ggml

Recent Activity

new activity 1 day ago

ggml-org/Qwen3.6-35B-A3B-MTP-GGUF:fix

new activity 1 day ago

ggml-org/Qwen3.6-27B-MTP-GGUF:fixed repo id

updated a model 1 day ago

ggml-org/Qwen3.6-35B-A3B-MTP-GGUF


Organizations

upvoted an article about 1 month ago

Article

Using OCR models with llama.cpp

ggml-org • Apr 10 • 28

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 895

upvoted a collection about 2 months ago

Gemma 4

Collection

4 items • Updated Apr 2 • 33

upvoted an article about 2 months ago

Article

Falcon Perception

tiiuae • Apr 1 • 67

upvoted 2 articles 2 months ago

Article

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

nvidia • Mar 17 • 64

Article

Introducing Storage Buckets on the Hugging Face Hub

Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 194

upvoted a changelog 3 months ago

Hugging Face Changelog

Community Evals and Benchmark Repositories

Feb 5 • 79

upvoted an article 3 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505

upvoted an article 4 months ago

Article

New in llama.cpp: Anthropic Messages API

ggml-org • Jan 19 • 45

upvoted an article 5 months ago

Article

New in llama.cpp: Model Management

ggml-org • Dec 11, 2025 • 135

upvoted an article 6 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311

upvoted a collection 9 months ago

Gemma 3-270m

Collection

Collection of models for Gemma 3-270m • 4 items • Updated Dec 16, 2025 • 22

upvoted an article 11 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 776

upvoted a collection 11 months ago

NextCoder

Collection

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9, 2025 • 77

upvoted an article about 1 year ago

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c • May 15, 2025 • 122

upvoted 2 collections about 1 year ago

Qwen 3

Collection

13 items • Updated Dec 16, 2025 • 10

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.79k

upvoted an article about 1 year ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

julien-c • Apr 25, 2025 • 308

upvoted a collection about 1 year ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Mar 12 • 218

upvoted an article over 1 year ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

jsulz, yuchenglow, znation, saba9 • Feb 12, 2025 • 80
