view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 331
Favorite Models Collection Mostly uncensored models for low VRAM budget • 23 items • Updated 2 days ago • 5
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 895
🍺 The Bartenders 🍺 Collection This is a collection of models that I've trained on data collected through conversations with frontier models GPT, Claude, Perplexity and myself. • 9 items • Updated 3 days ago • 3