Qwerky Optimized Hybrid Attention Experiments I can't believe it's not attention. QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated about 1 month ago • 173 • 2 QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 23 days ago • 394 • 3
QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated about 1 month ago • 173 • 2
QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 23 days ago • 394 • 3
Qwerky Optimized Hybrid Attention Experiments I can't believe it's not attention. QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated about 1 month ago • 173 • 2 QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 23 days ago • 394 • 3
QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated about 1 month ago • 173 • 2
QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 23 days ago • 394 • 3
QwerkyAI/Qwerky-Optimized-Llama3.1-Mamba-0.2-8B-Instruct Text Generation • 9B • Updated 23 days ago • 394 • 3
QwerkyAI/Qwerky-Optimized-Llama3.2-Mamba-0.2-3B-Instruct Text Generation • 4B • Updated about 1 month ago • 173 • 2