Model Specs

The model configuration (derived from meta_002849.json) is as follows:

Parameter Value
Layers 18
Embedding Dim 1152
Heads (Q/KV) 9 / 9 (GQA)
Vocab Size 65,536
Max Seq Len 2048
Window Pattern SSSL

Quick Start

This model uses a custom architecture and cannot be loaded directly via the standard transformers library. Please use the source code from the official GitHub repository.

git clone https://github.com/DestineG/nanochat.git
cd nanochat
git checkout v1.0
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support