Model Specs
The model configuration (derived from meta_002849.json) is as follows:
| Parameter | Value |
|---|---|
| Layers | 18 |
| Embedding Dim | 1152 |
| Heads (Q/KV) | 9 / 9 (GQA) |
| Vocab Size | 65,536 |
| Max Seq Len | 2048 |
| Window Pattern | SSSL |
Quick Start
This model uses a custom architecture and cannot be loaded directly via the standard transformers library. Please use the source code from the official GitHub repository.
git clone https://github.com/DestineG/nanochat.git
cd nanochat
git checkout v1.0
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support