Model Specs

The model configuration (derived from meta_002849.json) is as follows:

Parameter	Value
Layers	18
Embedding Dim	1152
Heads (Q/KV)	9 / 9 (GQA)
Vocab Size	65,536
Max Seq Len	2048
Window Pattern	SSSL

Quick Start

This model uses a custom architecture and cannot be loaded directly via the standard transformers library. Please use the source code from the official GitHub repository.

git clone https://github.com/DestineG/nanochat.git
cd nanochat
git checkout v1.0

Downloads last month: 2

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support