stefan-it/nanochat-german-data
This repository hosts the first German nanochat base model.
It was pretrained with a modified version of the awesome nanochat implementation from Andrej Karpathy. The model was trained on 8x A100 GPUs from Lambda.
Note: this repo hosts the final checkpoint from the original implementation. More information about the pretraining can be found in this repo, which hosts the HF Transformers-compatible model.
The model is licensed under the permissive Apache 2.0 license.