nix-apollo

Results 1 issues of nix-apollo

**Describe the bug** The huggingface tiny stories configs claim to support `n_ctx=2048`. However, the model was only trained with sequence length 512 (as mentioned [here](https://huggingface.co/roneneldan/TinyStories-33M)). The models in fact get...