nix-apollo
Results
1
issues of
nix-apollo
**Describe the bug** The huggingface tiny stories configs claim to support `n_ctx=2048`. However, the model was only trained with sequence length 512 (as mentioned [here](https://huggingface.co/roneneldan/TinyStories-33M)). The models in fact get...