Ozawa

Results 2 issues of Ozawa

First of all, thank you very much for your work. I try to train the model `Gemma-2B 32K seq len with 2K segment size` on a single A6000Ada 48G But...

When I run the training example, it prompts `NotADirectoryError: [Errno 20] Not a directory: '/home/jovyan/.cache/huggingface/datasets/downloads/73eca96a974f65c46cdf67acc0d23b976b9c57ce310d35ad7cfda8b6dc67001d/gov_report/train.jsonl'` How can I solve it? Thanks a lot