cdleong
cdleong
Trying to think through desired behavior if we create a continue flag, and it now interacts with the overwrite flag. directory exists already | continue flag is set to... |...
OK, working from outside in: - training.py is apparently what runs when we do joeynmt train - It takes in just one arg currently, the path to a config.yml. Presumably...
Also, based on a slack discussion with Julia, the default should be continue=False, overwrite=False, so as to err on the non-overwrite side, as maybe they used a config with overwrite...
In related news: https://www.philschmid.de/sagemaker-spot-instance someday having S3 checkpointing would be cool!
One suggestion in the slack would be to break the new notebook code into two parts * One notebook that takes in a HuggingFace dataset at the top, and proceeds...
> > > One suggestion in the slack would be to break the new notebook code into two parts > > * One notebook that takes in a HuggingFace dataset...
https://colab.research.google.com/drive/1RWOle7RHy_wq0uGWxmAq1ZfmEQIFsCHj#scrollTo=h1Ddy4_AOKdm could make for a starting point. This notebook shows how to download a HuggingFace dataset and write it out to files of the format JoeyNMT expects... I think
In my case I simply took the spaces out, and that prevented any issues. As in, I used `/content/drive/MyDrive/` instead of `/content/drive/My Drive/`, and also manually renamed my `Hani Machine...
Ah, I think maybe I forgot that you can right-click the Drive name in Google Colab and rename it.  I think I changed my drive name to `MyDrive` and...
I will rename it again and see if it breaks.