thai2nmt
thai2nmt copied to clipboard
Hosting huggingface/datasets version on Internet Archive
Since git-lfs has limits, I'm considering changing the huggingface/datasets download link to: https://archive.org/details/scb_mt_enth_2020
Is this still an issue? If things area better now, may we have checkpoint/data on huggingface?
Ideally, I'd like to see this in a form ready to be load:
model_name = "..."
tokenizer = XLMTokenizer.from_pretrained(model_name)
model = XLMTokenizer.from_pretrained(model_name)
translated = model.generate(...)
Anything I can help with?