
Hosting huggingface/datasets version on Internet Archive

Open cstorm125 opened this issue 4 years ago • 1 comment

Since git-lfs has limits, I'm considering changing the huggingface/datasets download link to: https://archive.org/details/scb_mt_enth_2020

cstorm125 Dec 03 '20 14:12

Is this still an issue? If things are better now, may we have the checkpoint/data on Hugging Face?

Ideally, I'd like to see this in a form that is ready to load:

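from transformers import XLMTokenizer, XLMWithLMHeadModel

model_name = "..."
tokenizer = XLMTokenizer.from_pretrained(model_name)
# load a model class here, not the tokenizer again (the XLM class is only a guess)
model = XLMWithLMHeadModel.from_pretrained(model_name)
translated = model.generate(...)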

Anything I can help with?

chrisadas May 02 '23 00:05