tutorials icon indicating copy to clipboard operation
tutorials copied to clipboard

pipeline_tutorial failing due to dead torchtext link

Open clee2000 opened this issue 1 year ago • 3 comments

Line 55 of https://github.com/pytorch/tutorials/blob/082c8b1bddb48b75f59860db3679d8c439238f10/intermediate_source/pipeline_tutorial.py is using torchtext to download a dataset that can’t be accessed right now (maybe got taken down, I’m looking for an alternative link but torchtext is no longer maintained)

Can this tutorial be rewritten to use a different dataset? Can the entire tutorial be deprecated?

Ex: https://github.com/pytorch/tutorials/actions/runs/7992713944/job/21826864521 requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-2-v1.zip

cc @kwen2501 @H-Huang @wconstab

clee2000 avatar Feb 21 '24 21:02 clee2000

I'd vote for deprecating the tutorial as nobody maintains the software or the tutorial now

kwen2501 avatar Feb 21 '24 23:02 kwen2501

I've got the same error with Transformer tutorial (https://pytorch.org/tutorials/beginner/transformer_tutorial.html).

karelvaculik avatar Feb 22 '24 07:02 karelvaculik

Will send a PR to deprecate. Thanks!

svekars avatar Feb 22 '24 17:02 svekars