Dataset Viewer issue for gsarti/flores_101

Open lewtun opened this issue 2 years ago • 2 comments

Link

https://huggingface.co/datasets/gsarti/flores_101

Description

It seems like streaming isn't supported for this dataset:

Server Error
Status code:   400
Exception:     NotImplementedError
Message:       Extraction protocol for TAR archives like 'https://dl.fbaipublicfiles.com/flores101/dataset/flores101_dataset.tar.gz' is not implemented in streaming mode. Please use `dl_manager.iter_archive` instead.
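For context on what the error message is asking for: streaming can't unpack a TAR archive to disk first, so the archive has to be read sequentially, one member at a time, which is the access pattern `dl_manager.iter_archive` is meant for. Below is a rough stdlib-only sketch of that pattern, not the `datasets` implementation. The helper names (`iter_tar_members`, `build_sample_archive`, `read_sentences`) and the file names inside the sample archive are made up for illustration.

```python
import io
import tarfile


def iter_tar_members(fileobj):
    """Yield (path, file-like) pairs from a gzipped TAR stream, in archive order.

    Hypothetical helper: mode "r|gz" reads the archive as a forward-only
    stream, so each member must be consumed before moving to the next one.
    """
    with tarfile.open(fileobj=fileobj, mode="r|gz") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            yield member.name, tar.extractfile(member)


def build_sample_archive():
    """Build a tiny in-memory .tar.gz with two made-up sentence files."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w:gz") as tar:
        for name, text in [("dev/eng.dev", "Hello\n"), ("dev/fra.dev", "Bonjour\n")]:
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    buf.seek(0)
    return buf


def read_sentences(fileobj):
    """Read every member while it is the current one in the stream."""
    return {
        path: f.read().decode("utf-8").splitlines()
        for path, f in iter_tar_members(fileobj)
    }


if __name__ == "__main__":
    for path, lines in read_sentences(build_sample_archive()).items():
        print(path, lines)
```

A loading script written this way never needs random access to the archive, which is why the same loop can work over a remote file in streaming mode.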

Owner

No

lewtun, Jun 26 '22 11:06

Related to https://github.com/huggingface/datasets/issues/4562#issuecomment-1166911751

I'll assign @albertvillanova

severo, Jun 27 '22 08:06

I'm just wondering why we don't have this dataset under:

  • the facebook namespace
  • or the canonical dataset flores: why does this only have 2 languages?

albertvillanova, Jun 27 '22 09:06

FWIW, the dataset viewer is working now. Renaming the issue.

severo, Sep 25 '23 12:09