Tom van der Weide
Tom van der Weide
Hi! Thanks for reporting this. This should now be fixed (that file wasn't included in the package, so we added it). Could you retry? Kind regards, Tom
Thanks for your detailed bug report! This is caused by that _GCS_BUCKET was made empty in this commit: https://github.com/tensorflow/datasets/commit/b78fc27c4f830c590c28002b1a1d07ef14e588dc I'll contact the people who changed it, but with the holidays...
A fix was submitted. Could you test with tfds nightly if it now works?
Hey, Thanks for reporting this! The download links seem to be broken on the dataset website: https://github.com/AI-Lab-Makerere/ibean/. Could you perhaps contact the owner and ask for updated links?
Do you have a link that is working? It seems like that site has been removed...
The SeqIO get_dataset function in the TFDS data source calls tfds.load which makes sure the dataset is downloaded and prepared. However, in `_get_maybe_sharded_dataset` in seqio/experimental.py it is getting the shards...
I have submitted a fix in TFDS. Could you retry with tfds-nightly? If it's working, we'll release a new version of TFDS.
I just [submitted](https://github.com/tensorflow/datasets/commit/8bfc778b4d09196e2053d1aa648c7d93b0b35efd) the code that restores metadata loading from GCS>. We'll do a new release soon, but it should be nightly tomorrow.
Have you looked into builder_from_directories? (see https://github.com/tensorflow/datasets/blob/303814edf2437e607574d6c128d85ac6cfa6c30a/tensorflow_datasets/core/read_only_builder.py#L152) This merges multiple datasets.
Do I understand correctly that you have a non-static data source from which you create a TFDS dataset? The data source regularly has new data appended to it. When new...