torchdatasets

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Results: 7 torchdatasets issues, sorted by most recently updated

First, thanks for an elegant library that has saved me a significant amount of time over the past couple of years. Now, the problem: since the name change, I've been trying...

```
Traceback (most recent call last):
  File "test-dataset.py", line 1, in <module>
    from dataset import func
  File "/root/torch-cache-test/dataset/func.py", line 3, in <module>
    from . import nocache
  File "/root/torch-cache-test/dataset/nocache.py", line 1, in <module>
    from...
```

Using `.cache()` (with the default memory cacher) does nothing when the Dataset is used in a multi-process DataLoader. This is a gotcha that should probably be pointed out in the documentation...
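
A minimal sketch of the gotcha, assuming the `td.Dataset` base class and `.cache()` method shown in the project README; the dataset itself is illustrative. Each DataLoader worker is a separate process holding its own copy of the dataset, so entries cached inside a worker are never visible to other workers or to subsequent epochs in the parent process:

```python
import torch
import torchdatasets as td


class ExpensiveDataset(td.Dataset):  # td.Dataset adds .map()/.cache()
    def __init__(self):
        super().__init__()  # initializes the caching machinery

    def __getitem__(self, index):
        print(f"computing item {index}")  # fires every epoch despite .cache()
        return torch.tensor(index)

    def __len__(self):
        return 4


dataset = ExpensiveDataset().cache()  # default in-memory cacher

# Each worker process receives a copy of `dataset`, so items cached in a
# worker never propagate back. With num_workers=0 the second epoch would
# be served entirely from the cache.
loader = torch.utils.data.DataLoader(dataset, num_workers=2)
for epoch in range(2):
    for _ in loader:
        pass  # "computing item ..." prints in both epochs
```

A process-safe workaround is a disk-backed cacher shared by all workers, at the cost of serialization overhead.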

Concatenation of two datasets with the logical operator works as intended (`concat_2 = images | images`), while concatenation of more datasets (`concat_3 = images | images | images`) yields a...
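
A hedged reproduction sketch; `images` stands for any torchdatasets dataset, and the length checks encode my assumption about the intended behavior. Since `|` is left-associative, the three-way case concatenates an already-concatenated dataset, which is presumably where the chained case diverges:

```python
# `images` is any torchdatasets dataset supporting `|` and len().
concat_2 = images | images
assert len(concat_2) == 2 * len(images)  # works as intended

# Equivalent to (images | images) | images: the left operand is itself a
# concatenation, which appears to be the case that misbehaves.
concat_3 = images | images | images
assert len(concat_3) == 3 * len(images)  # reported to fail
```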

Hi! From what I can see, there is currently no simple way in PyTorch to perform stratified subsampling of the training dataset. I think it fits this library's scope...
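
Until something like this lands in the library, here is a sketch of the requested behavior using scikit-learn for the stratification and a plain `torch.utils.data.Subset`; `stratified_subsample` is a hypothetical helper, and `labels` is assumed to be a sequence of class labels aligned with the dataset:

```python
from sklearn.model_selection import train_test_split
from torch.utils.data import Subset


def stratified_subsample(dataset, labels, fraction, seed=0):
    """Return a Subset whose class proportions match `labels`."""
    indices, _ = train_test_split(
        list(range(len(dataset))),
        train_size=fraction,  # keep this fraction of every class
        stratify=labels,      # preserve the label distribution
        random_state=seed,
    )
    return Subset(dataset, indices)


# Usage: keep a stratified 10% of the training set.
# subset = stratified_subsample(train_dataset, train_labels, fraction=0.1)
```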

Can you share some benchmarks comparing performance with and without this library? Thank you.

Thanks for this amazing library. I was wondering: for large datasets with millions of images, would it make sense to cache into a single file (e.g., HDF5) instead of creating... (labeled: enhancement)
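
A minimal sketch of that idea as a custom cacher, assuming cachers follow the `__contains__`/`__setitem__`/`__getitem__` protocol described in the torchdatasets documentation; `HDF5Cacher` is a hypothetical name, samples are assumed to be tensors or arrays, and the sketch is not safe for concurrent writers such as multi-worker DataLoaders:

```python
import h5py
import numpy as np
import torch


class HDF5Cacher:
    """Cache every sample as an HDF5 dataset keyed by its index,
    keeping everything in one file instead of millions of small ones."""

    def __init__(self, path):
        self.file = h5py.File(path, "a")  # create file or append to it

    def __contains__(self, index):
        return str(index) in self.file

    def __setitem__(self, index, data):
        self.file.create_dataset(str(index), data=np.asarray(data))

    def __getitem__(self, index):
        return torch.from_numpy(self.file[str(index)][()])


# Usage (hypothetical): dataset.cache(HDF5Cacher("cache.h5"))
```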
