imitation-learning Issues when loading one episode

Issues when loading one episode

Open yanii opened this issue 7 years ago • 5 comments

First, thanks for releasing your dataset!

I'm trying to load it using h5py, and I've found that one file in the training set (data_06790.h5) consistently causes an error of the sort:

IOError: Unable to open file (Unable to find a valid file signature)

After removing this episode from the dataset, everything else loads. I've downloaded the data twice now, and both downloads have the same MD5 hash, so I don't think it's corruption from the download:

8b61e53ec55e1f79004d79fb61ab8d6c CORL2017ImitationLearningData.tar.gz
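For anyone who wants to repeat the hash comparison above, here is a minimal sketch using Python's standard `hashlib`. The helper name `md5_of_file` and the chunk size are illustrative choices, not something from the dataset's tooling; the archive name and expected digest are the ones quoted in this post.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks
    so that a large archive does not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # iter() with a sentinel keeps calling f.read until it returns b""
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Example use (assumes the archive is in the current directory):
# expected = "8b61e53ec55e1f79004d79fb61ab8d6c"
# print(md5_of_file("CORL2017ImitationLearningData.tar.gz") == expected)
```

A matching digest only shows that the download is identical to the published archive; it says nothing about whether a file inside the archive was already damaged before it was packed.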

Jan 29 '18 15:01 yanii

Hello @yanii, I think this is probably correct. It is possible that some parts of the data were corrupted along the way. My data loading system just ignores corrupted h5 files, so I probably missed this one.
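The loader mentioned above is not shown in this thread. As a rough sketch of the same idea, the snippet below splits a list of paths into files that carry the HDF5 format signature and files that should be skipped. It relies on the HDF5 file format rule that the 8-byte signature sits at byte offset 0, 512, 1024, 2048, and so on. The names `has_hdf5_signature` and `split_readable` are hypothetical, and this is only a pre-filter: a file with a valid signature can still be damaged further in, so wrapping `h5py.File(path, 'r')` in a `try`/`except` remains the fuller check.

```python
import os

# 8-byte signature that marks the start of an HDF5 superblock
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def has_hdf5_signature(path):
    """Return True if the signature is found at offset 0, 512, 1024, 2048, ..."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        offset = 0
        while offset + len(HDF5_SIGNATURE) <= size:
            f.seek(offset)
            if f.read(len(HDF5_SIGNATURE)) == HDF5_SIGNATURE:
                return True
            # Candidate offsets double after the first 512-byte step
            offset = 512 if offset == 0 else offset * 2
    return False


def split_readable(paths):
    """Split paths into (good, skipped) without raising on bad files."""
    good, skipped = [], []
    for path in paths:
        try:
            ok = has_hdf5_signature(path)
        except OSError:
            # Missing or unreadable files are skipped as well
            ok = False
        (good if ok else skipped).append(path)
    return good, skipped
```

Keeping the skipped list, rather than dropping those files silently, makes it easier to report exactly which episodes were excluded from training.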

Jan 29 '18 15:01 felipecode

Oh, OK, so it's likely that the results you reported also excluded that file? Hoping to have some comparable numbers on the metric front :-)

Jan 29 '18 15:01 yanii

Yes, the numbers I reported likely also excluded this file. But anyway, I think 200 images shouldn't make much difference to the model :-)

I will download it again and check. Maybe there is some mistake, since I was not using all the variables. Thanks again for contributing!

Jan 29 '18 16:01 felipecode

The target data vector is in the correct order, by the way. False alarm on that front; I had messed something up!

Jan 30 '18 18:01 yanii

    import h5py

    filename = 'data_06790.h5'
    try:
        data = h5py.File(filename, 'r')
    except (IOError, OSError):
        # raised when h5py cannot find a valid file signature
        print('file has bugs')
    # the output is 'file has bugs'

Mar 09 '18 07:03 Fengmoon93
