imitation-learning
Issues when loading one episode
First, thanks for releasing your dataset!
I'm trying to load it using h5py, and I've found that one file in the training set (data_06790.h5) consistently causes an error of the sort:
IOError: Unable to open file (Unable to find a valid file signature)
With this episode removed from the dataset, everything else loads. I've downloaded the data twice now, and both downloads have the same MD5 hash, so I don't think it's corruption from the download:
8b61e53ec55e1f79004d79fb61ab8d6c CORL2017ImitationLearningData.tar.gz
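For anyone who wants to repeat the checksum comparison, here is a minimal sketch using Python's standard `hashlib`. The helper name `md5_of_file` and the chunk size are illustrative choices, not part of the dataset's tooling.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reads keep memory use flat even for a multi-gigabyte archive.
    `md5_of_file` is a hypothetical helper name used for illustration.
    """
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Calling `md5_of_file("CORL2017ImitationLearningData.tar.gz")` and comparing the result with the hash above would confirm that a local copy matches. A matching hash only shows that the downloads are identical to each other; it cannot tell whether a file was already damaged when the archive was built.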
Hello @yanii, I think this is probably correct. It could be that some parts of the data were corrupted along the way. My data loading system just ignores corrupted h5 files, so I probably missed this one.
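As a rough illustration of skipping unreadable files, the sketch below checks each file for the HDF5 format signature, the missing piece that the "Unable to find a valid file signature" error refers to. This is not the loader used for the reported results. The names `has_hdf5_signature` and `split_readable` and the `max_offset` search limit are assumptions made for this example.

```python
# The 8-byte HDF5 format signature. Per the HDF5 file format, it sits at
# byte offset 0, or at 512, 1024, 2048, ... (each offset doubling).
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def has_hdf5_signature(path, max_offset=1 << 20):
    """Return True if the HDF5 signature is found at a valid offset.

    `max_offset` is an arbitrary cap chosen for this sketch so the search
    terminates quickly; the format itself does not define such a limit.
    """
    with open(path, "rb") as f:
        offset = 0
        while offset <= max_offset:
            f.seek(offset)
            if f.read(len(HDF5_SIGNATURE)) == HDF5_SIGNATURE:
                return True
            offset = 512 if offset == 0 else offset * 2
    return False


def split_readable(paths):
    """Partition paths into (readable, skipped) based on the signature check."""
    readable, skipped = [], []
    for path in paths:
        try:
            ok = has_hdf5_signature(path)
        except OSError:
            # Missing or unreadable files are skipped as well.
            ok = False
        (readable if ok else skipped).append(path)
    return readable, skipped
```

Returning the skipped paths alongside the readable ones makes it easy to log which episodes were dropped, so a file like data_06790.h5 does not go unnoticed. A file that passes this check can still be damaged further in, so opening it with `h5py.File` inside a `try`/`except` remains the more thorough test.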
Oh, OK, so it's likely that the results you reported also excluded that file? Hoping to have some comparable numbers on the metric front :-)
Yes, the numbers I reported likely also excluded this file. But anyway, I think 200 images shouldn't make much difference to the model :-).
I will download again and check. Maybe there is some mistake, since I was not using all the variables. Thanks again for contributing!
The target data vector is in the correct order by the way, false alarm on that front, I had messed up something!
    import h5py

    filename = 'data_06790.h5'
    try:
        data = h5py.File(filename, 'r')
    except IOError:  # raised when the file cannot be opened (alias of OSError in Python 3)
        print('file has bugs')
    # the output is 'file has bugs'