
Dataset problem

Open emigmo opened this issue 6 years ago • 5 comments

Hi Tianhe, there are some problems with the dataset from http://rail.eecs.berkeley.edu/datasets/mil_data.zip. I used the following code to check the files in './data/sim_push':

import glob
import pickle
file_dir = './data/sim_push'
file_list = glob.glob(file_dir + "/*.pkl")
bad_file = []
for i in range(len(file_list)):
    try:
        # Pickle files must be opened in binary mode.
        with open(file_list[i], 'rb') as f:
            data = pickle.load(f)
    except Exception:
        # Record every file that fails to load.
        bad_file.append(file_list[i])
print("open(file_list): bad_file: ", len(bad_file))
print(bad_file)

There are 78 files that can't be loaded with pickle. If I change open(file_list[i], 'rb') to open(file_list[i]), then 753 files can't be loaded (that is, none of the files load normally).

I tried loading your pickle files with both Python 2.7.6 and Python 3.6.3 and got the same problem in each. With Python 2.7.6, print(pickle.format_version) gives 2.0; with Python 3.6.3, it gives 4.0.

I guess the problem may be the version of the pickle package. Could you tell me which version of pickle you used when you dumped the pickle files?
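Note that pickle.format_version describes the installed pickle module, not any particular file. As a rough way to check which protocol a given file was written with, here is a minimal sketch. The helper name pickle_protocol is hypothetical, and it relies only on the fact that pickles written with protocol 2 or later begin with the PROTO opcode (byte 0x80) followed by the protocol number; protocols 0 and 1 carry no such header, so the helper returns None for them.

```python
import pickle


def pickle_protocol(path):
    """Return the protocol number stored in a pickle file.

    Hypothetical helper: returns None when the file has no PROTO
    header, which is the case for protocols 0 and 1.
    """
    with open(path, "rb") as f:
        header = f.read(2)
    # Protocol 2+ pickles start with 0x80 followed by one byte
    # holding the protocol number.
    if len(header) == 2 and header[:1] == b"\x80":
        return ord(header[1:2])
    return None
```

Calling pickle_protocol on one of the failing files and on one of the working files would show whether they were written with different protocols.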

emigmo avatar Aug 05 '18 13:08 emigmo

What's the error message when loading the pickle files fails?

tianheyu927 avatar Aug 06 '18 19:08 tianheyu927

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa8 in position 269495: invalid start byte

emigmo avatar Aug 07 '18 08:08 emigmo

Any ideas here? Otherwise, could you re-dump and re-upload mil_data.zip and tell me which pickle version you used?

emigmo avatar Aug 11 '18 14:08 emigmo

Did you solve this problem?

raozhongyu avatar Oct 30 '20 04:10 raozhongyu

Try this:

results = pickle.load(f, encoding="latin1")
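For context, the encoding argument exists only in Python 3, where it controls how 8-bit strings pickled by Python 2 are decoded (the default is ASCII). A minimal sketch of how the line above might be wrapped, assuming the files were written by Python 2; the helper name load_py2_pickle is hypothetical, and whether this fixes every file in the dataset has not been confirmed in this thread.

```python
import pickle


def load_py2_pickle(path):
    """Load a pickle under Python 3, retrying with latin1 decoding.

    Hypothetical helper: the fallback assumes the file was written
    by Python 2 and contains non-ASCII byte strings.
    """
    # Pickle files must always be opened in binary mode.
    with open(path, "rb") as f:
        try:
            return pickle.load(f)
        except UnicodeDecodeError:
            # Rewind and decode Python 2 str objects as latin1,
            # which maps every byte value to a character.
            f.seek(0)
            return pickle.load(f, encoding="latin1")
```

Usage would look like data = load_py2_pickle(file_list[i]) in place of the open/pickle.load pair in the checking loop.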

igiorgi avatar Mar 22 '21 14:03 igiorgi