libsvmdata

SUSY dataset fails

Open thomasryck opened this issue 3 years ago • 1 comment

The file susy.bz2 no longer exists at the URL it is fetched from.

thomasryck avatar Jun 17 '22 11:06 thomasryck

Hello, this does not seem straightforward to fix:

import lzma
from sklearn.datasets import load_svmlight_file

# Open the xz-compressed file and parse it as svmlight/libsvm data
with lzma.open("SUSY.xz") as f:
    res = load_svmlight_file(f)

yields

ValueError: could not convert string to float: b'6.646450757980346680e-0114:8.746069669723510742e-01'

It seems that a space is missing between e-01 and 14: in the original file, so the value of one feature runs straight into the index of the next.
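
One possible workaround is to patch the missing space before parsing. This is a minimal sketch, assuming every value is written in scientific notation with a two-digit exponent, as in the error message. `repair_line` and `repair_file` are hypothetical helpers, not part of libsvmdata or scikit-learn.

```python
import lzma
import re

# Assumption: an exponent is always "e", a sign, and exactly two digits,
# so any further digits followed by ":" start the next feature index.
_MISSING_SPACE = re.compile(r"(e[+-]\d{2})(?=\d+:)")


def repair_line(line: str) -> str:
    """Insert the space missing between a value and the next index."""
    return _MISSING_SPACE.sub(r"\1 ", line)


def repair_file(src: str, dst: str) -> None:
    """Write a repaired, uncompressed copy of an xz-compressed svmlight file."""
    with lzma.open(src, "rt") as fin, open(dst, "w") as fout:
        for line in fin:
            fout.write(repair_line(line))
```

After running `repair_file` on SUSY.xz, `load_svmlight_file` could be tried again on the output file. Whether this is the only defect in the file has not been checked.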

@thomasryck feel free to contribute to this open-source project by investigating further.

mathurinm avatar Jun 17 '22 17:06 mathurinm