libsvmdata
libsvmdata copied to clipboard
SUSY dataset fails
The file susy.bz2 does not exist anymore on the fetched url.
Hello, This seems non immediate to fix:
import lzma
from sklearn.datasets import load_svmlight_file
f = lzma.open("SUSY.xz")
res = load_svmlight_file(f)
yields
ValueError: could not convert string to float: b'6.646450757980346680e-0114:8.746069669723510742e-01'
it seems a space is missing between e-01 and 14: in the original file
@thomasryck feel free to contribute to this open source project by investigating more.