Joaquin Vanschoren
Joaquin Vanschoren
Thanks! Fixed by https://github.com/openml/OpenML/pull/1106 Please check and close if everything works as expected.
Hi richherr, sorry for the late reply. Most likely the validation email was sent to your spam folder. Did you check there? If resetting your password fails, you can try...
Hi all, Did you see the API docs? https://new.openml.org/api Under run: /run/trace/{id} I do notice now that the example has a broken link. More documentation is always better, but I...
I fixed 61, 250, and 252. I probably touched 250 and 252 when working on the chunked data issue. Not sure why 61 was altered. All the rest seem to...
I don't understand what's happening here. Server says that the checksum is: ``` > wget https://www.openml.org/api/v1/data/310 && cat 310 | grep md5_checksum 250dde8b316c3de43ce11f647675554b ``` When I download it manually from...
Indeed dataset 310 has charset=unknown-8bit on the server. 2 options: - change the charset of the file on the server - adapt the python API to compute the md5 hash...
Hmm, I can fix the encoding on the server, but that also changes the md5 hash. The first line actually outputs ASCII and fixes the unknown encoding issue. If we...
1) It's not about a specific charset, but we should check whether the encoding is known and parses without errors. If relevant, most files on the server use charset 'us-ascii'....
I searched for all non- utf8/ascii dataset files, and found: - 6 dataset files with an unknown 8bit encoding - 11 datasets files with encoding Latin-1 (iso-8859-1) - 1 dataset...
If the encoding is unknown, the encoding is labeled something like 'unknown-8bit'. I assume you want PHP, so maybe you could use this to see if the encoding could be...