OpenML
OpenML copied to clipboard
Sparse arff with empty line fails on publish of the dataset entity
Description
A dataset with the following sparse_arff representation fails to be published:
'% Synthetic dataset created from a Pandas DataFrame with Sparse columns\n@RELATION Temp\n\n@ATTRIBUTE input1 REAL\n@ATTRIBUTE input2 REAL\n@ATTRIBUTE y REAL\n\n@DATA\n{ }\n{ 1 1.0,2 1.0 }\n{ 0 0.0,2 1.0 }\n{ 0 1.0,1 1.0 }\n'
equivalently,
% Synthetic dataset created from a Pandas DataFrame with Sparse columns
@RELATION Temp
@ATTRIBUTE input1 REAL
@ATTRIBUTE input2 REAL
@ATTRIBUTE y REAL
@DATA
{ }
{ 1 1.0,2 1.0 }
{ 0 0.0,2 1.0 }
{ 0 1.0,1 1.0 }
Expected Results
The dataset be published.
Actual Results
The following error:
OpenMLServerException: Error parsing dataset ARFF file - Arff error in dataset file: trailing characters on data line (l.9)