OpenML icon indicating copy to clipboard operation
OpenML copied to clipboard

Sparse arff with empty line fails on publish of the dataset entity

Open Neeratyoy opened this issue 4 years ago • 0 comments

Description

A dataset with the following sparse_arff representation fails to be published:

'% Synthetic dataset created from a Pandas DataFrame with Sparse columns\n@RELATION Temp\n\n@ATTRIBUTE input1 REAL\n@ATTRIBUTE input2 REAL\n@ATTRIBUTE y REAL\n\n@DATA\n{ }\n{ 1 1.0,2 1.0 }\n{ 0 0.0,2 1.0 }\n{ 0 1.0,1 1.0 }\n'

equivalently,

% Synthetic dataset created from a Pandas DataFrame with Sparse columns
@RELATION Temp

@ATTRIBUTE input1 REAL
@ATTRIBUTE input2 REAL
@ATTRIBUTE y REAL

@DATA
{  }
{ 1 1.0,2 1.0 }
{ 0 0.0,2 1.0 }
{ 0 1.0,1 1.0 }

Expected Results

The dataset be published.

Actual Results

The following error: OpenMLServerException: Error parsing dataset ARFF file - Arff error in dataset file: trailing characters on data line (l.9)

Neeratyoy avatar Mar 02 '20 15:03 Neeratyoy