liac-arff icon indicating copy to clipboard operation
liac-arff copied to clipboard

Allow tab separated instance data

Open jondo opened this issue 9 years ago • 4 comments

I have got a lot of ARFF files with tab separated instance data, and Weka can read them. Could you please also read them? (I know that officially "Attribute values for each instance are delimited by commas.")

jondo avatar Oct 08 '15 15:10 jondo

Hi, I don't think liac-arff should support anything which is not in the documentation. Regarding that, I filed an issue at the wekalist.

mfeurer avatar Oct 09 '15 12:10 mfeurer

Arff specs are updated. Do you want to work on this and submit a pull request?

mfeurer avatar Oct 12 '15 08:10 mfeurer

Great to hear! Sorry - I have switched to using pandas.read_table for now, and I will stay with that because this fits well to my next step of calling pandas.merge for joining tables.

jondo avatar Oct 15 '15 07:10 jondo

load and loads could receive a parameter delimiter (just like csv module), with defaults to ,, and simply redirect it to the data conversion procedure (which uses csv).

renatopp avatar Oct 15 '15 13:10 renatopp