DataFrame LoadCsv improvements

Open vsfeedback opened this issue 1 year ago • 0 comments

This issue has been moved from a ticket on Developer Community.

The DataFrame.LoadCsv method could be improved in a number of ways:

Use double when the precision in the data allows it. Now float is always used when the data is floating point.
Allow the user to define NaN in floating point columns data. In R code that we are also using, NA is typically used.
Speed improvements if possible.
I think there is a problem with defining culture info in the parameter. I have to do: Thread.CurrentThread.CurrentCulture = new CultureInfo("en-US"); before the call to LoadCsv to make floating point data with decimal points load correctly (in Sweden).

(private comment, text removed)

Aug 05 '24 23:08 vsfeedback