Stephanie Kirmer

Results 6 comments of Stephanie Kirmer

That's not the issue, sorry if I was unclear. The issue is that the test sample should not default to `0.30000000000000004 ` when it is not specified. The error is...

That's a problem, though, because when I use dask my data gets large enough that the rounding error makes my train and test set sizes add up to one more...

I'd also note that all numbers greater than .66 appear to result in some error in this regard- and you're probably right that the float arithmetic is the problem- so...

Requested pointer to the spot where sklearn handles this problem: https://github.com/scikit-learn/scikit-learn/blob/8ea176ae0ca535cdbfad7413322bbc3e54979e4d/sklearn/model_selection/_split.py#L1826-L1841

I think this file we're compiling with the one-line descriptions should link to the respective data dictionaries. Edit: I gotta learn to read. I think that this might be more...

As we were discussing in the slack channel, just to document it here, ggraph might be a good tool to use to illustrate the links between our datasets in a...