Riccardo Cappuzzo

Results 216 comments of Riccardo Cappuzzo

> Technically, the coverage should be the same, because we're replacing an error with another one, but I can add a few tests to make codecov happier makes sense, but...

What is the status of this PR?

I think this issue can be split into two parts: 1. A simple PR that modifies the example showing that there is the need to shuffle 2. Possibly, a second...

As discussed IRL, to address this issue we can't shuffle the dataset as part of the example: the dataset should be shuffled before loading it. This means that the current...

> I think that you are pushing a bit the meaning of "order_by" and trying to make it do something that is not related to ordering. @GaelVaroquaux did you mean...

> What about subsampling the scatterplots? It could be that, or it could be a 2d histogram (or something along those lines) I opened the issue to keep track of...

Currently, the TableReport has the `order_by` parameter that can take a column, and that plots all numerical columns as functions of the given column if it is numerical or a...

A much bigger problem I found while testing out the different plots is that setting `order_by` in a large dataset causes the TableReport to take an extremely long time to...

Now I get the reasoning. I am fine with shelving the 2d plot idea, but the problem of plotting not even that large datasets with `order_by` set to anything remains,...