Riccardo Cappuzzo
Talking with @Vincent-Maladiere, we thought of improving the error message by detecting when one of X and y is sampled, but the other isn't. This should be done in `.skb.apply`...
ready for review, since we're not going to detect subsampling in this PR
> Hello [@rcap107](https://github.com/rcap107)! Shouldn't this be something defined in scikit-learn [class](https://github.com/scikit-learn/scikit-learn/blob/main/sklearn/utils/_bunch.py) itself?

Hey @MarieSacksick! In skrub, the `Bunch` class is only used for fetching the datasets, so I think it...
> On this topic, I feel that exposing filenames and changing some examples to load from filename is a higher priority than changing the repr

Agreed, this issue is lower...
I think constant features should be kept, because even if they may not be useful for learning, they might still be useful for the user in some way. I am...
FWIW, the misalignment between bins and labels is something I've seen in general matplotlib use, so I don't know how it could be addressed specifically in the TableReport.

> I...
This was (partially) addressed in #1340 for the expressions. From what I remember, the hang-up was deciding where to display the message about having too many columns. By keeping...
skrub meeting suggestion: `ApplyToCols` or `ApplyToColumns`
I just realized there are more `sbd` functions spread around other files that should be updated, like in `_datetime_encoder.py` and `_to_datetime.py`.