Gael Varoquaux

Results 40 issues of Gael Varoquaux

Installing this package forces an upgrade of core libraries (such as numpy). This poses a problem (basically, I need to explain environments to beginners).

When a user is accessing the dev version of the website, we should add a big red warning on the docs telling the user that she is accessing a development...

help wanted
Documentation

To be done between sprints, and not during a sprint :)

enhancement

At some point in the future, I think that we should rename the "SuperVectorizer" to "TableVectorizer", a name that is slightly more specific about what it does and more "serious"...

enhancement

Scikit-learn has a numpy-doc checker (and a blacklist of files to skip). We should see how they do it and port it to dirty-cat.

enhancement
help wanted
Documentation
CI / Build

In its current version, MinHashEncoder uses a dictionary to cache all the inputs. With a very large dataset, this will lead to a memory explosion. I suggest two modifications: *...
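The memory concern can be illustrated with a toy sketch. The two modifications suggested in the issue are truncated in this listing, so the code below is not a reconstruction of them. It only contrasts an unbounded dictionary cache with a size-bounded one from the standard library. `toy_hash` and `UnboundedCache` are hypothetical stand-ins and are not part of MinHashEncoder.

```python
from functools import lru_cache


def toy_hash(s: str) -> int:
    # Hypothetical stand-in for an expensive per-string computation.
    return sum(ord(c) for c in s) % 1000


class UnboundedCache:
    """Caches every input in a plain dict, so it grows with each new key."""

    def __init__(self, func):
        self.func = func
        self.store = {}

    def __call__(self, key):
        if key not in self.store:
            self.store[key] = self.func(key)
        return self.store[key]


unbounded = UnboundedCache(toy_hash)
# A bounded alternative: least recently used entries are evicted
# once the cache holds 1000 items.
bounded = lru_cache(maxsize=1000)(toy_hash)

for i in range(10_000):
    value = f"value-{i}"
    unbounded(value)
    bounded(value)

print(len(unbounded.store))           # 10000: one entry per unique input
print(bounded.cache_info().currsize)  # 1000: capped at maxsize
```

With many unique inputs the dictionary keeps one entry per input, while the bounded cache trades some recomputation for a fixed memory ceiling.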

Based on a comment by @adrinjalali: improve things slightly.

No Changelog Needed

### Describe the workflow you want to enable When I interact with non-advanced users, a recurrent difficulty for them is finding information and understanding what is going on. ### Describe...

New Feature
RFC

### Problem Description Using float64 instead of float32 typically incurs extra compute and memory costs, and users are often unaware of this. ### Feature Description We should add an option...

enhancement
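As a small illustration of the memory side of the float64 versus float32 point above, the sketch below compares the footprint of the same array in both dtypes using NumPy. The array shape is arbitrary, and the option proposed in the issue is truncated in this listing, so nothing here represents that option.

```python
import numpy as np

# Arbitrary example shape, chosen only for illustration.
n_rows, n_cols = 1_000, 100

X64 = np.zeros((n_rows, n_cols), dtype=np.float64)
X32 = X64.astype(np.float32)  # same values, half the bytes per element

print(X64.dtype.itemsize, X64.nbytes)  # 8 bytes per element, 800000 total
print(X32.dtype.itemsize, X32.nbytes)  # 4 bytes per element, 400000 total
```

The compute cost depends on hardware and on the operations involved, so it is not measured here.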

### Describe the issue linked to the documentation We should briefly mention to_datetime (explaining the benefit compared to pd.to_datetime) on https://skrub-data.org/stable/encoding.html#encoding-dates ### Suggest a potential alternative/fix One sentence on the...

documentation
no changelog needed