skrub icon indicating copy to clipboard operation
skrub copied to clipboard

TableVectorizer tests only check pandas dataframes

Open rcap107 opened this issue 6 months ago • 1 comments

Tests in test_table_vectorizer.py are all using example pandas only dataframes, rather than using df_module and testing pandas, pandas nullable types, and polars

We should update all the tests to address this

rcap107 avatar Jun 13 '25 15:06 rcap107

from the skrub meeting: this is low priority and something to do when we have spare time

rcap107 avatar Jun 16 '25 09:06 rcap107

Hi @rcap107, since this issue was discussed to be a low priority. If it is alright, I can take it up. With this, I will also be able to get more familiar with the codebase.

Let me know if it's okay for me to proceed, or if there is any other instruction for me.

shivanshutiwari35 avatar Jul 19 '25 14:07 shivanshutiwari35

Hi @shivanshutiwari35, thanks for offering your help! Yes, you're more than welcome to work on this issue.

You might want to work on this a few tests at a time (at least at the beginning), there are a lot of tests to go through and it might be easier to review and go over a few at a time rather than covering everything.

rcap107 avatar Jul 19 '25 17:07 rcap107

Thanks @rcap107 , I’ll start with a small batch of tests and open a PR for review. Based on the feedback, I’ll continue iterating in parts to keep it manageable.

shivanshutiwari35 avatar Jul 21 '25 08:07 shivanshutiwari35

Hey @shivanshutiwari35, thank you for your interest! If you haven't already, I recommend that you read the contributing documentation to get familiar with our workflow.

Vincent-Maladiere avatar Jul 21 '25 12:07 Vincent-Maladiere

Thanks, @rcap107, I’ll go through the contributing guide thoroughly and get started.

shivanshutiwari35 avatar Jul 21 '25 12:07 shivanshutiwari35

Closed by #1618

rcap107 avatar Nov 05 '25 17:11 rcap107