Jiting Xu
## Description of changes While working on a data science project with a table of more than 128 columns, I found that running `table.info()` throws the error...
### What happened? Here is code that can be used to reproduce the error: ```python import ibis ibis.options.interactive = True con = ibis.bigquery.connect(project_id="xxx", dataset_id="xxx") t = ibis.memtable({ "x": ["a", "b"],...
## Description of changes While working on https://github.com/ibis-project/ibis/pull/9139#issuecomment-2101711159, I hit a `RecursionError: maximum recursion depth exceeded while calling a Python object` when unioning a large number of tables,...
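One plausible reading of this error is that chaining binary unions one after another builds an expression nested as deep as the number of tables. The sketch below illustrates that effect with plain tuples standing in for union nodes; it is not the change made in the PR, and `chain_union`, `balanced_union`, `node`, and `depth` are hypothetical names introduced only for this illustration.

```python
from functools import reduce


def chain_union(items, union):
    """Left-fold: ((t0 U t1) U t2) U ... gives nesting depth n - 1."""
    return reduce(union, items)


def balanced_union(items, union):
    """Pairwise reduction: nesting depth grows with log2(n) instead of n."""
    items = list(items)
    if not items:
        raise ValueError("need at least one item")
    while len(items) > 1:
        # Combine neighbours two at a time.
        paired = [union(items[i], items[i + 1]) for i in range(0, len(items) - 1, 2)]
        # Carry an unpaired last item over to the next round.
        if len(items) % 2:
            paired.append(items[-1])
        items = paired
    return items[0]


def node(left, right):
    """Stand-in for a binary union expression (assumption: not an ibis object)."""
    return ("union", left, right)


def depth(tree):
    """Measure nesting depth iteratively so the measurement itself cannot recurse."""
    stack, deepest = [(tree, 0)], 0
    while stack:
        current, level = stack.pop()
        if isinstance(current, tuple):
            stack.append((current[1], level + 1))
            stack.append((current[2], level + 1))
        else:
            deepest = max(deepest, level)
    return deepest


leaves = [f"t{i}" for i in range(2000)]
chain_depth = depth(chain_union(leaves, node))
balanced_depth = depth(balanced_union(leaves, node))
print(chain_depth, balanced_depth)
```

With 2000 inputs the chained form nests past Python's default recursion limit of 1000, while the pairwise form stays shallow. Whether this applies depends on how the library walks its expression tree.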
Adding a support matrix for ibisml `Step`. There are two ways to collect information for this: - Execute each step against the actual backend. - We need to set up the actual...
Hi, I am not sure if this is the right way to test `Step` on different backends. I am just using this as a starting point for collecting better solutions and...
Enable unit tests on different backends - Set up backend testing environments - Enable unit tests on different backends - Add this to CI
Randomly partition a dataset into subsets while ensuring reproducibility. Reference: - https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html - https://rsample.tidymodels.org/reference/initial_split.html
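One common way to get a reproducible random partition is to hash a unique key together with a seed and assign each row by hash bucket, so that the split does not depend on row order. The following is a minimal pure-Python sketch of that idea, not the implementation proposed in this issue; `assign_split`, `hash_split`, and their parameters are hypothetical names.

```python
import hashlib


def assign_split(key, test_size=0.25, seed=42, num_buckets=10_000):
    """Return "test" or "train" for a row, based only on its key and the seed."""
    digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % num_buckets
    return "test" if bucket < test_size * num_buckets else "train"


def hash_split(rows, key, test_size=0.25, seed=42):
    """Partition rows (dicts) into train and test lists by hashing row[key]."""
    train, test = [], []
    for row in rows:
        target = test if assign_split(row[key], test_size, seed) == "test" else train
        target.append(row)
    return train, test


rows = [{"id": i, "value": i * i} for i in range(1000)]
train, test = hash_split(rows, key="id", test_size=0.25, seed=42)
print(len(train), len(test))
```

Because membership depends only on the key and the seed, re-running the split or reordering the input yields the same subsets. The test fraction is approximate rather than exact, and the key column must uniquely identify rows.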
Unknown categories are currently ignored by the encoding implementations. While we should consider adding an option to handle this in the future, it's not a high priority at the...
Backlog item, details to follow.
~We are currently targeting the [NVTabular demo on RecSys2020 Challenge](https://nvidia-merlin.github.io/NVTabular/main/examples/winning-solution-recsys2020-twitter/01-02-04-Download-Convert-ETL-with-NVTabular-Training-with-XGBoost.html) as a demo workflow.~ **Update:** Due to the RecSys2020 demo data being unavailable (and against Twitter's terms to share), we...