Pieter Gijsbers
A `TPOT.fit` call may fail when there are outlier minority classes (with certain metrics). ## Context of the issue When running the benchmark we sometimes encountered this issue, for instance...
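One plausible cause of such a failure (an assumption; the excerpt above does not state it) is that an internal cross-validation fold ends up with no samples of a very rare class, which leaves some metrics undefined. A minimal sketch of a pre-flight check follows. The helper name `rare_classes` and the `n_splits` threshold are hypothetical and not part of TPOT:

```python
from collections import Counter


def rare_classes(y, n_splits=5):
    """Return {label: count} for classes with fewer than n_splits samples.

    Assumption: a class with fewer members than folds cannot appear in
    every fold of a stratified split, so it is flagged as a risk.
    """
    counts = Counter(y)
    return {label: n for label, n in counts.items() if n < n_splits}


# Toy target with one outlier minority class "c".
y = ["a"] * 50 + ["b"] * 48 + ["c"] * 2
print(rare_classes(y, n_splits=5))
```

Running a check like this before fitting makes it possible to report the problematic labels up front instead of failing partway through the search.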
As per the title, it will be useful for ASKL2 to have a configurable `portfolio` and `policy selector`. It's beneficial for research (avoiding 'cheating' through meta-learning in a benchmark) or...
I encountered an issue while using `mljar-supervised` with a boolean series as target. The produced predictions are not in `[False, True]` but instead in `[0, 1]`. ```python import numpy as np...
Related to https://github.com/dask/distributed/issues/1932, but this one asks to silence warnings for a different API component, and for all warnings. I have a setup where I expect some jobs to fail, but I do...
**Proposed feature:** Allow specifying the account under which new student assignment repositories are created. **Background** I'm a TA for a course taught using GitHub Classroom. The classroom/course is owned...
I wanted to create a task for [/d/43072](https://www.openml.org/d/43072) with target `upselling` (binary). I accidentally marked it as a `SUPERVISED_REGRESSION` task (since the previous task I made was regression): ```python import...
When editing `ignore_attribute` field with the data edit API with multiple attributes to ignore, only the last one is used: ``` import openml new_id = openml.datasets.fork_dataset(41702) # new ID was:...
Classification and Regression tasks feature estimation procedures: (ordered) holdout, r-repeated k-fold cross-validation and test on training data. Currently the split files are organized as follows (ARFF notation): ``` @attribute type...
Proposal to include `numberOfInstances` and `numberOfFeatures` qualities in the dataset description
The dataset `description.xml` contains some of the most useful meta-data of the dataset. I think the number of instances/rows and the number of features should be added here. Those features...
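To illustrate what the proposal would allow, here is a minimal sketch of a client reading both counts straight from a description document. The XML below is a simplified, hypothetical stand-in: the element names `number_of_instances` and `number_of_features` are assumptions made for illustration and are not taken from the actual OpenML schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified description document (not the real OpenML schema).
DESCRIPTION_XML = """
<data_set_description>
  <name>example</name>
  <number_of_instances>150</number_of_instances>
  <number_of_features>5</number_of_features>
</data_set_description>
""".strip()


def read_size(xml_text):
    """Return (instances, features); None for any element that is absent."""
    root = ET.fromstring(xml_text)

    def as_int(tag):
        node = root.find(tag)
        return int(node.text) if node is not None and node.text else None

    return as_int("number_of_instances"), as_int("number_of_features")


print(read_size(DESCRIPTION_XML))
```

Returning `None` for absent elements keeps such a reader compatible with descriptions that do not carry the two fields.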
There are error codes for each entity to indicate no results are found for a query ([e.g. for flows](https://github.com/openml/OpenML/blob/d9a9f63a1b95f9aa18c3e7eb7ed3eb8143ec45f8/openml_OS/views/pages/api_new/v1/xml/pre.php#L271)). I was wondering why the choice was made to work with...