Soledad Galli

Results 86 issues of Soledad Galli

In Borderline SMOTE, the variant 2 is supposed to interpolate between the minority in danger and other neighbors from the minority, and then between the minority in danger and some...

Type: Question

In [this line](https://github.com/scikit-learn-contrib/imbalanced-learn/blob/f177b05/imblearn/over_sampling/_smote/base.py#L515), when adding the median(std) to the OHE matrix to estimate the distance of categorical features, the median is divided by 2. Is this a bug? or is...

Type: Question

#### Describe the bug In SMOTE-NC, the distance between categorical features is determined by the median of the standard deviation of the continuous features in the minority class. This median...

#### Describe the bug Neighbourhood cleaning rule procedure: 1. Split data T into the class of interest C (minority) and the rest of data O. 2. Identify noisy data A1...

#### Describe the bug ENN and RENN accept integers in the init parameter n_neighbors, to indicate the number of neighbours for the KNN algorithm The default is 3. However, a...

#### Describe the bug [cross_val_predict](https://github.com/scikit-learn-contrib/imbalanced-learn/blob/f177b05/imblearn/under_sampling/_prototype_selection/_instance_hardness_threshold.py#L148) returns an array with the probabilities of each class. The array will have as many columns as classes the target. Then the code takes the...

I ran a `gp_minimize` using `EIps` and then `EI` and I found that the search with EIps takes longer. I expected the opposite. Does this make sense? Here is [a...

The OrdinalEncoder has an errors argument which can either raise an error or output NaNs when encountering new categories. For this particular class, it'd make sense to output -1 when...

good first issue
enhancement

As per description here: https://medium.com/towards-data-science/mrmr-explained-exactly-how-you-wished-someone-explained-to-you-9cf4ed27458b and references therein.

At the moment, the DecisionTreeDiscretiser returns the values of the tree predictions as the replacement of the original variables. I would like to add the option to return integers from...

enhancement