imbalanced-learn icon indicating copy to clipboard operation
imbalanced-learn copied to clipboard

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Results 85 imbalanced-learn issues
Sort by recently updated
recently updated
newest added

#### Describe the issue linked to the documentation The InstanceHardnessThreshold class docstring states that a string can be passed as an argument for the estimator argument and lists the possible...

good first issue

What is the "regression" referring to in the [EasyEnsembleClassifier documentation](https://imbalanced-learn.org/stable/references/generated/imblearn.ensemble.EasyEnsembleClassifier.html?highlight=easyensemble#ra96f85e96852-1)?

For: Documentation
good first issue

#### Describe the bug setup.py:setup doesn't specify required python version via `python_requires` which leads to broken installations using pip with for example Python 3.7. #### Steps/Code to Reproduce Run with...

Type: Bug

#### Describe the bug `RandomOverSampler().fit_resample` throws `KeyError: 'Only the Series name can be used for the key in Series dtype mappings.'` #### Steps/Code to Reproduce ```python X_res, y_res = sampling.fit_resample(X,...

Status: More Info Needed

I have been using imblearn Pipeline to apply SMOTE, but I have realized that it is sampling both the train and validation sets. I get the same results when I...

Status: More Info Needed

#### Is your feature request related to a problem? Please describe currently in `classification_report_imbalanced`, when setting `output_dict` to `True` it ignores the given `target_names`. **Example:** ``` from pprint import pprint...

Type: Bug

In Borderline SMOTE, the variant 2 is supposed to interpolate between the minority in danger and other neighbors from the minority, and then between the minority in danger and some...

Type: Question

In [this line](https://github.com/scikit-learn-contrib/imbalanced-learn/blob/f177b05/imblearn/over_sampling/_smote/base.py#L515), when adding the median(std) to the OHE matrix to estimate the distance of categorical features, the median is divided by 2. Is this a bug? or is...

Type: Question

#### Describe the bug The estimator_ object fit by `CondensedNearestNeighbour()` (and probably other sampling strategies) is incorrect when y has multiple classes (and possibly also for binary classes). In particular,...

Type: Bug
Package: under_sampling

Hi, I have a question regarding the generation of synthetic samples via SMOTE. The comments in the source code state, that a new sample is generated in the following manner:...

Type: Question