imbalanced-learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
#### Description In case of a sparse training set X, the algorithm uses scipy.sparse to make the computation more efficient. A synthetic sample will only be generated when X[row].nnz is...
#### What does this implement/fix? Explain your changes. This is an implementation of the Multilabel SMOTE (MLSMOTE) algorithm described in the paper: Charte, F. & Rivera Rivas, Antonio & Del...
Is the training data for the balanced random forest first randomly undersampled and then bootstrapped? Or is it first bootstrapped and then randomly undersampled? I want to figure out what happens to the...
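To make the two orderings in this question concrete, here is a minimal sketch in plain Python. It only illustrates the difference between "undersample, then bootstrap" and "bootstrap, then undersample"; it does not claim to show which one the library's balanced random forest actually uses. The helper names `undersample` and `bootstrap` and the toy labels are assumptions made up for this illustration.

```python
import random
from collections import Counter


def undersample(indices, y, rng):
    """Keep the same number of draws per class (hypothetical helper).

    Positions are drawn without replacement, so duplicates appear in the
    output only if they were already present in `indices`.
    """
    by_class = {}
    for i in indices:
        by_class.setdefault(y[i], []).append(i)
    n_min = min(len(members) for members in by_class.values())
    kept = []
    for members in by_class.values():
        kept.extend(rng.sample(members, n_min))
    return kept


def bootstrap(indices, rng):
    """Draw len(indices) indices with replacement (hypothetical helper)."""
    return rng.choices(indices, k=len(indices))


rng = random.Random(0)
y = [0] * 180 + [1] * 20          # toy labels: 180 majority, 20 minority
all_idx = list(range(len(y)))

# Ordering A: undersample first, then bootstrap the balanced subset.
idx_a = bootstrap(undersample(all_idx, y, rng), rng)
# Ordering B: bootstrap the full set first, then undersample the draw.
idx_b = undersample(bootstrap(all_idx, rng), y, rng)

counts_a = Counter(y[i] for i in idx_a)
counts_b = Counter(y[i] for i in idx_b)
print("A:", dict(counts_a))
print("B:", dict(counts_b))
```

In this sketch ordering A always yields 40 samples whose class counts are only approximately equal, while ordering B yields exactly equal class counts whose size depends on how many minority samples the bootstrap happened to draw.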
#### Is your feature request related to a problem? Please describe Currently we are doing research on comparing classifiers' effectiveness on highly imbalanced data: `SVC(kernel="linear", C=0.025)`, `SVC(gamma=2, C=1)`, `GaussianProcessClassifier(1.0 *...
#### Describe the issue linked to the documentation Currently `plot_resampling` takes the first two columns for data, instead of the first two PCA components. https://imbalanced-learn.org/stable/auto_examples/over-sampling/plot_comparison_over_sampling.html #### Suggest a potential alternative/fix...
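As a rough illustration of the distinction raised here, the sketch below projects a feature matrix onto its first two principal components with NumPy instead of slicing its first two columns. The function name `first_two_pca_components` and the random toy data are assumptions for this example; it is not the code of the documentation's `plot_resampling` helper.

```python
import numpy as np


def first_two_pca_components(X):
    """Project X onto its first two principal components (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    X_centered = X - X.mean(axis=0)
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ vt[:2].T


rng = np.random.default_rng(0)
# Toy data in which most of the variance sits in columns 2 and 3,
# so the first two columns alone would give a poor 2D view.
X = rng.normal(size=(100, 5)) * np.array([1.0, 1.0, 10.0, 5.0, 1.0])

X_first_two_columns = X[:, :2]           # what the issue describes
X_2d = first_two_pca_components(X)       # the PCA-based alternative

print(X_first_two_columns.var(axis=0))
print(X_2d.var(axis=0))
```

A scatter plot of `X_2d` would then show the directions of largest variance, whatever the column order of the original features.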
#### Describe the bug Hello, I have found that `RandomOverSampler().fit_resample` has different memory usage in different versions. In my program, when the imblearn version was **0.8.1**, the peak memory...
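One possible way to compare peak memory across versions is the standard-library `tracemalloc` module, sketched below. This is an assumption about how such a measurement could be made, not the method used in the report, and `tracemalloc` only counts allocations it can trace rather than the process's resident memory. The helper `peak_memory_bytes` and the stand-in workload `duplicate_rows` are hypothetical; in practice the callable passed in would be something like `RandomOverSampler().fit_resample`.

```python
import tracemalloc


def peak_memory_bytes(func, *args, **kwargs):
    """Run func and return (result, peak traced bytes) (hypothetical helper)."""
    tracemalloc.start()
    try:
        result = func(*args, **kwargs)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak


def duplicate_rows(rows, factor):
    """Stand-in workload that copies every row `factor` times (hypothetical)."""
    return [list(row) for row in rows for _ in range(factor)]


rows = [[float(i)] * 10 for i in range(1000)]
resampled, peak = peak_memory_bytes(duplicate_rows, rows, 5)
print(f"{len(resampled)} rows, peak traced memory: {peak} bytes")
```

Running the same script under each installed version and comparing the reported peaks gives a like-for-like number to attach to a bug report.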
#### Describe the issue linked to the documentation References of [`KMeansSMOTE`](https://imbalanced-learn.org/stable/references/generated/imblearn.over_sampling.KMeansSMOTE.html) should be updated. #### Suggest a potential alternative/fix ``` Felix Last, Georgios Douzas, Fernando Bacao, “Oversampling for Imbalanced Learning...
Hi everyone, I'm fairly new to the machine learning field, so my apologies if the question seems very simple. I'm trying to do some classification on several datasets with some...
#### Describe the bug I've updated the Debian package of sklearn to 1.4 which is supposed to work with imbalanced-learn 0.12.0. Unfortunately the build on Debian fails as you can...
Added the option to add a pipeline as an estimator for instance hardness threshold. Currently using a pipeline as an estimator fails because of the instance check. I think it's...