dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

Results 139 dataprep issues

# Description Implementation of `clean_language` as described in #664. # How Has This Been Tested? By testing the default setting, specifying formats and changing knowledge bases. # Snapshots: ![image](https://user-images.githubusercontent.com/66409637/123722562-2d92a000-d8bb-11eb-962a-a9ed56cf18fb.png) ![image](https://user-images.githubusercontent.com/66409637/123722634-4ef38c00-d8bb-11eb-800c-19a02463254c.png)...

# Description Created a Jupyter script for the Covid Tracking API to demonstrate the dynamic URL feature, as we did for Twitter.

**Problem statement** When using `plot_diff` for two or more dataframes of different sizes, I would prefer density histograms over absolute-count histograms. In such cases, when one dataframe...

type: enhancement
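
To illustrate the request above, here is a minimal sketch in plain Python of why density normalization makes samples of different sizes comparable. It is not how `plot_diff` is implemented; the helper name `density_histogram`, the bin edges, and the sample data are all hypothetical.

```python
def density_histogram(values, edges):
    """Return per-bin densities so that the bar areas sum to 1.

    Hypothetical helper for illustration only; not part of dataprep.
    `edges` must be sorted, and the last bin includes its right edge.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    total = sum(counts)
    # Divide each count by (number of samples * bin width).
    return [c / (total * (edges[i + 1] - edges[i])) for i, c in enumerate(counts)]


edges = [0, 1, 2, 3]
small = [0.5, 1.5, 1.6, 2.5]   # 4 rows (made-up data)
large = small * 25             # 100 rows with the same distribution

small_density = density_histogram(small, edges)
large_density = density_histogram(large, edges)
```

With absolute counts the larger sample would dwarf the smaller one (25, 50, 25 versus 1, 2, 1), whereas the two density lists are identical, so the shapes can be compared directly.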

## Summary Add the show details tab in the variables section of create_diff_report. ## Design-level Explanation Actions - Add show details tab on the report UI for every column -...

type: enhancement

`clean_country()` applied to England and Scotland returns NaN. I believe this would happen for all countries belonging to the UK. It would be nice if the function recognized both cases: United...
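
Until something like this is supported, one possible workaround is to normalize the names before cleaning. The sketch below is plain Python and does not call dataprep; the helper `normalize_uk` and the choice to map constituent countries to "United Kingdom" are assumptions for illustration, not the library's behavior.

```python
# Hypothetical pre-processing step; not part of dataprep.
UK_CONSTITUENTS = {"england", "scotland", "wales", "northern ireland"}


def normalize_uk(name):
    """Map a UK constituent country name to "United Kingdom".

    Non-string values (for example None) and other names pass through unchanged.
    """
    if isinstance(name, str) and name.strip().lower() in UK_CONSTITUENTS:
        return "United Kingdom"
    return name


raw = ["England", " scotland ", "France", None]   # made-up input
prepared = [normalize_uk(n) for n in raw]
```

The `prepared` values could then be passed to the country-cleaning step in place of the raw ones.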

:eyes: Some source code analysis tools can help to find opportunities for improving software components. :thought_balloon: I propose to [increase the usage of augmented assignment statements](https://docs.python.org/3/reference/simple_stmts.html#augmented-assignment-statements "Augmented assignment statements") accordingly....

type: enhancement
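
As a small illustration of the proposal above, the sketch below contrasts plain and augmented assignment. The variable names are made up and the snippet is not taken from the dataprep code base. Note that for mutable objects such as lists, `+=` updates the object in place, so the two forms are not always interchangeable.

```python
# Plain assignment: the target name is written twice.
total = 0
for n in [1, 2, 3]:
    total = total + n

# Augmented assignment: same result for numbers, less repetition.
total_aug = 0
for n in [1, 2, 3]:
    total_aug += n

# With lists, `+=` extends the existing object in place...
a = [1, 2]
alias_a = a
a += [3]          # alias_a sees the change

# ...while `b = b + [...]` builds a new list and rebinds the name.
b = [1, 2]
alias_b = b
b = b + [3]       # alias_b still refers to the old list
```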

Python 3.7.9 (default, Aug 31 2020, 12:42:55) [GCC 7.3.0] :: Anaconda, Inc. on linux dataprep 0.3.0 code: from dataprep.eda import create_report bug: AttributeError Traceback (most recent call last) in 1...

type: bug

**Is your feature request related to a problem? Please describe.** The drop-down list in Aggrid is a commercial feature, which means that we need to pay for it when publishing it....

type: enhancement

**Describe the bug** Currently the frequency plot of a column that contains only integers displays bins with boundaries that are not integral. For example, the column containing these values `[1,...

type: bug
triage required
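
One way to get integral bin boundaries for an integer column is to pick an integer bin width and start the edges at the minimum value. The following is a minimal sketch under that assumption; `integer_bin_edges`, its `max_bins` parameter, and the input values are hypothetical and do not reflect how dataprep chooses its bins.

```python
import math


def integer_bin_edges(values, max_bins=10):
    """Return integer bin edges covering all values with at most `max_bins` bins.

    Hypothetical helper for illustration; each bin is [edge, next_edge).
    """
    lo, hi = min(values), max(values)
    span = hi - lo + 1                      # number of distinct integers covered
    width = max(1, math.ceil(span / max_bins))
    n_bins = math.ceil(span / width)
    return [lo + i * width for i in range(n_bins + 1)]


edges = integer_bin_edges([1, 2, 3, 4, 5, 6, 7], max_bins=3)
```

Because both the starting point and the width are integers, every boundary is an integer, and the final edge lies strictly above the maximum so the largest value falls inside the last bin.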

**Describe the bug** `col1` should consist of NA values, but its missing-value count is shown as zero in the Variables section. ![image](https://user-images.githubusercontent.com/18078770/137571092-7f38f902-4f77-4658-8d66-f263d30feaf1.png) **To Reproduce** ```python from dataprep.eda import create_report import pandas as...

type: bug