AutoClean

Python package for automated data preprocessing & cleaning.

Results 5 AutoClean issues

Hello, I saw an issue where someone asked for a feature that won't delete some duplicate columns, so I changed the outlier class and added an outlier_skip if any columns...

Some numeric columns, such as coordinates, shouldn't be included in outlier processing, so it would be a desirable feature to define a list of columns that outlier processing should...
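
To make the request concrete, here is a minimal sketch of outlier handling with a skip list. It is not AutoClean's implementation or API: the function name `clip_outliers`, the `skip` and `factor` parameters, and the sample data are all hypothetical, and the columns are plain Python lists so the example runs without extra dependencies. It assumes an IQR-based rule that clips values to the fences rather than removing rows.

```python
from statistics import quantiles


def clip_outliers(columns, skip=(), factor=1.5):
    """Clip each numeric column to [Q1 - factor*IQR, Q3 + factor*IQR].

    Columns whose names appear in `skip` are copied through unchanged.
    Hypothetical helper for illustration only, not part of AutoClean.
    """
    cleaned = {}
    for name, values in columns.items():
        if name in skip:
            # Skipped columns (e.g. coordinates) are left exactly as they are.
            cleaned[name] = list(values)
            continue
        q1, _, q3 = quantiles(values, n=4)  # quartiles of the column
        iqr = q3 - q1
        low, high = q1 - factor * iqr, q3 + factor * iqr
        cleaned[name] = [min(max(v, low), high) for v in values]
    return cleaned


# Made-up sample data: one extreme fare and one far-away longitude.
data = {
    "fare": [7, 8, 9, 10, 11, 12, 13, 500],
    "longitude": [-122.4, -122.3, -122.5, -122.4, -122.2, -122.6, -122.4, 151.2],
}
result = clip_outliers(data, skip=["longitude"])
```

Here the extreme `fare` value is pulled down to the upper fence, while `longitude` is returned untouched even though one of its values lies far from the rest. The same idea would carry over to a DataFrame by iterating over only the columns not named in the skip list.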

https://github.com/elisemercury/AutoClean/blob/0c8695e2663fb4c61d19469c6b8380e5be00b8c8/AutoClean/__init__.py#L1

Hey, I'm new to coding and I'm trying AutoClean on a dataset, but I keep getting this error: TypeError: cannot safely cast non-equivalent float64 to int64. According to ChatGPT this...

The 'manual' mode is described in the README.md but does not work:
```
import pandas as pd  # needed for pd.read_csv below
from AutoClean import AutoClean

dataset = pd.read_csv("Titanic-Dataset.csv")
pipeline = AutoClean(dataset, mode='manual', duplicates=True, missing_num=False, missing_categ=False, encode_categ=False, extract_datetime=False, outliers=False,...
```