
Feature request: Automatically generated dataframe schemas to catch errors

Open kenibrewer opened this issue 4 months ago • 0 comments

Feature type

  • [X] Add new functionality

  • [ ] Change existing functionality

General description of the proposed functionality

Story: As a pycytominer user, I would like to receive more descriptive error messages about problems with my data. Pycytominer could automatically generate a DataframeSchema that checks that the column names I specified in arguments exist, and that those columns contain no NaN or inf values for operations where such values cause errors. If the error message named the specific column and row that contain problematic values, I could work with large distributed datasets more easily.
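To make the request concrete, here is a minimal sketch of the kind of check described above, written with plain pandas and NumPy rather than a schema library. Nothing in it is pycytominer API: `check_columns`, `DataValidationError`, and the column names are hypothetical, and the sketch assumes the checked columns are numeric.

```python
import numpy as np
import pandas as pd


class DataValidationError(ValueError):
    """Hypothetical error type for this sketch; not part of pycytominer."""


def check_columns(df: pd.DataFrame, columns: list[str]) -> None:
    """Raise if a requested column is missing or holds NaN/inf values.

    Assumes every column in `columns` is numeric.
    """
    # First check: every column named in the arguments must exist.
    missing = [col for col in columns if col not in df.columns]
    if missing:
        raise DataValidationError(f"Missing columns: {missing}")

    # Second check: collect the column and row label of each non-finite value.
    problems = []
    for col in columns:
        values = df[col].to_numpy(dtype=float)
        bad = ~np.isfinite(values)  # True for NaN, inf and -inf
        for row_label, value in zip(df.index[bad], values[bad]):
            problems.append(f"column '{col}', row {row_label}: {value}")

    if problems:
        raise DataValidationError(
            "Non-finite values found:\n" + "\n".join(problems)
        )


# Illustrative data only; the feature names are made up for this example.
example = pd.DataFrame(
    {
        "Cells_Area": [1.0, float("nan"), 3.0],
        "Nuclei_Area": [1.0, 2.0, float("inf")],
    }
)

try:
    check_columns(example, ["Cells_Area", "Nuclei_Area"])
except DataValidationError as err:
    print(err)
```

The error lists every offending column and row label at once instead of stopping at the first bad value, which is the behavior the story asks for. A generated schema could apply the same two checks, driven by the column names passed to each pycytominer operation.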

Feature example

Coming

Alternative Solutions

No response

Additional information

No response

kenibrewer, Mar 27 '24 13:03