auto-sklearn icon indicating copy to clipboard operation
auto-sklearn copied to clipboard

Update `PIPELINE_DATA_DTYPE`

Open eddiebergman opened this issue 4 years ago • 0 comments

PIPELINE_DATA_DTYPE is currently outdated, we now accept pandas dataframes, need to update pipeline components to ensure this is supported throughout and update the type to reflect this.

This was indentified from autosklearn/pipeline/components/data_preprocessing/imputation/categorical_imputation.py

Also need to be sure about the type of sparse matrices we accept.

PIPELINE_DATA_DTYPE = Union[
    np.ndarray,
    scipy.sparse.bsr_matrix,
    scipy.sparse.coo_matrix,
    scipy.sparse.csc_matrix,
    scipy.sparse.csr_matrix,
    scipy.sparse.dia_matrix,
    scipy.sparse.dok_matrix,
    scipy.sparse.lil_matrix,
]

eddiebergman avatar Nov 04 '21 16:11 eddiebergman