auto-sklearn
auto-sklearn copied to clipboard
Update `PIPELINE_DATA_DTYPE`
PIPELINE_DATA_DTYPE is currently outdated, we now accept pandas dataframes, need to update pipeline components to ensure this is supported throughout and update the type to reflect this.
This was indentified from autosklearn/pipeline/components/data_preprocessing/imputation/categorical_imputation.py
Also need to be sure about the type of sparse matrices we accept.
PIPELINE_DATA_DTYPE = Union[
np.ndarray,
scipy.sparse.bsr_matrix,
scipy.sparse.coo_matrix,
scipy.sparse.csc_matrix,
scipy.sparse.csr_matrix,
scipy.sparse.dia_matrix,
scipy.sparse.dok_matrix,
scipy.sparse.lil_matrix,
]