scikit-learn-intelex
Feature request: Support different data sources
Hi, currently daal4py accepts only CSV files, np.array and pandas dataframes. When the data is too large to fit into memory and is stored in a database, other solutions have come up, such as polars dataframes, which use connectorx to access data stored in databases, and ibis, which uses databases such as ClickHouse as a backend. So it would be great if daal4py could accept different data sources in order to handle bigger data sizes.
Here is the error I get when I try to read a ClickHouse table into daal4py using ibis: Got type 'ClickhouseTable' when expecting string, array, or list of 1d-arrays.
Here is the error I get when I try to read a polars dataframe into daal4py: Got type 'DataFrame' when expecting string, array, or list of 1d-arrays.
Regards, Mohamed
We are considering adding support for different dataframes; see https://data-apis.org/dataframe-protocol/latest/purpose_and_scope.html
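Until something like that lands, one possible stopgap for the two type errors reported above is to convert the object on the caller's side before handing it to daal4py. The following is only a rough sketch: `to_daal4py_input` is a hypothetical helper, not part of daal4py, and it assumes the object exposes `to_numpy()` (as polars dataframes do) or `to_pandas()` (as recent ibis table expressions do; older ibis versions may only offer `execute()`). The last branch assumes pandas 1.5 or newer for `pandas.api.interchange.from_dataframe`.

```python
def to_daal4py_input(data):
    """Best-effort conversion to a type daal4py already accepts.

    Hypothetical helper for illustration only; not a daal4py API.
    """
    # CSV file names and lists of 1d-arrays are accepted as-is.
    if isinstance(data, (str, list)):
        return data

    # NumPy arrays and pandas objects are already supported, so pass
    # them through. The check uses the type's module name to avoid
    # importing either library here.
    top_level_module = type(data).__module__.split(".")[0]
    if top_level_module in ("numpy", "pandas"):
        return data

    # polars.DataFrame exposes to_numpy().
    if hasattr(data, "to_numpy"):
        return data.to_numpy()

    # ibis table expressions expose to_pandas(), which runs the query
    # on the backend and returns the result as a pandas DataFrame.
    if hasattr(data, "to_pandas"):
        return data.to_pandas()

    # Anything else that implements the dataframe interchange protocol.
    if hasattr(data, "__dataframe__"):
        from pandas.api.interchange import from_dataframe  # pandas >= 1.5
        return from_dataframe(data)

    raise TypeError(f"Cannot convert object of type {type(data).__name__!r}")
```

Note that every conversion branch materializes the full result in memory, so this only works around the type error. It does not address the larger-than-memory case that motivated the request, which would still need the data to be read in pieces.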