datasets icon indicating copy to clipboard operation
datasets copied to clipboard

Support loading a dataset from SQLite files

Open albertvillanova opened this issue 4 years ago • 2 comments

As requested by @julien-c, we could eventually support loading a dataset from SQLite files, like it is the case for JSON/CSV files.

albertvillanova avatar Oct 15 '21 10:10 albertvillanova

for reference Kaggle has a good number of open source datasets stored in sqlite

Alternatively a tutorial or tool on how to convert from sqlite to parquet would be cool too

julien-c avatar Jun 20 '22 13:06 julien-c

Hello, could we leverage pandas.read_sql for this?

This would be basically the same as CSVBuilder , but uses pandas.read_sql(..., chunksize=1) instead of pandas.read_csv(..., iterator=True)

I'm happy to work on this :)

self-assign

Dref360 avatar Aug 30 '22 21:08 Dref360