ipt
ipt copied to clipboard
Add support for DuckDB
This simple patch adds support for DuckDB. It just requires a single extra line to the Dockerfile:
ADD https://repo1.maven.org/maven2/org/duckdb/duckdb_jdbc/1.0.0/duckdb_jdbc-1.0.0.jar /usr/local/tomcat/webapps/ROOT/WEB-INF/lib
This allows to read a large variety of data sources, such as Parquet files.
For example:
- Database:
:memory: - SQL Statement:
install httpfs; load httpfs;
select * from read_parquet('s3://gbif-open-data-eu-central-1/occurrence/2024-08-01/occurrence.parquet/000000');
Interesting!
@mike-podolskiy90, this would be better as a Maven dependency so non-Docker users can benefit.
Thanks @frafra and @MattBlissett , I'll have a look
I really like to see it happen, so I rebased the branch and added duckdb as a Maven dependency :)