icedb issues

On-disk support (network fs)

4

Have a mode where it operates off disk instead of S3 in case of using something like AWS fsx lustre. That might provide better managed performance, especially for reading the...

danthegoodman1

enhancement

Upgrade duckdb version

Use latest version, and use latest credentials methods for S3

danthegoodman1

enhancement

Avro insert support

3

Can insert an avro record, which since it includes the schema, will use that schema for the insert. It will be used both for setting initial table types, as well...

danthegoodman1

documentation

enhancement

Performance comparisons

1

Compare ingesting and queries on the same data set to: - [ ] BigQuery - [ ] Athena - [ ] ClickHouse - [ ] MotherDuck - [ ] Parquet...

danthegoodman1

documentation

help wanted

Concurrent log file download

90+% of the download time is waiting for time to first byte (avg 86ms in region on AWS). If we do batches of these and buffer in memory it should...

danthegoodman1

enhancement

Github events has 232M rows and lots of example queries: https://ghe.clickhouse.tech/ https://clickhouse.com/docs/en/getting-started/example-datasets/nyc-taxi has 3B rows but is smaller in size can do this too and much less complex schema Should...

danthegoodman1

documentation

Rewrite partition concurrency

When performing partition rewrite, we should be able to specify how many files will be processed concurrently.

danthegoodman1

enhancement

help wanted

Partition removal example

1

Show how to do partition removal. Example of TTL, and example of user ID deletion

danthegoodman1

help wanted

example

Partition rewrite example

Make example showing how to filter out a given user ID's data Related to #106

danthegoodman1

help wanted

example

Example using datafusion

According to https://github.com/apache/arrow-datafusion-python/issues/442#issuecomment-1685809731 it should be able to do this with https://github.com/danthegoodman1/IceDBS3Proxy

danthegoodman1

help wanted

example

icedb
icedb copied to clipboard

Metadata

On-disk support (network fs)

Upgrade duckdb version

Avro insert support

Performance comparisons

Concurrent log file download

Performance test (against BQ)

Rewrite partition concurrency

Partition removal example

Partition rewrite example

Example using datafusion

← Metadata

Owner

Metadata

icedb icedb copied to clipboard

Metadata

← Metadata

Owner

Metadata

icedb
icedb copied to clipboard