# Dione - a Spark and HDFS indexing library
We can implement a simple, Java-native, standalone HTTP server that accepts requests for one or more keys and returns the payload (as JSON, etc.). Users can then implement their own web server to...
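A minimal sketch of such a server using the JDK's built-in `com.sun.net.httpserver` package. The in-memory `Map` stands in for the real index lookup, and the `/fetch` route, query format, and JSON shape are all assumptions for illustration, not the library's API:

```java
import com.sun.net.httpserver.HttpExchange;
import com.sun.net.httpserver.HttpServer;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.nio.charset.StandardCharsets;
import java.util.Map;

public class IndexLookupServer {

    // Hypothetical lookup: a real server would delegate to the index reader
    // instead of an in-memory map.
    static String lookupJson(Map<String, String> index, String key) {
        String payload = index.get(key);
        if (payload == null) {
            return "{\"key\":\"" + key + "\",\"found\":false}";
        }
        return "{\"key\":\"" + key + "\",\"payload\":\"" + payload + "\"}";
    }

    // Starts a server answering e.g. GET /fetch?user-1 with a JSON body.
    static HttpServer start(Map<String, String> index, int port) throws Exception {
        HttpServer server = HttpServer.create(new InetSocketAddress(port), 0);
        server.createContext("/fetch", (HttpExchange ex) -> {
            // The key is passed as the raw query string (an assumption).
            String key = ex.getRequestURI().getQuery();
            byte[] body = lookupJson(index, key).getBytes(StandardCharsets.UTF_8);
            ex.getResponseHeaders().set("Content-Type", "application/json");
            ex.sendResponseHeaders(200, body.length);
            try (OutputStream os = ex.getResponseBody()) {
                os.write(body);
            }
        });
        server.start();
        return server;
    }
}
```

Keeping the lookup logic in a plain function like `lookupJson` is what makes it easy for users to swap in their own web framework around it.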
## Summary
Currently we don't support file splits in all formats. Relates to #10.
## Summary
Does this library support Delta files (https://github.com/delta-io/delta/)?

Links to #PR/#Issue

## Detailed Description
What is the problem? How can we solve it?
### Summary
Adding the option to have a Parquet index instead of an Avro B-tree. This is for batch-only use cases, where fetches are rare or not used at all. In such...
# Summary
Currently we hard-code parts of the DDL in `createIndex`, for example the `partitioned by` clause when we create the index table.
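One way to remove the hard-coding is to build the `CREATE TABLE` statement from a partition spec passed in by the caller. A rough sketch; the column names (`key`, `file`, `offset`) and the method name are illustrative, not Dione's actual schema or API:

```java
import java.util.List;

public class IndexDdl {

    // Builds the index-table DDL with a caller-supplied partition spec
    // instead of a hard-coded `partitioned by` clause.
    static String createIndexDdl(String indexTable, List<String> partitionCols) {
        StringBuilder sb = new StringBuilder(
            "CREATE TABLE " + indexTable + " (key STRING, file STRING, offset BIGINT)");
        if (!partitionCols.isEmpty()) {
            // e.g. ["dt", "hr"] -> PARTITIONED BY (dt STRING, hr STRING)
            sb.append(" PARTITIONED BY (")
              .append(String.join(" STRING, ", partitionCols))
              .append(" STRING)");
        }
        return sb.toString();
    }
}
```

An empty list simply yields an unpartitioned index table, so existing call sites keep working.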
Add an IT test script that generates some big table, indexes it, and asserts the basic functionality. Such a Spark script should work smoothly on any Hadoop cluster (with HDFS).
Try to recognize common path prefixes at runtime and trim them. For example, files in a standard table might look like:
```
hdfs://my_cluster/foo/bar/my_table/dt=2020-01-01/part-0000.parquet
hdfs://my_cluster/foo/bar/my_table/dt=2020-01-01/part-0001.parquet
...
```
On read, before the...
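The prefix-trimming step above can be sketched as follows, assuming plain string paths; the class and method names are illustrative. The common prefix is cut back to the last `/` so a path component is never split in half:

```java
import java.util.List;
import java.util.stream.Collectors;

public class PathPrefixTrimmer {

    // Longest common prefix of all paths, truncated at the last '/'
    // so we only ever trim whole directory components.
    static String commonPrefix(List<String> paths) {
        String prefix = paths.get(0);
        for (String p : paths) {
            int i = 0;
            int max = Math.min(prefix.length(), p.length());
            while (i < max && prefix.charAt(i) == p.charAt(i)) {
                i++;
            }
            prefix = prefix.substring(0, i);
        }
        int cut = prefix.lastIndexOf('/');
        return cut < 0 ? "" : prefix.substring(0, cut + 1);
    }

    // Store only the suffixes; the shared prefix would be kept once
    // (e.g. in the index metadata) and re-attached on read.
    static List<String> trim(List<String> paths) {
        String prefix = commonPrefix(paths);
        return paths.stream()
                .map(p -> p.substring(prefix.length()))
                .collect(Collectors.toList());
    }
}
```

For the example table above this would store only `part-0000.parquet`, `part-0001.parquet`, etc., with the `hdfs://my_cluster/foo/bar/my_table/dt=2020-01-01/` prefix kept once.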
To support ignoring "old" partitions. Alternatively, we can add some partition "blacklist" or a minimum value to the index metadata.
The Spark 2.4+ DataSource v2 API is much more powerful than in Spark 2.3. The main issue in Spark 2.3 is that you basically need to implement everything yourself, and it is a...
The current `filesDF` is both ugly and inefficient in terms of data locality. We should try to switch to something like `HadoopRDD`/`NewHadoopRDD`, or something more natural, to leverage the preferred locations...