connectors
connectors copied to clipboard
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Cache the partition pruning result in FilteredDeltaScanImpl to improve performance (#399) Signed-off-by: sonhmai
We perform partition pruning inside of [FilteredDeltaScanImpl](https://github.com/delta-io/connectors/blob/master/standalone/src/main/scala/io/delta/standalone/internal/scan/FilteredDeltaScanImpl.scala) by - having some input query / expression `expr` - creating a [PartitionRowRecord](https://github.com/delta-io/connectors/blob/master/standalone/src/main/scala/io/delta/standalone/internal/data/PartitionRowRecord.scala) for a given AddFile `partitionRowRecord` - evaluating `partitionRowRecord` against `expr`...
Support following expression type in [column stats based file pruning](https://docs.google.com/document/d/1D2ukbGInriKTOZJl3r3pCZoQRji40rSuFgnOyzhHz7k/edit). * IsNull * IsNotNull * Not
Support following expression type in [column stats based file pruning](https://docs.google.com/document/d/1D2ukbGInriKTOZJl3r3pCZoQRji40rSuFgnOyzhHz7k/edit). * equalTo * greaterThan * greaterThanOrEqual * lessThan * lessThanOrEqual * OR * AND
[FilteredDeltaScan](https://github.com/delta-io/connectors/blob/master/standalone/src/main/scala/io/delta/standalone/internal/scan/FilteredDeltaScanImpl.scala) in Standalone now implemented partition filter, it is to: * having some input query / expression expr. * creating a [PartitionRowRecord](https://github.com/delta-io/connectors/blob/master/standalone/src/main/scala/io/delta/standalone/internal/data/PartitionRowRecord.scala) for a given AddFile partitionRowRecord. * evaluating partitionRowRecord...
If you look at the source jars that are being published for delta standalone, the source jars are just empty. For example you can see this in the latest `io.delta:delta-standalone_2.13:0.4.1`...
Adding Table/SQL support for Delta-Flink Sink connector. The Table architecture for Flink connectors is showed on below diagram: ![image](https://user-images.githubusercontent.com/7932805/187198747-02943535-72a8-4cc7-ac1b-fb87a06592de.png) This PR provides Delta specific implementation for: - DynamicTableSinkFactory - DynamitTableSink...
Extracts the column level stats from parquet file and add it to the delta log in json format, fixes https://github.com/delta-io/connectors/issues/371 - The implementation is not complete as it may not...
Currently our comparison expressions fail with `IllegalArgumentException` for decimal type arguments that don’t have equal precision/scale since we check for equal data types. This should be valid (and is in...
Hey there, when using power query to get the set of files for the delta connector, the following query results in an emtpy table when using the Power BI Delta...