dantengsky
I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/ ## Summary WIP: - [x] tweak the `Procedure` interface to adapt to the style of new-processor, so that procedure can...
**Summary** We have two internal patches of parquet2, which mainly address one requirement: acquiring the parquet file meta right after the parquet file has been written, without re-reading the...
**Summary** While collecting the min/max values of columns, we kept their exact values. For columns of string (or string-like) type, the min/max values may be large (say, a column of type...
**Summary** Currently, the "number of distinct values" we feed to the bloom filter is the number of rows, which is too conservative (naive). We should use something like HyperLogLog to...
**Summary** Observed that while continuously ingesting data into the fuse table (at the same scale, single committer), the insertion time keeps increasing, and seems to be proportional to the...
**Summary** Currently, committing a fuse table snapshot uses a simple OCC mechanism (retry with exponential backoff). Under high contention, it may lead to bad performance, with lots of...
**Summary**
- Data Ingestion
  - [x] #2193
  - [x] #2194
  - [x] #1781
  - [x] #1783
  - [ ] #1785
  - [x] #2361
  - [ ] #2383
- Data Scanning...
I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/ ## Summary Introduces basic segment information, like the number of blocks/rows, at the snapshot level. Aims to...
I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/ ## Summary Enables delta binary packed encoding for `[U]int8/16/32/64`, `Date`, `Time`, `Timestamp`, `Duration`, and `Decimal` in Parquet serialization....
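To illustrate why delta-based encoding tends to help for the integer-like types listed in the last entry, here is a minimal sketch of the underlying idea only. It is not the actual Parquet `DELTA_BINARY_PACKED` layout (which organizes deltas into blocks and miniblocks) and not Databend's implementation; the function name `delta_summary` and its return shape are hypothetical, chosen for illustration.

```python
def delta_summary(values):
    """Simplified view of delta encoding with a frame of reference.

    Hypothetical helper, not a Parquet or Databend API. Returns
    (first_value, min_delta, bit_width, adjusted_deltas), or None
    for empty input.
    """
    if not values:
        return None
    # Differences between consecutive values.
    deltas = [b - a for a, b in zip(values, values[1:])]
    # Subtracting the smallest delta makes every stored delta non-negative.
    min_delta = min(deltas, default=0)
    adjusted = [d - min_delta for d in deltas]
    # Bits needed per adjusted delta if all were packed at one fixed width.
    bit_width = max(adjusted, default=0).bit_length()
    return values[0], min_delta, bit_width, adjusted


# Slowly increasing values, e.g. timestamps or auto-increment ids.
print(delta_summary([100, 102, 105, 107, 110]))
```

In this sketch, slowly varying columns such as timestamps yield small adjusted deltas, so each value needs far fewer bits than its plain fixed-width representation.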