Rob Meng
Rob Meng
total sample size is defined as `sample_rate * num_centroids` or `sample_rate * (2 ^ n_bits)`, for quantization the orders of magnitude for each is usually very different; the means that...
## Context S3 does not support transactional-operations like `*_if_not_exist`. We would like to support transactional-operations on S3 for concurrent writers. ## Requirements * Implement a native locking mechanism in Lance...
repro ``` very_large_dataset.optimize.compact_files(num_threads=a_lot) ```
**Problem** It's often hard trying to saturate IO throughput of object stores with a single read stream (which usually implies a single file is sequentially read from or written to)...
this is technically not a breaking change since we are making the return consistent with the existing type annotation. marked as a breaking change so users are aware of this...
bootleg parallel migration tool using compaction task execution facility In next PR, I'll add a `force_rewrite` option to rewrite files even when the file size is equal to the desired...
repro Cargo.toml ```toml [package] name = "test-rs" version = "0.1.0" edition = "2021" [dependencies] reqwest = { version = "~0.12", default_features = false, features = ["rustls-tls-native-roots"] } ``` ```rust use...
Hi, I absolutely love the project. We use it in many places. Would it be possible to allow jitter on the durations? We use moka for various RPC caching. When...
This PR adds a way to easily write test for concurrent transactions. we define a new class called `ParallelOperations` where we can define: * the initial state of a dataset...