lr4d
lr4d
Supersedes https://github.com/JDASoftwareGroup/kartothek/pull/260 I tried to rebase this branch on top of master but it was quite the adventure to do so. I excluded the changes to the changelog and it...
## Scenario We use `update_dataset_from_ddf` with `shuffle=True`. We `bucket_by` the column `c1`, with a specific `n_buckets`. Imagine we now store the value of `bucket_by` and` num_buckets` alongside the Parquet metadata...
### Problem description Currently, `DatasetMetadataBase.query` returns all partitions when a query is made for an non-existing column or a column without index (see https://github.com/JDASoftwareGroup/kartothek/issues/198). ### Proposed solution Implement a strict...
### Problem description 1. Kartothek deletes all partitions of a dataset when a non-existing column is provided in the `delete_scope`. 2. Kartothek deletes all partitions of a dataset (without warning)...
### Problem description > So I have tried out multiple things and having type safety even in NumPy doesn't work, if you don't want to scan types in objects columns...
Draft from #62 : ``` Secondary Indices ----------------- The ability to build and maintain `inverted indices `_ are an additional feature provided by ``kartothek``. In general, an index is a...
I was a little scared when I saw all these `mut`s, so I tried to remove at least one. I can't really tell if it's any better than the original...
### Description of your *pull request* and other information Allows a user to download intepretation audio tracks alongside the main video of a Zoom recording. A user can use the...