lakeFS
lakeFS - Data version control for your data lake | Git for data
Speed up metadata client JOIN operations by telling the driver how to JOIN metaranges in linear time. We will use this to transform the garbage collector from a quadratic process to...
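To illustrate what a linear-time JOIN can look like, here is a minimal sketch of a merge join over two inputs that are already sorted by key. It assumes the entries can be read in key order with no duplicate keys; the function name `merge_join` and the plain string keys are hypothetical and do not come from the lakeFS metadata client.

```python
from typing import Iterable, Iterator


def merge_join(left: Iterable[str], right: Iterable[str]) -> Iterator[str]:
    """Yield keys present in both inputs.

    Assumes both inputs are sorted ascending and contain no duplicates.
    Each input is advanced at most once per element, so the total work is
    O(len(left) + len(right)) rather than O(len(left) * len(right)).
    """
    left_iter, right_iter = iter(left), iter(right)
    l = next(left_iter, None)
    r = next(right_iter, None)
    while l is not None and r is not None:
        if l == r:
            # Match: emit it and advance both sides.
            yield l
            l = next(left_iter, None)
            r = next(right_iter, None)
        elif l < r:
            # The left key cannot match anything later on the right.
            l = next(left_iter, None)
        else:
            r = next(right_iter, None)


common = list(merge_join(["a", "c", "d"], ["b", "c", "d", "e"]))
```

A nested-loop join would compare every pair of keys; the merge join relies only on the sort order, which is why the driver needs to be told about it.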
Several possible applications:
- Show spanning of requests taking more than x seconds.
- Record spanning for any versioning operations: diff/merge/log.
- Discovery: Consider the right structure of the software: different jars by Hadoop version (similar to our Spark client), or something else.
- Implement packaging
Add a lakectl command: `lakectl fs sync some/local/dir lakefs://example-repo/example-branch/prefix/`
Support migration of the lakeFS database between different KV implementations by adding 2 flows:
1. Export - dump all the data from the old database into a file
2. Import - load data...
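The two flows can be pictured with a small sketch. It assumes the store can be treated as a mapping from string keys to string values and that the dump file is JSON Lines; both are assumptions made for illustration, and the names `export_kv` and `import_kv` are hypothetical rather than lakeFS APIs.

```python
import io
import json
from typing import IO, Dict


def export_kv(store: Dict[str, str], out: IO[str]) -> int:
    """Dump every key/value pair as one JSON object per line; return the count."""
    count = 0
    for key in sorted(store):  # sorted so the dump is deterministic
        out.write(json.dumps({"key": key, "value": store[key]}) + "\n")
        count += 1
    return count


def import_kv(src: IO[str], store: Dict[str, str]) -> int:
    """Load pairs written by export_kv into another store; return the count."""
    count = 0
    for line in src:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the dump
        record = json.loads(line)
        store[record["key"]] = record["value"]
        count += 1
    return count


# Round trip between two in-memory stand-ins for the old and new databases.
old_db = {"repos/example-repo": "meta", "branches/main": "commit-1"}
new_db: Dict[str, str] = {}
dump = io.StringIO()
exported = export_kv(old_db, dump)
dump.seek(0)
imported = import_kv(dump, new_db)
```

Going through an intermediate file keeps the two KV implementations decoupled: neither side needs to know about the other, only about the dump format.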
@johnnyaug Here's what I've done step by step:
- I created a new folder called "advanced."
- I moved 4 files to this folder
- for each file, I changed...