lakeFS
lakeFSFS: Define target performance and measure performance
- Define target performance for lakeFSFS used with Spark and Hive tables
- Define performance benchmarks and guide to running them
- Measure performance and document results
Target performance and measurements should be defined for:
- Writing a 1000-partition Parquet file (for instance).
- Reading all objects of a 1000-partition Parquet file (for instance). A good refinement is to measure two variants: reading all fields, and reading just one field from the file.
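
As a starting point for the benchmarks, here is a minimal timing harness sketch in Python. It is not an existing lakeFS tool: the names `measure` and `meets_target` are hypothetical, and the choice of median wall-clock time over a few repeats as the metric is an assumption to be revisited once targets are defined.

```python
import statistics
import time
from typing import Callable


def measure(operation: Callable[[], object], repeats: int = 3) -> dict:
    """Run `operation` `repeats` times and summarize wall-clock seconds."""
    if repeats < 1:
        raise ValueError("repeats must be at least 1")
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        operation()  # e.g. a Spark write or read against lakeFSFS
        samples.append(time.perf_counter() - start)
    return {
        "runs": len(samples),
        "min": min(samples),
        "median": statistics.median(samples),
        "max": max(samples),
    }


def meets_target(stats: dict, target_seconds: float) -> bool:
    """Compare the median run time with a target (assumed metric)."""
    return stats["median"] <= target_seconds
```

Each scenario above would be wrapped in a zero-argument callable and passed to `measure`. For the write case this might be a PySpark call such as `df.write.partitionBy(...).parquet(...)` on a `lakefs://` path, and for the read case a full scan versus a single-column `select`. The exact dataset, paths, and target values are still to be defined as part of this issue.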