seafowl
seafowl copied to clipboard
Local object cache for S3
When using an S3-compatible object store, DataFusion / Seafowl currently download the object every time they need to scan through it. We should let users dedicate some scratch space on a Seafowl instance as object cache. Potential options:
- write a wrapper around the
ObjectStore
trait that proxies an upstream object store and uses a local FS as cache - use https://github.com/kahing/goofys / + catfs OR s3fs?