volsync icon indicating copy to clipboard operation
volsync copied to clipboard

Investigate sparse & large file handling

Open JohnStrunk opened this issue 3 years ago • 2 comments

Describe the feature you'd like to have. Investigate how the various movers handle both sparse and large files.

What is the value to the end user? (why is it a priority?) VM images, in particular, can be rather large. We should characterize what performance we expect (and optimize where we can). These images may also be sparse, and detecting/preserving that can also be a big benefit.

How will we know we have a good solution? (acceptance criteria)

  • Understand what performance we should expect w/ large files.
    • Are small changes in the file handled efficiently?
    • Are there any optimizations that would make it better?
  • Understand how sparse files are handled.
    • Is sparseness preserved?

Additional context Rsync & sparse: https://gergap.wordpress.com/2013/08/10/rsync-and-sparse-files/ Related to backube/scribe#122 Prompted by @mykaul

JohnStrunk avatar Jun 02 '21 13:06 JohnStrunk

You should look at what has been implemented in oVirt's image-io (see https://github.com/oVirt/ovirt-imageio/blob/master/examples/sparse-stream for example) to optimize sparse image copy.

mykaul avatar Aug 01 '21 09:08 mykaul

It would be interesting to compare a NFS-based solution vs rsync. NFS is supposed to handle large file transfers better. Since rsync is build on top of SSH, I wonder if this is the bottleneck.

moonlight16 avatar Mar 30 '23 21:03 moonlight16