diffy icon indicating copy to clipboard operation
diffy copied to clipboard

[Feature request] Option to perform diff without tempfiles

Open davidbodow-st opened this issue 1 year ago • 1 comments

Overview

When calling a diff of two large strings, a system that is low on disk space can fail with ENOSPC when attempting to write the strings to tempfiles.

Use Case

I would like to diff strings in memory, especially if the system has trimmed down the free disk substantially. For example, we use a large Sidekiq deployment, and having overprovisioned disk for 1000s of workers (only a few of which need to run disks) ends up costing a non-trivial amount. We could create a "high disk queue", but that seems more complex than adding a generally useful diffing option here.

Implementation Ideas

Bash process substitution can be used to avoid writing the tempfiles when calling diffbin:

❯ diff <(echo foo) <(echo bar)
1c1
< foo
---
> bar

We could handle passing the process substitution args instead of the usual args by implementing a new source in #diff: https://github.com/samg/diffy/blob/main/lib/diffy/diff.rb#L43

Other

I'd be happy to open a PR for this if there is interest in the feature, but no bandwidth otherwise.

davidbodow-st avatar Jul 13 '23 20:07 davidbodow-st

Hi @davidbodow-st - I'd be happy to look at a PR for the feature you're describing. You'll have to be careful that it's implemented in a way which is secure and not susceptible to shell code injection (i.e. some way data in the diffed files can be interpreted as shell commands). Let me know when you have something ready to review and we could look at getting it included in the gem.

samg avatar Jul 20 '23 04:07 samg