coreutils
coreutils copied to clipboard
dd: unexpectedly consumes all bytes from stdin
Executive summary:
Setup:
cargo build -p uu_dd
echo 'abcd' > infile
GNU dd:
$ (dd bs=1 skip=1 count=0 status=none && cat -) < infile
bcd
uutils dd:
$ (./target/debug/dd bs=1 skip=1 count=0 status=none && cat -) < infile
(it produces no output).
I'm going to be very explicit about this because it helped me understand the problem here. The way I'm reading the command line is:
ddmeans copy bytes from stdin to stdout,bs=1means blocks of 1 byte from the input and write blocks of 1 byte to the output,skip=1means skip 1 block of the input,count=0means write 0 blocks to the output,&&means do the second command only if the first command terminates with exit status 0,cat -means copy bytes from stdin to stdout,< infilemeans redirect the bytes ofinfileinto stdin,
Altogether this could be described in English as "send bytes of infile into stdin, have dd skip the first byte and stop, then have cat copy the remaining bytes to stdout".
So the problem seems to be that by the time cat attempts to read from stdin, there are no bytes available to read, but there should be. I took a look in the code for dd and it doesn't seem to be reading more bytes than it should: https://github.com/uutils/coreutils/blob/1194a8ce534e88cf213ddc3b66bb8d357455cf56/src/uu/dd/src/dd.rs#L182 Perhaps this has something to do with how Rust provides a handle to stdin?
(I discovered this issue when looking into test failures in the GNU test tests/misc/head-c.sh.)
The corresponding GNU test for dd is in tests/dd/not-rewound.sh.
There's a similar issue with od; a corresponding test is in tests/misc/od-N.sh.
I made an attempt to solve this issue. The problem is that an attempt to skip bytes by read_skip leads to consuming all stdin by a rust application: here we lose stdin
A possible workaround is to seek Stdin by interpreting it as File https://github.com/ArtemSkrebkov/coreutils/tree/stop_dd_consume_whole_stdin
Inspired by https://github.com/rust-lang/rust/issues/72802 So if https://github.com/rust-lang/rust/issues/72802 is resolved, the implementation above can be updated to make it safe
Let me know your opinions for the approach suggested. If maintainers are fine with that, I will clean up the branch and make a PR.
I think it would be a welcome addition.
Alright, I'll clean that up and make a PR.