coreutils icon indicating copy to clipboard operation
coreutils copied to clipboard

dd: unexpectedly consumes all bytes from stdin

Open jfinkels opened this issue 3 years ago • 5 comments

Executive summary:

Setup:

cargo build -p uu_dd
echo 'abcd' > infile

GNU dd:

$ (dd bs=1 skip=1 count=0 status=none && cat -) < infile
bcd

uutils dd:

$ (./target/debug/dd bs=1 skip=1 count=0 status=none && cat -) < infile

(it produces no output).


I'm going to be very explicit about this because it helped me understand the problem here. The way I'm reading the command line is:

  • dd means copy bytes from stdin to stdout,
  • bs=1 means blocks of 1 byte from the input and write blocks of 1 byte to the output,
  • skip=1 means skip 1 block of the input,
  • count=0 means write 0 blocks to the output,
  • && means do the second command only if the first command terminates with exit status 0,
  • cat - means copy bytes from stdin to stdout,
  • < infile means redirect the bytes of infile into stdin,

Altogether this could be described in English as "send bytes of infile into stdin, have dd skip the first byte and stop, then have cat copy the remaining bytes to stdout".

So the problem seems to be that by the time cat attempts to read from stdin, there are no bytes available to read, but there should be. I took a look in the code for dd and it doesn't seem to be reading more bytes than it should: https://github.com/uutils/coreutils/blob/1194a8ce534e88cf213ddc3b66bb8d357455cf56/src/uu/dd/src/dd.rs#L182 Perhaps this has something to do with how Rust provides a handle to stdin?

(I discovered this issue when looking into test failures in the GNU test tests/misc/head-c.sh.)

jfinkels avatar Feb 01 '22 03:02 jfinkels

The corresponding GNU test for dd is in tests/dd/not-rewound.sh.

jfinkels avatar Feb 02 '22 23:02 jfinkels

There's a similar issue with od; a corresponding test is in tests/misc/od-N.sh.

jfinkels avatar Feb 08 '22 02:02 jfinkels

I made an attempt to solve this issue. The problem is that an attempt to skip bytes by read_skip leads to consuming all stdin by a rust application: here we lose stdin

A possible workaround is to seek Stdin by interpreting it as File https://github.com/ArtemSkrebkov/coreutils/tree/stop_dd_consume_whole_stdin

Inspired by https://github.com/rust-lang/rust/issues/72802 So if https://github.com/rust-lang/rust/issues/72802 is resolved, the implementation above can be updated to make it safe

Let me know your opinions for the approach suggested. If maintainers are fine with that, I will clean up the branch and make a PR.

ArtemSkrebkov avatar May 24 '22 14:05 ArtemSkrebkov

I think it would be a welcome addition.

jfinkels avatar Jun 14 '22 01:06 jfinkels

Alright, I'll clean that up and make a PR.

ArtemSkrebkov avatar Jun 14 '22 08:06 ArtemSkrebkov