lzbench Add support for Tamp (low-memory compression intended for embedded targets)

Tamp is a low-memory, DEFLATE-inspired lossless compression library intended for embedded targets. It is meant for situations where heatshrink was previously used. Tamp offers higher compression ratios, better tooling, a better API, a small firmware size, and a small memory footprint (barely larger than the window buffer). Tamp has an easy-to-install CLI, a Python library, and a C implementation.

The design priorities of Tamp (in order) are:

Low memory usage.
Good compression ratios.
Small firmware size.

Here's example output from running the following on an M1 MacBook Air; the typical use case is level 10:

$ ./lzbench -t16,16 -etamp silesia.tar
lzbench 1.8 (64-bit MacOS)  (null)
Assembled by P.Skibinski

Compressor name         Compress. Decompress. Compr. size  Ratio Filename
memcpy                  29634 MB/s 29552 MB/s   211975168 100.00 silesia.tar
tamp 1.3.1 -8            13.0 MB/s   188 MB/s   114873328  54.19 silesia.tar
tamp 1.3.1 -9            10.2 MB/s   196 MB/s   107632159  50.78 silesia.tar
tamp 1.3.1 -10           6.24 MB/s   202 MB/s   102660646  48.43 silesia.tar
tamp 1.3.1 -11           3.67 MB/s   206 MB/s    99280285  46.84 silesia.tar
tamp 1.3.1 -12           2.16 MB/s   216 MB/s    95567672  45.08 silesia.tar
tamp 1.3.1 -13           1.23 MB/s   227 MB/s    93114331  43.93 silesia.tar
tamp 1.3.1 -14           0.71 MB/s   236 MB/s    91469264  43.15 silesia.tar
tamp 1.3.1 -15           0.40 MB/s   243 MB/s    90653607  42.77 silesia.tar
done... (cIters=1 dIters=1 cTime=16.0 dTime=16.0 chunkSize=1706MB cSpeed=0MB)

Feb 25 '24 21:02 BrianPugh

It cannot decompress its own 'files'. Also, levels below 8 crash.

Mar 10 '24 01:03 tansy

It cannot decompress its own 'files'.

Can you elaborate? What issues are you seeing?

Also levels below 8 crash.

This is intentional; in Tamp, levels below 8 are invalid. This is why the range begins at 8 in lzbench.h.

Mar 10 '24 03:03 BrianPugh

It seems unable to correctly decompress compressed data. lzbench checks the correctness of the decompressed result and reports an error if it differs from the original. In simple terms, your decompressor does not restore its own compressed stream to the original data.

$ lzbench-tamp -etamp reymont 
lzbench 1.8 (32-bit Linux)

Compressor name         Compress. Decompress. Compr. size  Ratio Filename
tamp 1.3.1 -8            5.12 MB/s      ERROR     3281840  49.52 reymont
tamp 1.3.1 -9            3.38 MB/s      ERROR     3014779  45.49 reymont
tamp 1.3.1 -10           2.14 MB/s      ERROR     2847981  42.97 reymont
^C

Mar 10 '24 18:03 tansy

I know it's not a great response, but it works fine on my M1 macOS machine :D

I'll try to replicate when I get my hands on my Linux box in a week.

Mar 11 '24 14:03 BrianPugh

I just ran this on a 64-bit Linux machine without issues:

$ ./lzbench -t16,16 -etamp silesia.tar
lzbench 1.8 (64-bit Linux)  Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz
Assembled by P.Skibinski

Compressor name         Compress. Decompress. Compr. size  Ratio Filename
memcpy                  13128 MB/s 13567 MB/s   211975168 100.00 silesia.tar
tamp 1.3.1 -8            9.19 MB/s   101 MB/s   114873328  54.19 silesia.tar
tamp 1.3.1 -9            5.82 MB/s   104 MB/s   107632159  50.78 silesia.tar
tamp 1.3.1 -10           3.53 MB/s   109 MB/s   102660646  48.43 silesia.tar

I then cross-compiled it for 32-bit:

$ ./lzbench -t16,16 -etamp reymont
lzbench 1.8 (32-bit Linux)  Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz
Assembled by P.Skibinski

Compressor name         Compress. Decompress. Compr. size  Ratio Filename
memcpy                  15503 MB/s 15493 MB/s     6627202 100.00 reymont
tamp 1.3.1 -8            9.16 MB/s      ERROR     3281840  49.52 reymont
tamp 1.3.1 -9            6.23 MB/s      ERROR     3014779  45.49 reymont
tamp 1.3.1 -10           4.00 MB/s      ERROR     2847981  42.97 reymont

Investigating what the cause could be.

Mar 22 '24 22:03 BrianPugh

@tansy this should be fixed now by 2a9d7d9. The issue was that I was passing a size_t * that pointed at the int64_t holding the reported size. On a 32-bit build, size_t is only 4 bytes, so only the lower 4 bytes of the int64_t were being updated. That would not be an issue if the int64_t had been initialized to 0, but it was never explicitly initialized, so the upper 4 bytes were just whatever garbage was on the stack.
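
For anyone who wants to see the effect in isolation, here is a minimal sketch of the failure mode, not the actual lzbench or Tamp code. Everything in it is hypothetical: `size32_t` stands in for a 32-bit `size_t` so the behavior also reproduces on a 64-bit host, the 0xAB fill stands in for stack garbage, and `fixed_reported_size` shows one possible way to avoid the problem, which is not necessarily what 2a9d7d9 does.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for size_t on a 32-bit target. */
typedef uint32_t size32_t;

/* Simulates a callee that reports a size through a 32-bit pointer:
 * it writes only sizeof(size32_t) bytes, whatever the storage behind it is. */
void report_size_32(void *out, size32_t value) {
    memcpy(out, &value, sizeof value);
}

/* Buggy pattern: 64-bit storage that was never zeroed, filled through a
 * 32-bit writer. Half of the bytes keep their old contents. */
int64_t buggy_reported_size(size32_t actual) {
    int64_t size;
    memset(&size, 0xAB, sizeof size);  /* pretend this is stack garbage */
    report_size_32(&size, actual);
    return size;
}

/* One possible fix (an assumption): give the callee a variable of its own
 * type, then widen the result. */
int64_t fixed_reported_size(size32_t actual) {
    size32_t size = 0;
    report_size_32(&size, actual);
    return (int64_t)size;
}

/* Usage: the buggy path misreports the size, the fixed path does not. */
void demo_size_reporting(void) {
    assert(buggy_reported_size(3281840u) != 3281840);
    assert(fixed_reported_size(3281840u) == 3281840);
}
```

A wrong reported size of this kind would be consistent with the ERROR entries in the 32-bit runs above, since lzbench compares the decompressed result against the original.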

Mar 22 '24 22:03 BrianPugh

Yes, it works now.

Mar 24 '24 08:03 tansy

This is intentional; in Tamp, levels below 8 are invalid.

What's the rationale behind that?

Mar 24 '24 09:03 tansy

With Tamp, the compression level directly corresponds to the window size. Tamp's header uses 3 bits to represent the window size:

Number of bits, minus 8, used to represent the size of the shifting window. e.g. A 12-bit window is encoded as the number 4, 0b100. This means the smallest window is 256 bytes, and largest is 32768.

For the API, it was decided that the user should just provide values in the range [8, 15] instead of [0, 7], as those values are more meaningful.
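
As a rough illustration of the mapping described above, the sketch below converts between the user-facing window bits, the 3-bit header value, and the window size in bytes. The function and constant names are made up for this comment and are not part of Tamp's API.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical names; the range comes from the 3-bit header field. */
enum { MIN_WINDOW_BITS = 8, MAX_WINDOW_BITS = 15 };

/* User-facing window bits [8, 15] -> 3-bit header value [0, 7]. */
uint8_t window_bits_to_field(int window_bits) {
    assert(window_bits >= MIN_WINDOW_BITS && window_bits <= MAX_WINDOW_BITS);
    return (uint8_t)(window_bits - MIN_WINDOW_BITS);
}

/* 3-bit header value [0, 7] -> window bits [8, 15]. */
int field_to_window_bits(uint8_t field) {
    assert(field <= MAX_WINDOW_BITS - MIN_WINDOW_BITS);
    return (int)field + MIN_WINDOW_BITS;
}

/* Size of the window buffer the caller has to provide, in bytes. */
size_t window_size_bytes(int window_bits) {
    assert(window_bits >= MIN_WINDOW_BITS && window_bits <= MAX_WINDOW_BITS);
    return (size_t)1 << window_bits;
}
```

With these helpers, a 12-bit window maps to the header value 4 (0b100), and the smallest and largest windows come out as 256 and 32768 bytes, matching the description above.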

Mar 24 '24 16:03 BrianPugh

It may be more meaningful to you, but not necessarily to the average user, who doesn't (even) know what the sliding window is.

Mar 26 '24 12:03 tansy

Tamp doesn't perform any allocations, so the user must provide the window buffer. It is much more natural to do:

TampCompressor compressor;
const int WINDOW_BITS = 10;
// The caller owns the window buffer: 1 << 10 = 1024 bytes.
unsigned char *window_buffer = malloc(1 << WINDOW_BITS);
TampConf conf = {
    .window = WINDOW_BITS,  // this will change in example below
    .literal = 8,
};
tamp_compressor_init(&compressor, &conf, window_buffer);

rather than

    .window = WINDOW_BITS - 8,  // magical number 8

In this PR, we could change the range expressed by lzbench to something like [1, 8] by adding a constant, but that seems unnecessarily confusing to me.

Mar 26 '24 15:03 BrianPugh