rmast

Results 184 comments of rmast

Unfortunately no change whatsoever in compression factor: ``` robert@robert-virtual-machine:~$ pdfcomp out.pdf out_ckakadu.pdf Compression factor: 47.39642946807007 robert@robert-virtual-machine:~$ which pdfcomp /usr/local/bin/pdfcomp robert@robert-virtual-machine:~$ ls -al /usr/local/bin/pdfcomp -rwxr-xr-x 1 root root 1003 jun 12...

An improvement with the new raw copied version of [compress-pdf-images](https://raw.githubusercontent.com/internetarchive/archive-pdf-tools/pdf-metadata-tooling/bin/compress-pdf-images) ``` pdfcomp out.pdf out_ckakadu3.pdf Compression factor: 105.51087492544183 robert@robert-virtual-machine:~$ pdfcomp out.pdf out_ckakadu3.pdf Compression factor: 105.51087492544183 robert@robert-virtual-machine:~$ pdfimages -list out_ckakadu3.pdf page num...

DjVu uses about 25 dpi for the foreground-picture: https://www.cs.tufts.edu/~nr/cs257/archive/leon-bottou/jei-1998.ps.gz ![image](https://user-images.githubusercontent.com/3341558/173438040-e3c0b83b-1b9e-4973-9256-b6958a05d770.png)

With fg_downsample=12 inside compress-pdf-images robert@robert-virtual-machine:~$ pdfcomp out.pdf out_ckakadu4.pdf Compression factor: 300.4531241641256 However one unacceptable artifact appears, the rest of the page is fine: ![image](https://user-images.githubusercontent.com/3341558/173441671-8381cd7d-72c5-41d9-87e5-cb7e6873bc58.png) Where the original shows: ![image](https://user-images.githubusercontent.com/3341558/173441774-59333752-0523-45c2-9358-6a7a4871ee12.png) ```...

I've run the bankstatement in the fully open source didjvu, It does not have the faint text nor the strange artifact shown due to bad foreground/background choices. robert@robert-virtual-machine:~/didjvu$ ./didjvu encode...

I only added fg_downsample=12 inside compress-pdf-images close to fg_downsample=3 I think the most convenient goal for me would be to be able to scan in paperwork that is handed over...

I just tried some bg_slope values, and 43000 results in this: [ocrmypdf_compkakadufullfgbgslope43000.pdf](https://github.com/internetarchive/archive-pdf-tools/files/8984080/ocrmypdf_compkakadufullfgbgslope43000.pdf) ``` robert@robert-virtual-machine:~/Downloads$ pdfimages -list ocrmypdf_compkakadufullfgbgslope43000.pdf page num type width height color comp bpc enc interp object ID x-ppi...

Jbig2enc has to be compiled manually against the right version of Libleptonica as there is no packaged version. As far as I can see jbig2enc is updated until Libleptonica 1.83...

As Mac os is a kind of Unix I would expect all components to be compilable, all sources are available, but I don't know whether anyone has spent the effort...

MacOS supports these via Homebrew: https://ocrmypdf.readthedocs.io/en/latest/jbig2.html