dbpedia-docs
more efficient re-compression from bzip2 to gzip
- Re-compression should be performed by piping the decompressed data stream from bzip2 directly into gzip, so that the uncompressed data is never written to disk. Something along the lines of:
for a in *.bz2; do
    # decompress to stdout, compress from stdin, and swap the .bz2 suffix for .gz
    bunzip2 -c "$a" | gzip -c > "${a%.bz2}.gz"
done
- If pbzip2 is available on the machine, using pbzip2 instead of bzip2 can drastically reduce the time needed to decompress bzip2 archives, so it should be used when available:
BZIP2_CMD=$(command -v pbzip2 > /dev/null 2>&1 && echo 'pbzip2' || echo 'bzip2')
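The two points above can be combined into a single script. The following is only a minimal sketch, assuming bash and that `bzip2` and `gzip` are installed. The function name `recompress_dir` is hypothetical and not part of the original notes, and `command -v` is used here in place of `which` as a more portable lookup.

```shell
#!/usr/bin/env bash
# Sketch: re-compress every .bz2 file in a directory to .gz without
# writing the uncompressed data to disk.

# Make a pipeline fail if the decompressor fails, not only if gzip fails.
set -o pipefail

# Prefer pbzip2 when it is installed, otherwise fall back to bzip2.
if command -v pbzip2 > /dev/null 2>&1; then
    BZIP2_CMD=pbzip2
else
    BZIP2_CMD=bzip2
fi

# recompress_dir is a hypothetical helper name chosen for this sketch.
# Usage: recompress_dir [directory]   (defaults to the current directory)
recompress_dir() {
    local dir="${1:-.}" src dst
    for src in "$dir"/*.bz2; do
        # When nothing matches, the glob stays literal; skip that case.
        [ -e "$src" ] || continue
        dst="${src%.bz2}.gz"
        # -d decompresses and -c writes to stdout; gzip -c compresses stdin to stdout.
        if ! "$BZIP2_CMD" -dc "$src" | gzip -c > "$dst"; then
            echo "re-compression failed for $src" >&2
            rm -f "$dst"   # do not leave a truncated .gz behind
            return 1
        fi
    done
}
```

Calling `recompress_dir /path/to/dumps` (the path is a placeholder) leaves the original `.bz2` files in place and writes a `.gz` file next to each of them.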