deterministic_zip
deterministic_zip copied to clipboard
A tool to generate consistent zip files (useful for terraform updates of aws lambda functions).
deterministic_zip
A tool to generate consistent zip files.
This tool was specifically built to prevent zip file changes from triggering
AWS Lambda function updates when running terraform apply. Before this change,
every re-build of the zipfile would result in a different zip, even if its
contents had not changed.
Requirements
This tool currently only runs on python3.7+ in order to use deflate compression level 9.
Installation
pip install deterministic_zip
Creating a deterministic_zip
Run the following, and verify that your zip produces the same sha256 hash:
echo "The first file." > first
echo "The second file." > second
deterministic_zip archive.zip first second
sha256sum archive.zip
If you have \n line endings the result should be:
3afbd7c9b42bd5539ffd5c40499d3d1825157ed83791dce8d7ff2694189d28d6
If you have \r\n line endings (Windows) the result should be:
40e16270d62f15e7a192e88b1b301fa6540c86e7e897036b56be513341d376ed
How does it work?
Great question! There are three tricks to building a deterministic zip:
-
Files must be added to the zip in the same order. Directory iteration order may vary across machines, resulting in different zips.
deterministic_zipsorts all files before adding them to the zip archive. -
Files in the zip must have consistent timestamps. If I share a directory to another machine, the timestamps of individual files may differ, despite having identical content. To achieve timestamp consistency,
deterministic_zipsets the timestamp of all added files to2019-01-01 00:00:00. -
Files in the zip must have consistent permissions. File permissions look like
-rw-r--r--for a file that is readable by all users, and only writable by the user who owns the file. Similarly executable files might have permissions that look like:-rwxr-xr-xor-rwx------.deterministic_zipsets the permission of all files added to the archive to either-r--r--r--, or-r-xr-xr-x. The latter is only used when the user runningdeterministic_ziphas execute access on the file.
Note: deterministic_zip does not modify nor update timestamps of any
files it adds to archives. The techniques used above apply only to the copies
of files within archives deterministic_zip creates.