sjisunzip
sjisunzip copied to clipboard
A rather stupid simple program to unbreak s-jis encoded filenames in a zip since nobody else seems to want to do it.
sjisunzip
This is a pretty braindead command line utility that simply forces the encoding to the right values to extract a Shift JIS encoded zip file ('Code page 932') on a western/ansi encoding system.
Usage:
sjisunzip someFile.zip [toFolder]
sjisunzip [-r] someFile.zip
-r: Recode file to {filename}_utf8.zip
Examples:
sjisunzip aFile.zip
sjisunzip aFile.zip MyNewFolder
You can also just drop a zip file onto the program since that'll pass it as the first argument and the contents will be extracted in the same directory.
If you've ever received a zip file from a friend, or the wrong damn gnu mirror or whatever that passed through Japan then you've probably seen garbled filenames
data:image/s3,"s3://crabby-images/4e2e5/4e2e5c6d09b61da684118670b0f54e3463980591" alt="example_1"
Well this program forces the opened zip to the correct encoding then extracts the file to a more reasonable UTF encoding.
data:image/s3,"s3://crabby-images/11efa/11efa1288cfe10cc5d581eae3ab031b95b44bee5" alt="example_2"
You can even just reencode the zip file to a less busted-ass one so you don't have this creeping horror issue in the future
data:image/s3,"s3://crabby-images/ac0a8/ac0a8f9a5c49c7b406e89b2dce62ba404d244903" alt="example_3"
The filenames and paths should be untangled when done.
data:image/s3,"s3://crabby-images/89a2f/89a2f7319afc9b46afcef7740a85a8844a29982d" alt="example_4"
Bonus fact: When this type of transitive corruption occurs, the output characters are called Mojibake. That's almost cute enough to not be awful anymore.