UTF-8 problems when extracting files with special characters
Troubleshooting docs
- [X] My problem is not solved in the Troubleshooting docs
Anaconda default channels
- [X] I do NOT use the Anaconda default channels (pkgs/* etc.)
How did you install Mamba?
Micromamba
Search tried in issue tracker
yes
Latest version of Mamba
- [X] My problem is not solved with the latest version
Tried in Conda?
I do not have this problem with Conda, just with Mamba
Describe your issue
Unfortunately, libarchive doesn't seem to handle UTF-8 encoded paths in packages (.tar.bz2 files) very well. I created a test case in rattler-build that builds a package with unusual file names (UTF-8 characters, long file names, etc.), and micromamba fails to extract the files correctly. Specifically, a file named File(Glob …).tmSnippet is not extracted correctly because the ellipsis is a non-ASCII character encoded as UTF-8.
We might want to consider using our own implementation to create the files instead of relying on the archive_writer.
I also tested tar.exe (pre-installed on Windows), and it has the same UTF-8 problem when extracting the .tar.bz2 file.
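For anyone who wants to reproduce this without rattler-build, here is a minimal sketch that builds a .tar.bz2 containing a member with the same file name, using only Python's standard tarfile module. This is not the rattler-build test case itself: the helper names (build_archive, member_names), the payload, and the choice of the pax format with UTF-8 encoding are my assumptions, and the real package may use a different tar flavor.

```python
import io
import tarfile

# File name from the report; the ellipsis is U+2026 (three bytes in UTF-8).
NAME = "File(Glob \u2026).tmSnippet"


def build_archive(archive_path, member_name=NAME, payload=b"test\n"):
    """Write a .tar.bz2 holding one regular file called member_name.

    Hypothetical helper. The member is added from memory, so no file
    with a non-ASCII name has to exist on the local filesystem.
    """
    info = tarfile.TarInfo(name=member_name)
    info.size = len(payload)
    # Assumption: pax format, which stores non-ASCII paths as UTF-8
    # in an extended header.
    with tarfile.open(
        archive_path, "w:bz2", format=tarfile.PAX_FORMAT, encoding="utf-8"
    ) as tar:
        tar.addfile(info, io.BytesIO(payload))


def member_names(archive_path):
    """Return the member names as Python's own tar reader decodes them."""
    with tarfile.open(archive_path, "r:bz2", encoding="utf-8") as tar:
        return tar.getnames()
```

The resulting archive can then be handed to the extractor under test (micromamba, tar.exe, ...) and the extracted file name compared against NAME. member_names only shows what Python's reader sees, so it is a sanity check on the archive, not on libarchive.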
mamba info / micromamba info
No response
Logs
No response
environment.yml
No response
~/.condarc
No response