mamba icon indicating copy to clipboard operation
mamba copied to clipboard

UTF8 problems when extracting files with special characters

Open wolfv opened this issue 1 year ago • 0 comments

Troubleshooting docs

  • [X] My problem is not solved in the Troubleshooting docs

Anaconda default channels

  • [X] I do NOT use the Anaconda default channels (pkgs/* etc.)

How did you install Mamba?

Micromamba

Search tried in issue tracker

yes

Latest version of Mamba

  • [X] My problem is not solved with the latest version

Tried in Conda?

I do not have this problem with Conda, just with Mamba

Describe your issue

Unfortunately, libarchive doesn't seem to be very good at handling UTF8 encoded paths in packages (.tar.bz2 files). I created a test-case in rattler-build to create a package with weird file names (utf8, long files, etc.). Unfortunately micromamba fails at extracting the files correctly. Specifically a file with File(Glob …).tmSnippet doesn't work since the ellipsis is UTF8 encoded.

We might want to consider to use our implementation to create the files instead of relying on the archive_writer.

I also tested tar.exe (pre-installed on Windows) and it has the same UTF problem when extracting the .tar.bz2 file.

mamba info / micromamba info

No response

Logs

No response

environment.yml

No response

~/.condarc

No response

wolfv avatar Jan 30 '24 12:01 wolfv