bulk_extractor icon indicating copy to clipboard operation
bulk_extractor copied to clipboard

read error exception in Windows with unicode file names

Open RandomRhythm opened this issue 7 years ago • 3 comments

have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>

Was also getting the error with other unicode characters not just the copyright symbol.

RandomRhythm avatar May 09 '17 18:05 RandomRhythm

Wow!!! Great news! I will take a look.

Sent from my phone!

On May 9, 2017, at 2:03 PM, RandomRhythm [email protected] wrote:

have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>

Was also getting the error with other unicode characters not just the copyright symbol.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

simsong avatar May 09 '17 23:05 simsong

Did it crash, or keep going?

Sent from my phone!

On May 9, 2017, at 2:03 PM, RandomRhythm [email protected] wrote:

have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>

Was also getting the error with other unicode characters not just the copyright symbol.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

simsong avatar May 09 '17 23:05 simsong

It kept going but didn't process the files with unicode in the file name. Renaming to remove unicode characters was a successful workaround.

RandomRhythm avatar May 12 '17 16:05 RandomRhythm