bulk_extractor
bulk_extractor copied to clipboard
read error exception in Windows with unicode file names
have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>
Was also getting the error with other unicode characters not just the copyright symbol.
Wow!!! Great news! I will take a look.
Sent from my phone!
On May 9, 2017, at 2:03 PM, RandomRhythm [email protected] wrote:
have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>
Was also getting the error with other unicode characters not just the copyright symbol.
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.
Did it crash, or keep going?
Sent from my phone!
On May 9, 2017, at 2:03 PM, RandomRhythm [email protected] wrote:
have a file named ©_test.txt and when running bulk extractor against it (technically against the folder recursively) I get the following error: Exception read error skipping (D:\Temp\test/┬⌐_test.txt|0) <debug:exception name='read error' pos0='(D:\Temp\test/©_test.txt|0)' >read error</debug:exception>
Was also getting the error with other unicode characters not just the copyright symbol.
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.
It kept going but didn't process the files with unicode in the file name. Renaming to remove unicode characters was a successful workaround.