Bulk-extractor generates SegFault midway through processing complex directory hierarchy with -R

Open zdavatz opened this issue 3 years ago • 4 comments

Test case:

./bulk_extractor -o out -R <directory>

output:

zsh: segmentation fault  bulk_extractor

zdavatz avatar Apr 03 '23 06:04 zdavatz

Hi. We are thankful for your bug report.

First, you should know that bulk_extractor has a restart system. You can simply re-run the command (hit up-arrow and return) and the program will continue where it left off, avoiding the data that made it crash.

Second, we are thrilled to get your data. If you don't want to post a link here, please email me at [email protected].

Third, if you want to try to debug this yourself, we have instructions for how to do that:

https://github.com/simsong/bulk_extractor/tree/main/doc/Diagnostic_Notes

Thanks again!

simsong avatar Apr 03 '23 11:04 simsong

Here are all the report files that were generated after rerunning the command: report.xml.tar.gz

zdavatz avatar Apr 04 '23 10:04 zdavatz

I've obtained the .tar file of the directory and could process it without error, so the problem appears to be the directory iterator. If I can replicate this, I'll create a test case for the iterator that makes a large number of files in the directory tree and tries to process them.
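
A minimal sketch of how such a stress tree could be generated, written in Python rather than as part of bulk_extractor's own C++ tests. The helper names `make_tree` and `count_files` are hypothetical, and the depth, fan-out, and files-per-directory values are arbitrary assumptions, not taken from the reporter's data.

```python
import os
import tempfile
from pathlib import Path


def make_tree(root: Path, depth: int, fanout: int, files_per_dir: int) -> int:
    """Create a nested directory tree under root; return the number of files written."""
    root.mkdir(parents=True, exist_ok=True)
    count = 0
    # Write a few small files at this level.
    for i in range(files_per_dir):
        (root / f"file_{i}.txt").write_text(f"sample {i}\n")
        count += 1
    # Recurse into `fanout` subdirectories until depth is exhausted.
    if depth > 0:
        for d in range(fanout):
            count += make_tree(root / f"dir_{d}", depth - 1, fanout, files_per_dir)
    return count


def count_files(root: Path) -> int:
    """Walk the tree recursively and count the files found."""
    return sum(len(files) for _, _, files in os.walk(root))


if __name__ == "__main__":
    # Hypothetical parameters; increase them to stress a recursive iterator harder.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp) / "tree"
        written = make_tree(root, depth=3, fanout=4, files_per_dir=5)
        print(f"wrote {written} files, walked {count_files(root)} files")
```

To try the reproduction, write the tree to a persistent directory instead of a temporary one and point the command from the report, `./bulk_extractor -o out -R <directory>`, at it.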

simsong avatar Apr 05 '23 12:04 simsong

Okay. I'm turning my attention back to this but find that I cannot reproduce it. Do you have a dataset I can use?

simsong avatar Jan 25 '24 03:01 simsong