BirdNET-Analyzer
analyze.py seems to skip many files if it encounters a corrupt file
I analyzed about 2.4 million .wav files in chunks of about 250,000 to 300,000 files each. Two of these chunks contained 4 corrupted files in total. Because of these 4 corrupted files, about 6,550 other, uncorrupted files could not be analyzed. It seems that analyze.py gets some sort of "hiccup" when it encounters a corrupt file and skips hundreds of subsequent files that have no problem.
Is this something that could be optimized in a newer version?
If the Analyzer encounters an error or corrupt file, the file just gets skipped. Could you check your error_log.txt and log.txt for error codes if this happens again?
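For readers following along, the skip-on-error behavior described here can be sketched roughly as below. This is only an illustration of the general pattern, not BirdNET-Analyzer's actual code: `analyze_batch` and `analyze_one` are hypothetical names, and the log file name is simply borrowed from the one mentioned in this thread.

```python
import traceback


def analyze_batch(files, analyze_one, error_log="error_log.txt"):
    """Run analyze_one on each file; a failure affects only that file.

    `analyze_one` is a hypothetical callable standing in for the real
    per-file analysis. Returns (ok, failed) lists of file paths.
    """
    ok, failed = [], []
    for path in files:
        try:
            analyze_one(path)
            ok.append(path)
        except Exception:
            # Record the failing file and its traceback, then continue
            # with the next file instead of aborting the whole batch.
            failed.append(path)
            with open(error_log, "a", encoding="utf-8") as log:
                log.write(f"{path}\n{traceback.format_exc()}\n")
    return ok, failed
```

With this structure, one corrupt file should end up in `failed` and in the log while every other file is still processed. If many good files go missing, that would suggest the error is escaping at a higher level, for example in a worker process.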
I encountered a similar issue. I had a folder with 25 audio files, but none of the files were being uploaded for analysis. After doing some checks, I discovered that one of the files was corrupt. The corrupt file was not being skipped, and as a result, the other files were not uploaded for analysis either. Once I deleted the corrupt file, the entire folder uploaded successfully with all the remaining files.
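As a possible workaround until the cause is found, a folder can be pre-scanned for unreadable files before it is handed to the Analyzer. The sketch below uses only Python's standard `wave` module; `find_unreadable_wavs` is a made-up helper name, not part of BirdNET-Analyzer. Note that `wave` only understands uncompressed PCM WAV, so a flagged file may simply be in a format it does not support rather than truly corrupt.

```python
import wave
from pathlib import Path


def find_unreadable_wavs(folder):
    """Return (path, reason) pairs for .wav files that fail a basic read check."""
    bad = []
    for path in sorted(Path(folder).rglob("*")):
        if not path.is_file() or path.suffix.lower() != ".wav":
            continue
        try:
            with wave.open(str(path), "rb") as w:
                frames = w.getnframes()
                data = w.readframes(frames)
                expected = frames * w.getsampwidth() * w.getnchannels()
            # A truncated file opens fine but yields less audio data
            # than its header declares.
            if len(data) != expected:
                bad.append((path, "truncated audio data"))
        except (wave.Error, EOFError, OSError) as exc:
            bad.append((path, repr(exc)))
    return bad
```

Running `find_unreadable_wavs("path/to/folder")` and moving the reported files aside before starting the analysis would avoid the situation described above, assuming the corrupt file is of a kind this check can detect.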
As I said, please check your error_log.txt and log.txt in the install directory for error codes if this happens again, and paste the contents into this issue. Or provide us with a corrupt file, for us to run tests.
This is for example a corrupt file: Corrupt_Audio.zip
I just inserted your corrupt file into a batch. The current repo correctly skips the file and analyzes the rest. Is this still an issue for you on the current main?