GooglePhotosTakeoutHelper
Inconsistency of "duplicate removal" + missing transparency
Hi all,
I just tried the script on a sample file set of mine, but I am struggling with the logic and the lack of transparency when it comes to the removal of duplicates.
4116 Input files (according to the beginning of the script)
3482 calculating small hashes
580 full hashes
574 removing duplicates
Final statistics:
Removed duplicates: 630
- How are these numbers related to each other? I cannot see right away how they match up:
- number of input files
- number of calculated small hashes
- full hashes
- removing duplicates
- removed duplicates (final statistics) --> this number is larger than "removing duplicates" (630 vs. 574) (?)
- How can I locate the deleted files to validate this manually? I am clearly no expert in this, but it seems odd to me that >15% of the files in my example should be duplicates. Without reviewing this myself, it feels like I will lose data. How can I make sure that no false positives occur in the duplicate detection?
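To make clearer what kind of manual check I have in mind, here is a minimal sketch in Python. It is not the script's actual logic (I do not know how it decides internally); it simply groups files by size, then by a SHA-256 hash of the full content, and reports the groups that are byte-for-byte identical. The function names `file_sha256` and `find_duplicate_groups` are my own, made up for this example.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_sha256(path, chunk_size=1 << 20):
    """Hash the full file content in chunks so large videos fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicate_groups(root):
    """Return lists of paths under `root` whose contents are identical."""
    # Step 1: files of different sizes cannot be identical, so group by size.
    by_size = defaultdict(list)
    for p in Path(root).rglob("*"):
        if p.is_file():
            by_size[p.stat().st_size].append(p)

    # Step 2: hash only the files that share their size with another file.
    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for p in paths:
            by_hash[file_sha256(p)].append(p)

    # Keep only groups with more than one file: these are true duplicates.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicate_groups` on a copy of the input folder and printing each group would give a list I could compare by hand against what the script removed. It only reads files and never deletes anything.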
Would appreciate any support. Thanks