stash
stash copied to clipboard
[Bug Report] Scene Duplicate Checker doesn't show 100% identical duplicates
Describe the bug If you import the same file/Scene twice, the scene is only shown one time in the scene overview, and the Scene duplicate checker doesn't show the two files as being "similar" or identical, and you have to use different tools to get rid of the dupes.
To Reproduce Import a file/scene with a known stash id twice under two different file names, identificate them and look up the Scene duplicate checker under settings -> tools. The two files won't appear in the list, despite being true duplicates.
Expected behavior The two file should be shown and flagged as duplicates.
Screenshots Will add some screenshots as soon as I find the buttons
Stash Version: 0.28.1
Desktop (please complete the following information):
- OS: Linux
- Browser Firefox
Screenshot of the database: two files are assigned to the same scene_id (with the identical stash id): https://i.imgur.com/PydxPKE.png
The hashes shown for both files are identical, so these two files should appear in the Scene duplicate checker tool https://i.imgur.com/a2szn6J.png
Issue: these files are not shown in the dupe checker, which is strange, because the hashes match.
You can filter for file count greater than 1 in the scene view to find them
Your approach doesn't work.
It also includes scenes with different content, assigned to the same scene. This can happen, especially with movies where different scenes are assigned to the same scene id. It is the reason why the hash must be taken into account, too. Otherwise, you have to manually open each file and compare the content.
100% identical duplicates by oshash are merged automatically
100% identical duplicates by oshash are merged automatically
This is correct. I can see the results in the database (see my screenshots), but it doesn't help me to identify and remove duplicates from my hard drives. Removing duplicate files to reduce the amount of data is the most important aspect, at elast for me. And for me, duplicates are not only similar files (=same or different stash id and different but similar ohash), but also identical files (=same stash id, same ohash, different file id).
What I don't understand is. If the current approach is kept in place, to suppress showing identical duplicated files in the duplicate check tool, is the meaning of "exact" as possible selection method in its options. How can something with identical file size, identical ohash and identical scene id not be considered as "exact" match?
Btw.: workaround is to install video duplicate file finder. It does not only show similar files as matches, but also identical ones.