stash icon indicating copy to clipboard operation
stash copied to clipboard

[Bug Report] Scene Duplicate Checker doesn't show 100% identical duplicates

Open becks0815 opened this issue 7 months ago • 5 comments

Describe the bug If you import the same file/Scene twice, the scene is only shown one time in the scene overview, and the Scene duplicate checker doesn't show the two files as being "similar" or identical, and you have to use different tools to get rid of the dupes.

To Reproduce Import a file/scene with a known stash id twice under two different file names, identificate them and look up the Scene duplicate checker under settings -> tools. The two files won't appear in the list, despite being true duplicates.

Expected behavior The two file should be shown and flagged as duplicates.

Screenshots Will add some screenshots as soon as I find the buttons

Stash Version: 0.28.1

Desktop (please complete the following information):

  • OS: Linux
  • Browser Firefox

becks0815 avatar Apr 11 '25 04:04 becks0815

Screenshot of the database: two files are assigned to the same scene_id (with the identical stash id): https://i.imgur.com/PydxPKE.png

The hashes shown for both files are identical, so these two files should appear in the Scene duplicate checker tool https://i.imgur.com/a2szn6J.png

Issue: these files are not shown in the dupe checker, which is strange, because the hashes match.

becks0815 avatar Apr 11 '25 04:04 becks0815

You can filter for file count greater than 1 in the scene view to find them

smith113-p avatar Apr 11 '25 22:04 smith113-p

Your approach doesn't work.

It also includes scenes with different content, assigned to the same scene. This can happen, especially with movies where different scenes are assigned to the same scene id. It is the reason why the hash must be taken into account, too. Otherwise, you have to manually open each file and compare the content.

becks0815 avatar Apr 13 '25 18:04 becks0815

100% identical duplicates by oshash are merged automatically

feederbox826 avatar Apr 14 '25 01:04 feederbox826

100% identical duplicates by oshash are merged automatically

This is correct. I can see the results in the database (see my screenshots), but it doesn't help me to identify and remove duplicates from my hard drives. Removing duplicate files to reduce the amount of data is the most important aspect, at elast for me. And for me, duplicates are not only similar files (=same or different stash id and different but similar ohash), but also identical files (=same stash id, same ohash, different file id).

What I don't understand is. If the current approach is kept in place, to suppress showing identical duplicated files in the duplicate check tool, is the meaning of "exact" as possible selection method in its options. How can something with identical file size, identical ohash and identical scene id not be considered as "exact" match?

Btw.: workaround is to install video duplicate file finder. It does not only show similar files as matches, but also identical ones.

becks0815 avatar Apr 14 '25 05:04 becks0815