How to get DupCleaner4 to match these identical images?

The best solution for finding and removing duplicate files.
Post Reply
i-negative
Posts: 5
Joined: Tue Aug 01, 2017 2:28 pm

How to get DupCleaner4 to match these identical images?

Post by i-negative »

Please see the attached screenshot which contains all relevant and available file information for each of two images.

I have tried at least a dozen combinations and variations of search parameters without success. The date taken is identical, and the images are identical, but other relevant tags are inconsistent, including size and date/time modified.

Recommended solutions would be welcome.

As an aside, each of my many searches revealed new sets of duplicates that were not detected by previous searches. While frustrating, it's generally understandable, since I am attempting to clean up a 15-year library of photos that have been moved from hard drive to hard drive and now have unreliable tags.
User avatar
DigitalVolcano
Site Admin
Posts: 1717
Joined: Thu Jun 09, 2011 10:04 am

Re: How to get DupCleaner4 to match these identical images?

Post by DigitalVolcano »

It might be another criteria setting that is affecting it (such as 'Similar Filename' or 'same size') - Paste in a copy of your log file showing the settings for the last unsuccessful run and I'll take a look!
i-negative
Posts: 5
Joined: Tue Aug 01, 2017 2:28 pm

Re: How to get DupCleaner4 to match these identical images?

Post by i-negative »

Keep in mind that at this point, I'm just trying anything.

Is "Date Taken" a searchable field?

Here are the last few searches, with the most recent at the bottom:

===============

01 Aug 2017 10:30:34 am
Scanning: Image mode
- Same size : 50000 Bytes tolerance
- Same modified date/time- Match date only
- Same file extension
- Photo Similarity: Exact match

Included: *.*
E:\Pix\

01 Aug 2017 10:38:32 am
Scan complete
Total time taken: 00:07:57
20622/20622 files scanned (99.6 GB)
2 groups of duplicates
4 files have duplicates(41.8 MB)

--------------------------------------------------------

01 Aug 2017 10:40:38 am
Scanning: Image mode
- Same size : 100000 Bytes tolerance
- Same modified date/time- Match date only
- Same file extension
- Photo Similarity: Exact match

Included: *.*
E:\Pix\

01 Aug 2017 10:40:45 am
Scan complete
Total time taken: 00:00:06
20622/20622 files scanned (99.6 GB)
2 groups of duplicates
4 files have duplicates(41.8 MB)

--------------------------------------------------------

01 Aug 2017 10:41:10 am
Scanning: Image mode
- Same size : 100000 Bytes tolerance
- Same modified date/time- Match date only
- Same file extension
- Photo Similarity: Custom (99%)

Included: *.*
E:\Pix\

01 Aug 2017 12:26:32 pm
Scan complete
Total time taken: 01:45:21
20622/20622 files scanned (99.6 GB)
29 groups of duplicates
65 files have duplicates(556 MB)
--------------------------------------------------------

None of the alleged duplicates are actual duplicates in the last (most recent) search. Because there is a slight size difference (which seems to require an accommodation in the "same size" tolerance) as well as a generally 5-hour difference in the "time modified" field, I get a lot of false positives when I try to combine those parameters.

Thanks for taking a look.
User avatar
DigitalVolcano
Site Admin
Posts: 1717
Joined: Thu Jun 09, 2011 10:04 am

Re: How to get DupCleaner4 to match these identical images?

Post by DigitalVolcano »

Have you tried it with all the options unchecked? Unless there is some other reason, just use the Photo similarity setting, and uncheck all the other options.
i-negative
Posts: 5
Joined: Tue Aug 01, 2017 2:28 pm

Re: How to get DupCleaner4 to match these identical images?

Post by i-negative »

I had tried a basic image similarity match, but it was not providing meaningful results.

However, last night I ran the following scan using the SHA-256 "Content Comparison Type" Advanced Setting with apparent success:

---------------------------------------------

Included: *.*
E:\Pix\

01 Aug 2017 4:51:18 pm
Scanning: Image mode
- Same file extension
- Photo Similarity: Custom (98%)

Included: *.*
E:\Pix\

01 Aug 2017 11:40:29 pm
Scan complete
Total time taken: 06:49:10
20622/20622 files scanned (99.6 GB)
1684 groups of duplicates
3781 files have duplicates(17.8 GB)

----------------------------------------------

That seemed to make a significant difference in the quality of my results.
Post Reply