Page 1 of 1

Live Duplicates discovered so far

Posted: Mon Feb 24, 2025 12:39 pm
by TimmyAhShhTimmy
Hello again,

When I'm doing a scan, it can take serveral minutes, but I can't see anywhere in the scan window the number of duplicate files found at any given moment.

This means that it could complete the entire scan and not find any.

This only matters I suppose if i'm pretty sure there ARE in fact duplicates there, and it's not seeing any because my search criteria are incorrect.

It would be good to have a live counter of how many files were being discovered in real time..

ta.

Also, what is 'Similar files names' based on?

For my personal needs, a filemask of the first few letters of each file (adjustable between lets say 0-15) would filter in the files with the same filename at the beginning (then match against video 'duration').

ta.

Re: Live Duplicates discovered so far

Posted: Mon Feb 24, 2025 6:13 pm
by therube
Yep.
You can see counts of items, & some percentages & the like, but any number of duplicates found - until completion.

Wonder if that data is actually available - at least for some searches?
If so, yes, I'd think it to be of value.

For some searches, like a hash, it is going to gather hashes first, before doing any compare thereof, so there wouldn't be intermediary results to display.

Now, you could say that as it gathers each hash, it could then iterate through all previously gathered hash to see if there where any dups & interactively update a count of dups, but I'd think that to be rather inefficient.


Come to think of it, in a separate hash program, where I send it expectant file "pairs" that should return same hashes, I even brought forth that same idea... intermediary results, but on writing above, I realize that too would "fail", because all the results would have yet to be gotten, so in that respect, it too could not present them - yet.

The only difference on that end, is that if it displayed intermediary results, you would see that file name/hash of each file as it is computed, & while it may not "mark" duplicates, one might be able to "view" the results & "see" that duplicates are being returned, even if you would not "know" until all were hashed...