The best solution for finding and removing duplicate files.
therube
Posts: 634 Joined: Tue Jun 28, 2011 4:38 pm
by therube » Sun Jan 08, 2012 5:49 pm
RFE: Remove "Noise" from File Names
Or perhaps instead of "noise" it could be considered a "sounds-like", or "similar name" function.
Could come in handy?
Such that ...
Code:
video_01(4).wmv == video_01(15).wmv == video_01.
Where the (4) & (15) are considered "noise".
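A minimal sketch of that idea in Python, assuming the "noise" is a parenthesized number at the very end of the name, just before the extension. The helper name `strip_counter` is hypothetical, for illustration only; it is not a function of the program under discussion.

```python
import os
import re

# Assumption: "noise" is a "(digits)" group at the very end of the stem,
# as in the video_01(4).wmv example.
_COUNTER = re.compile(r"\(\d+\)$")

def strip_counter(filename: str) -> str:
    """Return filename with a trailing "(n)" counter removed from the stem."""
    stem, ext = os.path.splitext(filename)
    return _COUNTER.sub("", stem) + ext

print(strip_counter("video_01(4).wmv"))
print(strip_counter("video_01(15).wmv"))
```

Both calls reduce to the same name, so the two files would compare as equal on name.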
Similarly ...
Code:
2010_presidential_debate.pdf == 2010 presidential debate.pdf == 2010-Presidential.Debate.pdf
Perhaps "noise" characters could be dropped altogether & the entire name concatenated into a single string, so that all of the above resolve (internally) to "2010presidentialdebate.pdf".
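Roughly, that normalization could look like the Python sketch below. It assumes "noise" means any character in the stem that is not a letter or digit, that case is ignored, and that the extension is kept as-is apart from lowercasing. `normalize_name` is a made-up name, not an existing function.

```python
import os
import re

def normalize_name(filename: str) -> str:
    """Collapse a file name to a lowercase key with separators removed.

    Assumption: everything in the stem that is not a letter or digit
    (spaces, underscores, hyphens, dots, brackets) counts as "noise".
    """
    stem, ext = os.path.splitext(filename)
    # \W matches non-word characters; "_" is added because \w includes it.
    stem = re.sub(r"[\W_]+", "", stem.lower())
    return stem + ext.lower()

names = [
    "2010_presidential_debate.pdf",
    "2010 presidential debate.pdf",
    "2010-Presidential.Debate.pdf",
]
for name in names:
    print(name, "->", normalize_name(name))
```

All three names reduce to "2010presidentialdebate.pdf". Note that digits are kept, so a counter like (4) would leave a stray "4" behind unless it is stripped first.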
And then even further extended such that ...
Code:
2010presidentialdebate.pdf == 2011presidentialdebate.pdf == 2010presidentialdebateiowa.pdf
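One way this "close enough" comparison might be sketched is with `difflib.SequenceMatcher` from Python's standard library. The 0.85 threshold and the function name `similar_names` are assumptions chosen for illustration; this is not how the program's own fuzzy search is known to work.

```python
from difflib import SequenceMatcher

def similar_names(a: str, b: str, threshold: float = 0.85) -> bool:
    """Return True if two (already normalized) names are close enough.

    Assumption: the cutoff of 0.85 is arbitrary and would need tuning.
    """
    return SequenceMatcher(None, a, b).ratio() >= threshold

base = "2010presidentialdebate.pdf"
for other in ("2011presidentialdebate.pdf", "2010presidentialdebateiowa.pdf"):
    print(base, "~", other, ":", similar_names(base, other))
```

A lower threshold catches more variants but also produces more false matches, so names that are too different would still not be grouped.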
DV
Posts: 78 Joined: Fri Jun 10, 2011 9:00 am
by DV » Tue Jan 10, 2012 10:54 pm
Good ideas. The fuzzy (similar) search kind of does this: it'll ignore numbers, etc., but it won't work if the names are too different.