Am I missing something? (Wrong duplicates despite "same content" search)
Posted: Fri Jul 16, 2021 9:16 pm
Dear Community,
I did not find a suitable thread for this, so I hope it's OK that I start a new one:
I use Duplicate Cleaner A LOT, and recently I let it scan through my whole data for 3 days: MD5 "same content" search, limited to files from 0 to 4 GB.
After it found literally millions of duplicates (yes, I am a data freak, I know), I found one duplicate group of 20 files which are definitely NOT duplicates! All of them have exactly the same size because they are split camcorder rips from my father's camcorder cassettes, so they all have pretty much the same SIZE. But the CONTENT must be different, since they are all distinct videos.
Now, I thought MD5 makes sure that if 2 files are marked as duplicates, they are REALLY the same, as in "both are the same video" and not just "both are the same size". Hence the "same content" search and not the "same size" search.
Is there something I am missing here? Is MD5 only able to find duplicates up to a certain number of megabytes/gigabytes?
Most of the duplicates are of course RIGHT, but this one wrong group really messes up my trust in the results, and I must say I have abandoned a lot of programs after such errors. Duplicate Cleaner is by far my favourite after all these years. It would really be sad if I had found some sort of bug, because I delete a lot of stuff on a daily basis based on what DC reports.
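For anyone who wants to cross-check a suspicious group like this independently of Duplicate Cleaner, here is a minimal Python sketch. It is not how DC works internally (I don't know that); it just hashes each file completely in chunks, so file size should not matter, and then compares the files byte for byte. The names `md5_of_file` and `check_group` are made up for this example.

```python
import filecmp
import hashlib
from itertools import combinations


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the MD5 hex digest of the whole file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until read() returns an empty bytes object (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_group(paths):
    """Return every pair in the group that is NOT byte-for-byte identical."""
    mismatches = []
    for first, second in combinations(paths, 2):
        # shallow=False forces a real content comparison, not just a stat check.
        if not filecmp.cmp(first, second, shallow=False):
            mismatches.append((first, second))
    return mismatches
```

If `check_group` returns an empty list for the 20 files, they really are identical on disk. If it returns pairs, the files differ and the duplicate group is wrong; comparing the `md5_of_file` values for those pairs would then show whether the full-file hashes differ as well.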