Suggestion: Compare sets of dirs, highlight different sizes
Posted: 2014-01-20, 18:58 UTC
Background:
I have two huge sets of data on different drives: hundreds of directories containing both files and subdirectories with other files.
Problem:
I would like to know whether the content count of EACH dir in one set differs from that of the same-named dir in the other set.
In other words, I want to know whether the content count of dirs "X, Y, Z..." in set A differs from that of the corresponding dirs "X, Y, Z..." in set B. Any difference would point to a discrepancy between the two sets, whose cause I could then investigate.
The command "cm_CompareDirsWithSubdirs" only highlights dirs that are missing or differently named in one of the two sets, irrespective of the number of bytes they contain, and gives no indication of differences in content. Two sets with the same dir names are therefore treated as identical even if one or more of their dirs contain a different number of files, or different versions of them, compared with the other set.
The sad story:
There may be an easy way of doing this, but I've been too dumb to find it. Instead, what I have been doing since version 3 of TC is to run "cm_CountDirContent" on both sets and VISUALLY compare the counts of each pair, scrolling down the lists. It is a very long and tedious task that gets harder and harder with increasing age and fading eyesight.
A poor alternative would be to run "cm_FileSync" without actually synchronizing (which is not desirable in my case) and just take note of the names of the "offending" dirs. But this is also tedious. The sets are HUGE and there's no function for exporting the "compare" analysis to a text file, which would involve a lot of extra retracing work anyway.
Wishful thinking:
Would it be at all possible to have a function that runs "cm_CountDirContent" on the two panels/sets and then makes an "intelligent" comparison, highlighting the differences in much the same way as "cm_CompareDirs" does for files of the same name but different size? I am only interested in sizes, not dates or other attributes.
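Until something like this exists in TC, the comparison could probably be scripted outside TC. The following is only a minimal Python sketch of the idea, not a TC feature: the function names dir_stats and compare_sets are made up for illustration, it assumes that corresponding dirs sit directly under the two set roots with exactly the same name (same upper/lower case), and it counts every file below each dir recursively.

```python
import os


def dir_stats(path):
    """Return (file_count, total_bytes) for all files below path, recursively."""
    count = 0
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
                count += 1
            except OSError:
                # Assumption: unreadable files are simply skipped.
                pass
    return count, total


def compare_sets(set_a, set_b):
    """Compare same-named top-level dirs of two sets (hypothetical helper).

    Returns a sorted list of (name, stats_a, stats_b) for every dir whose
    file count or total size differs. stats is None when the dir is
    missing from that set.
    """
    def top_dirs(base):
        return {entry.name for entry in os.scandir(base) if entry.is_dir()}

    names_a = top_dirs(set_a)
    names_b = top_dirs(set_b)
    report = []
    for name in sorted(names_a | names_b):
        a = dir_stats(os.path.join(set_a, name)) if name in names_a else None
        b = dir_stats(os.path.join(set_b, name)) if name in names_b else None
        if a != b:
            report.append((name, a, b))
    return report
```

Calling compare_sets with the two set roots returns only the dirs that differ, so the result can be printed or written to a text file line by line. Note that equal counts and sizes do not prove the files are identical; they only mean no discrepancy shows up at this level.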
Thanks a lot for bearing with me all the way through this long post.