DIFF program capable of comparing a large number of texts

Support for our DiffMerge utility.

Moderator: SourceGear

mfss
Posts: 1
Joined: Wed Feb 16, 2011 9:49 am

Post by mfss » Wed Feb 16, 2011 9:51 am

Hi
We are currently writing our master's thesis and are in great need of a specific program. We have no idea whether it actually exists, but we are looking for a program that can compare about 500 texts at once, find the differences and similarities between them, and report on these cross-references. In a dream scenario, it would also give the percentage by which the texts differ.
We are aware that some diff programs can compare two texts, but we really need one that can handle more than that.

What we want to do is compare annual reports to find out whether companies "borrow" text from each other when writing them.
Therefore, we need the program to take one text at a time and compare it to the others (approx. 500) to find overlaps between them.
This is much like when universities compare exam theses with old exam theses to see whether there has been any "copy/paste".

It's preferable if the program can be installed on a Windows computer, and even better if it can load *.pdf files.


Hope that you – or someone else – are able to help us.

Beth
Posts: 8550
Joined: Wed Jun 21, 2006 8:24 pm
Location: SourceGear
Contact:

Re: DIFF program capable of comparing a large number of text

Post by Beth » Wed Feb 16, 2011 11:42 am

Our Diff program can compare either two text files or two folders. In your case, you would compare two folders to identify which items differ. If an item is a text file, you can double-click it in the folder comparison to go straight to the file comparison for that item.

We can't show file compares on binary files, which is what a .pdf is.

I think you are looking for more details than we can provide, but since our Diff program is free, you can download it and use it without having to incur any cost.
Beth Kieler
SourceGear Technical Support
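
For anyone with the same need: the one-against-all comparison described in the question can be sketched with a short script. The Python below is only an illustration and is not a DiffMerge feature. It assumes the reports have already been converted to plain text (PDF extraction needs a separate tool and is not shown), and it swaps in a simple technique, the share of common 5-word sequences (Jaccard overlap), as the "percentage". The function names and the default sequence length are hypothetical choices, not anything from the thread.

```python
from itertools import combinations
from pathlib import Path


def shingles(text, n=5):
    """Return the set of overlapping n-word sequences found in text."""
    words = text.lower().split()
    if not words:
        return set()
    if len(words) < n:
        # Text shorter than n words: treat the whole text as one sequence.
        return {tuple(words)}
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a, b):
    """Share of sequences common to both sets, between 0.0 and 1.0."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def compare_all(texts, n=5):
    """Compare every pair of texts.

    texts maps a name to its text. Returns a list of
    (overlap_percent, name_a, name_b), highest overlap first.
    """
    sets = {name: shingles(text, n) for name, text in texts.items()}
    results = [
        (100.0 * jaccard(sets[a], sets[b]), a, b)
        for a, b in combinations(sorted(sets), 2)
    ]
    results.sort(reverse=True)
    return results


def load_texts(folder):
    """Read every *.txt file in folder (assumed to be UTF-8 plain text)."""
    return {
        path.name: path.read_text(encoding="utf-8", errors="replace")
        for path in Path(folder).glob("*.txt")
    }
```

Calling compare_all(load_texts("reports")) on a hypothetical folder named "reports" returns the pairs with the most shared wording first. For 500 texts that is 124,750 pairs, which is slow but workable for a one-off analysis. A longer sequence length reports only longer copied passages, while a shorter one also picks up common stock phrases.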
