Adventure Game Studio

Community => General Discussion => Topic started by: Atelier on Tue 28/04/2015 14:16:17

Title: Finding similarity between two bodies of text
Post by: Atelier on Tue 28/04/2015 14:16:17
Anybody know how I can search for the percentage difference between two large bodies of text?

I basically need to find out if they are 100% identical - without painstakingly reading them manually of course.


PS. Unsure whether this is fit for Rumpus Room or General Discussion.
Title: Re: Finding similarity between two bodies of text
Post by: cat on Tue 28/04/2015 16:52:17
You are looking for a diff tool I guess?

A quick search brings up this tool: https://www.diffchecker.com/diff
Title: Re: Finding similarity between two bodies of text
Post by: Crimson Wizard on Tue 28/04/2015 17:16:34
If the text is in files on your disk, there is a good bunch of text comparison tools.

To name a couple of favourites:
WinMerge (http://winmerge.org/)
KDiff3 (http://kdiff3.sourceforge.net/)
Title: Re: Finding similarity between two bodies of text
Post by: Atelier on Tue 28/04/2015 17:28:28
That's brilliant, thanks cat and Crimson :-D

I did google it but was obviously dumb in my choice of search term.
Title: Re: Finding similarity between two bodies of text
Post by: Monsieur OUXX on Wed 29/04/2015 08:51:49
The tools named previously are great, but if you want to stick to a general-purpose tool without installing an extra application, then use Notepad++ (note the "++"). You should already have it on your computer, like every civilized person on this planet. Then simply add the Compare plugin (http://sourceforge.net/projects/npp-plugins/files/). You're done.
Title: Re: Finding similarity between two bodies of text
Post by: Gurok on Wed 29/04/2015 12:41:58
Pish pash! Civilised people use AGS and implement this algorithm:

http://en.wikipedia.org/wiki/Levenshtein_distance#Iterative_with_two_matrix_rows
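
For anyone curious what that looks like outside AGS, here is a rough Python sketch of the two-row approach. The function names and the percentage formula (edit distance divided by the length of the longer text) are my own assumptions for illustration, not something taken from the linked article. It also runs in time proportional to the product of the two lengths, so it will probably be slow on really large texts.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b, keeping only two matrix rows in memory."""
    if a == b:
        return 0
    if len(a) < len(b):
        a, b = b, a  # make b the shorter string so the rows stay small
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty prefix of b
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute (free if the chars match)
            ))
        previous = current
    return previous[-1]


def similarity_percent(a: str, b: str) -> float:
    """Hypothetical helper: 100.0 means identical (assumed normalisation)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0  # two empty texts count as identical
    return 100.0 * (1 - levenshtein(a, b) / longest)


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
    print(similarity_percent("abcd", "abcf"))
```

If all you need is a yes/no answer to "are they 100% identical", a plain `a == b` comparison is enough; the distance only earns its keep when you want to know how far apart the two texts are.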