Looking for a text editing sofware (duplicate line finder).

August 6th, 2016

hi, i need some help finding a software that can cross check every line in a text file for similar lines. i don’t mean an exact copy of the line,but very similar. i basically want to check if 2 lines have more then X amount words in common. i want it to be able to ignore simple grammar mistakes like missing commas or apostrophes, capitals, punctuation ect… here is an example of 2 lines that should be tagged as duplicates:
“Did you know that the Basenji is the only dog in the world which does not bark?”
“The Basenji is the only dog which does not bark.”
they software would be similar to “dupli find” but obviously able to do what i said.

Answer #1
I think notepad++ does this by default.
If you highlight something it automatically highlights every other instance of it in the file.
Answer #2
I think notepad++ does this by default.
If you highlight something it automatically highlights every other instance of it in the file.

I can confirm that Notepad++ does do this! It is a brilliant piece of kit, I’d highly recommend it to anyone. It can do almost anything you can imagine that you’d want to do with a text file.
http://notepad-plus-plus.org/
Answer #3
Notepad++ has this feature
Answer #4
and , NP++ is FREE
Answer #5
UltraEdit
Answer #6
thanks for the replies, i ended up using ultraedit. someone on their forums route a script for me.
could not figure out how to do with notepad++.

 

| Sitemap |