HIGHLIGHTING DUPLICATES BY TEXT SIMILARITY