Install simhash by entering the following commands in the terminal:
sudo apt update sudo apt install simhash
Description:
generate similarity hashes to find nearly duplicate files
One of the questions that it's nice to be able to answer about a pair of files is the degree of similarity between them. This command-line tool is useful for estimating the "degree of similarity" between a pair of nominally sequential files such as textfiles. The tool uses Manassas's "shingleprinting" technique;
Homepage: http://wiki.cs.pdx.edu/forge/simhash.html
Version: 0.0.20150404-1
Section: universe/utils