Pretty similar
Normal checksums are used to identify "identical" files. The ssdeep tool is used for finding files that are similar in terms of content.
Paul Cowan, 123RF.com
Checksums are a fine thing: Once created and saved, they make it possible to quickly detect even the slightest change to a "hashed" file. They are often used for system integrity checks and when installing new packages. To verify a file, you recalculate its checksum and compare the result with the saved value; any mismatch means the content has changed.
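The create-then-verify cycle can be sketched roughly as follows. This is only an illustration, assuming the sha256sum tool from GNU coreutils is installed; the file names demo.txt and demo.sha256 are made up for the example.

```shell
#!/bin/sh
# Sketch: detect a change to a file by means of a saved SHA-256 checksum.
# demo.txt and demo.sha256 are hypothetical names used only in this example.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

printf 'hello world\n' > demo.txt

# Calculate the checksum once and save it.
sha256sum demo.txt > demo.sha256

# Later: recalculate and compare with the saved value (exit status 0 = match).
if sha256sum -c demo.sha256 > /dev/null 2>&1; then
    before=unchanged
else
    before=changed
fi

# Modify a single character; the comparison now fails.
printf 'hello World\n' > demo.txt
if sha256sum -c demo.sha256 > /dev/null 2>&1; then
    after=unchanged
else
    after=changed
fi

echo "before edit: $before"
echo "after edit:  $after"
```

Even the change from a lowercase to an uppercase letter is enough to make the check fail, which is exactly why a conventional checksum says nothing about how similar two files are.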
However, in everyday use, there are many cases where what matters is not exact equality but similarity. Think, for example, of different versions of a document, an image, source code, or a compiled program. In these cases, most parts of the files will be identical, and differences will exist only in a few places.
Here, similarity is a measure of the changes made to the files. Different versions of a file often differ by only a few bytes. With plain text documents, you can still work this out to a certain extent with standard tools such as wc, uniq, sort, and tr: Split the text into its individual words (tr ',.: ' '\n'), sort them (| sort |), and then display the frequency of each word with uniq. This works, for example, as follows:
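One way such a pipeline might look is sketched below. It is an illustration under assumptions, not the article's own listing: the function name word_freq and the sample text are invented, and the grep step for dropping empty lines is an addition to the commands named above.

```shell
#!/bin/sh
# word_freq: list the words of a text file with their frequencies,
# most frequent word first. Name and sample text are illustrative only.
word_freq() {
    # 1. tr turns the separators (comma, period, colon, space) into newlines
    # 2. grep -v '^$' drops the empty lines left by adjacent separators
    # 3. sort groups identical words so that uniq can count them
    # 4. uniq -c prefixes each distinct word with its number of occurrences
    # 5. sort -rn puts the highest counts at the top
    tr ',.: ' '\n' < "$1" | grep -v '^$' | sort | uniq -c | sort -rn
}

sample=$(mktemp)
printf 'the cat sat. the dog sat, the end\n' > "$sample"
word_freq "$sample"
```

For the sample text, the word "the" appears at the top of the list with a count of 3. Running the function on two versions of a document and comparing the two lists with diff gives a rough impression of how far the versions have drifted apart.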
[...]
© 2025 Linux New Media USA, LLC – Legal Notice