I’m looking for a duplicate/similarity checker against a custom set of documents. This is possibly like a plagiarism checker, but with a custom reference (instead of everything that exists).
But I could not find a solution that can be selfhosted, and have some simple UI and capabilities like Turnitin. Any suggestions?
Thanks’
Grep. If you have the sources digitally and local, you can just run a search for a string.