A trigram-based tool for detecting similarity in groups of text documents or program code.
https://peterlane.codeberg.page/ferret/
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
|
6 months ago | |
---|---|---|
chez | 6 months ago | |
r7rs | 6 months ago | |
LICENSE.txt | 6 months ago | |
README.adoc | 6 months ago | |
uhferret2.xsl | 6 months ago |
README.adoc
= Ferret
Ferret is a copy-detection tool, locating duplicate text or code in
multiple text documents or source files. Ferret is designed to
detect copying ( _collusion_ ) within a given set of files.
This version is a Scheme implementation of Ferret. Two versions are given: for
practical use, the Chez Scheme version is preferable because of its execution
speed.
For more about Ferret see https://peterlane.codeberg.page/ferret/
== Chez Scheme Implementation
This version is designed for Chez Scheme.
----
> scheme.exe --libdirs . --program .\chezferret.ss
Usage: chezferret
dirname to compare all files in given directory
-g dirname use subdirectory names to group files
-x filename1 filename2 [outfile] to output an XML comparison of given filenames
----
== R7RS Implementation
This version requires an R7RS implementation of Scheme such as Gauche.
Run as:
----
> gosh -I. schemeferret.sps
Usage: schemeferret
dirname to compare all files in given directory
-g dirname use subdirectory names to group files
-x filename1 filename2 [outfile] to output an XML comparison of given filenames
----