Package: textreuse
Type: Package
Title: Detect Text Reuse and Document Similarity
Version: 0.1.5
Date: 2020-05-14
Authors@R: c(person("Lincoln", "Mullen", role = c("aut", "cre"),
    email = "lincoln@lincolnmullen.com", comment = c(ORCID = "0000-0001-5103-6917")))
Description: Tools for measuring similarity among documents and detecting
    passages which have been reused. Implements shingled n-gram, skip n-gram,
    and other tokenizers; similarity/dissimilarity functions; pairwise
    comparisons; minhash and locality sensitive hashing algorithms; and a
    version of the Smith-Waterman local alignment algorithm suitable for
    natural language.
License: MIT + file LICENSE
LazyData: TRUE
URL: https://docs.ropensci.org/textreuse,
        https://github.com/ropensci/textreuse
BugReports: https://github.com/ropensci/textreuse/issues
VignetteBuilder: knitr
Depends: R (>= 3.1.1)
Imports: assertthat (>= 0.1), digest (>= 0.6.8), dplyr (>= 0.8.0), NLP
        (>= 0.1.8), Rcpp (>= 0.12.0), RcppProgress (>= 0.1), stringr
        (>= 1.0.0), tibble (>= 3.0.1), tidyr (>= 0.3.1)
Suggests: testthat (>= 0.11.0), knitr (>= 1.11), rmarkdown (>= 0.8),
        covr
LinkingTo: BH, Rcpp, RcppProgress
RoxygenNote: 7.1.0
Encoding: UTF-8
NeedsCompilation: yes
Packaged: 2020-05-15 14:43:54 UTC; lmullen
Author: Lincoln Mullen [aut, cre] (<https://orcid.org/0000-0001-5103-6917>)
Maintainer: Lincoln Mullen <lincoln@lincolnmullen.com>
Repository: CRAN
Date/Publication: 2020-05-15 15:50:02 UTC
Built: R 4.6.0; x86_64-apple-darwin20; 2025-08-20 04:30:29 UTC; unix
Archs: textreuse.so.dSYM
