Bindings to Google's C++ library Compact Language Detector 2 (see <https://github.com/cld2owners/cld2#readme> for more information). Probabilistically detects over 80 languages in plain text or HTML. For mixed-language input it returns the top three detected languages and their approximate proportion of the total classified text bytes (e.g. 80% English and 20% French out of 1000 bytes). There is also a 'cld3' package on CRAN which uses a neural network model instead.
Version: | 1.2.5 |
Imports: | Rcpp |
LinkingTo: | Rcpp |
Suggests: | testthat, readtext, cld3 |
Published: | 2024-10-04 |
DOI: | 10.32614/CRAN.package.cld2 |
Author: | Jeroen Ooms [aut, cre], Dirk Sites [cph] (Author of CLD2 C++ library) |
Maintainer: | Jeroen Ooms <jeroenooms at gmail.com> |
BugReports: | https://github.com/ropensci/cld2/issues |
License: | Apache License 2.0 |
URL: | https://docs.ropensci.org/cld2/ https://ropensci.r-universe.dev/cld2 |
NeedsCompilation: | yes |
Materials: | NEWS |
CRAN checks: | cld2 results |
Reference manual: | cld2.pdf |
Package source: | cld2_1.2.5.tar.gz |
Windows binaries: | r-devel: cld2_1.2.5.zip, r-release: cld2_1.2.5.zip, r-oldrel: cld2_1.2.5.zip |
macOS binaries: | r-release (arm64): cld2_1.2.5.tgz, r-oldrel (arm64): cld2_1.2.5.tgz, r-release (x86_64): cld2_1.2.5.tgz, r-oldrel (x86_64): cld2_1.2.5.tgz |
Old sources: | cld2 archive |
Reverse imports: | Goodreader, labourR |
Reverse suggests: | bdpar, cld3, emayili, googleLanguageR, morestopwords |
Please use the canonical form https://CRAN.R-project.org/package=cld2 to link to this page.