The idiolect R package is designed to provide a comprehensive suite of tools for performing comparative authorship analysis within a forensic context using the Likelihood Ratio Framework (e.g. Ishihara 2021; Nini 2023). The package contains a set of authorship analysis functions that take a set of texts as input and output scores that can then be calibrated into likelihood ratios. The package is dependent on quanteda (Benoit et al. 2018) for all Natural Language Processing functions.
You can install idiolect from CRAN:
install.packages("idiolect")
The main functions contained in the package reflect the typical workflow for authorship analysis for forensic problems:
1. Input data using create_corpus();
2. Optionally mask the content/topic of the texts using contentmask();
3. Launch an analysis (e.g. delta(), ngram_tracing(), impostors());
4. Test the performance of the method on ground truth data using performance();
5. Finally, apply the method to the questioned text and generate a likelihood ratio with calibrate_LLR().
Check the website and the vignette for examples.
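To give a rough idea of what the final calibration step means, the base R sketch below turns a raw similarity score into a log-likelihood ratio using logistic regression, a common calibration technique. It is an illustration of the concept only and should not be read as the implementation behind calibrate_LLR(). The scores are simulated toy data, and the names validation, score_to_llr and llr are hypothetical, introduced here for the example.

```r
# Toy illustration only: simulated scores, not output from idiolect.
set.seed(42)

# Hypothetical validation scores with known ground truth.
same_author <- rnorm(200, mean = 1, sd = 1)   # same-author comparisons
diff_author <- rnorm(200, mean = -1, sd = 1)  # different-author comparisons

validation <- data.frame(
  score  = c(same_author, diff_author),
  target = rep(c(1, 0), each = 200)  # 1 = same author, 0 = different author
)

# Logistic regression maps a raw score to posterior log-odds.
fit <- glm(target ~ score, data = validation, family = binomial())

# Subtract the prior log-odds of the validation set so that the result is a
# likelihood ratio rather than posterior odds (zero here: balanced classes).
prior_log_odds <- log(sum(validation$target == 1) / sum(validation$target == 0))

score_to_llr <- function(score) {
  log_odds <- predict(fit, newdata = data.frame(score = score), type = "link")
  unname((log_odds - prior_log_odds) / log(10))  # natural log to log10
}

# Hypothetical score obtained for a questioned text.
llr <- score_to_llr(1.5)
print(llr)
```

A positive log-likelihood ratio supports the same-author hypothesis and a negative one supports the different-author hypothesis. In practice the validation scores would come from running one of the analysis functions on ground truth data, not from a random number generator.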