A Theory of Linguistic Individuality for Authorship Analysis

Data and code accompanying the monograph "A Theory of Linguistic Individuality for Authorship Analysis":

Nini, A. (2023). A Theory of Linguistic Individuality for Authorship Analysis. Elements in Forensic Linguistics. Cambridge, UK: Cambridge University Press.

The repository contains two R scripts (main_script.R and functions.R) and a zipped folder with the data sets (Data.zip).

Main Script

The three main functions are:

test_coefficients(): takes one folder of texts as input and returns the results for all the coefficients tested in the monograph, for a single combination of parameters.
calibrate.llr(): takes two result tables as input, one for the background data and one for the test data, and returns scores calibrated into log-likelihood ratios.
extract_unique_ngrams(): takes a folder of texts as input and returns the set of n-grams used by only one author in at least 2 different texts.
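
To make the criterion behind extract_unique_ngrams() concrete, the following is a minimal sketch of the idea, not the implementation in functions.R. It assumes word n-grams obtained by splitting on whitespace, and it works on a small in-memory vector of texts rather than a folder. The names texts, authors, word_ngrams() and unique_ngrams_sketch() are hypothetical and do not appear in the repository.

```r
# Hypothetical toy data: one string per text, plus a parallel vector of
# author labels (the repository's function reads texts from a folder instead).
texts <- c(
  a1 = "the cat sat on the mat",
  a2 = "the cat sat by the door",
  b1 = "a dog sat on the mat",
  b2 = "a dog ran to the door"
)
authors <- c(a1 = "A", a2 = "A", b1 = "B", b2 = "B")

# Return the distinct word n-grams of one text (assumption: whitespace tokens).
word_ngrams <- function(text, n = 2) {
  tokens <- strsplit(tolower(text), "\\s+")[[1]]
  tokens <- tokens[nzchar(tokens)]
  if (length(tokens) < n) return(character(0))
  starts <- seq_len(length(tokens) - n + 1)
  unique(vapply(
    starts,
    function(i) paste(tokens[i:(i + n - 1)], collapse = " "),
    character(1)
  ))
}

# Keep n-grams used by exactly one author in at least `min_texts` texts.
unique_ngrams_sketch <- function(texts, authors, n = 2, min_texts = 2) {
  per_text <- lapply(texts, word_ngrams, n = n)
  long <- data.frame(
    ngram = unlist(per_text, use.names = FALSE),
    author = rep(unname(authors[names(texts)]), lengths(per_text)),
    stringsAsFactors = FALSE
  )
  # Each n-gram appears at most once per text, so row counts are text counts.
  n_authors <- tapply(long$author, long$ngram, function(a) length(unique(a)))
  n_texts <- tapply(long$author, long$ngram, length)
  keep <- names(n_authors)[n_authors == 1 & n_texts >= min_texts]
  data.frame(
    ngram = keep,
    author = long$author[match(keep, long$ngram)],
    stringsAsFactors = FALSE
  )
}

result <- unique_ngrams_sketch(texts, authors)
print(result)
```

In this toy example "sat on" is excluded because both authors use it, while "the cat" is kept because only author A uses it and it occurs in two of A's texts.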

Further explanation of the functions and the values for the parameters are given in the comments in the script itself. The functions are sourced from the functions.R script.
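
For readers unfamiliar with the calibration step that calibrate.llr() is described as performing, the sketch below shows one common way to turn comparison scores into log-likelihood ratios: a logistic regression fitted on background scores and applied to test scores. This README does not state which method the repository uses, so the method, the simulated scores, and the column names score and same_author are all assumptions made for illustration.

```r
set.seed(1)

# Hypothetical background table: simulated scores in which same-author
# comparisons (same_author = 1) tend to score higher than different-author ones.
background <- data.frame(
  score = c(rnorm(200, mean = 0.6, sd = 0.15),
            rnorm(200, mean = 0.3, sd = 0.15)),
  same_author = rep(c(1, 0), each = 200)
)

# Hypothetical test table with three uncalibrated scores.
test <- data.frame(score = c(0.2, 0.45, 0.8))

# Fit the calibration model on the background data only.
model <- glm(same_author ~ score, data = background, family = binomial())

# The linear predictor gives the natural-log posterior odds for each test score.
log_posterior_odds <- predict(model, newdata = test, type = "link")

# Subtract the log prior odds implied by the background class proportions,
# leaving the natural-log likelihood ratio.
log_prior_odds <- log(sum(background$same_author == 1) /
                      sum(background$same_author == 0))

# Convert from natural log to base 10 (assumption: base-10 LLRs are wanted).
test$llr <- as.numeric(log_posterior_odds - log_prior_odds) / log(10)
print(test)
```

A positive value supports the same-author hypothesis and a negative value supports the different-author hypothesis. Fitting on the background table and predicting on the test table keeps the calibration data separate from the data being evaluated.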

Data

The two data sets used in the monograph are the English part of the refcor corpus and the c50 corpus. References to both of them are in the monograph. The Data folder contains these two corpora in their plain and POS-tagged versions.
