stylometry topic
scattertext
Beautiful visualizations of how language differs among document types.
JGAAP
The Java Graphical Authorship Attribution Program
PASTEL
Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas"
gcj-dataset
Collected solutions from Google Code Jam programming competition (2008-2020).
doxer
Stylometric Data Mining Library with a focus on identifying Satoshi Nakamoto as a case study.
faststylometry
Stylometry library for Burrows' Delta method
source2vec
Source code embeddings for various programming languages