Stylometric Multivariate Analyses
Assign colors to samples
Change character encoding
Check character encoding in corpus folder
Machine-learning supervised classification
Function to Perform Cross-Validation
Define area for scatterplots
Delete HTML or XML tags
Exclude stop words (e.g. pronouns, particles, etc.) from a dataset
Cosine Distance
Delta Distance
Entropy Distance
Min-Max Distance (aka Ruzicka Distance)
Cosine Distance
Cosine Delta Distance (aka Wurzburg Distance)
GUI for the function classify
GUI for the function oppose
GUI for stylo
Tuning Parameters for the Imposters Method
Authorship Verification Classifier Known as the Imposters Method
Load text files and perform pre-processing
Load text files
Make List of the Most Frequent Elements (e.g. Words)
Make text n-grams
Split text to samples
Prepare a table of (relative) word frequencies
Contrastive analysis of texts
Perform pre-processing (tokenization, n-gram extracting, etc.)
Extract POS-tags or Words from Annotated Corpora
Exclude variables (e.g. words, n-grams) from a frequency table that ar...
Distance-based classifier
An Authorship Verification Classifier Known as the Impostors Method. A...
k-Nearest Neighbor classifier
Naive Bayes classifier
Nearest Shrunken Centroids classifier
Support Vector Machines classifier
Accuracy, Precision, Recall, and the F Measure
Plot Classification Accuracy for Short Text Samples
Sequential machine-learning classification
Sequential stylometric analysis
Determining Minimal Sample Size for Text Classification
Setting variables for the package stylo
Bootstrap consensus networks, with D3 visualization
List of pronouns
Stylometric multidimensional analyses
Split string of words or other countable features
Split text into words: extended version
Split text into words
Compare two subcorpora using a home-brew variant of Craig's Zeta
Compare two subcorpora using Craig's Zeta
Compare two subcorpora using Eder's Zeta
Supervised and unsupervised multivariate methods, supplemented by GUI and some visualizations, to perform various analyses in the field of computational stylistics, authorship attribution, etc. For further reference, see Eder et al. (2016), <https://journal.r-project.org/archive/2016/RJ-2016-007/index.html>. You are also encouraged to visit the Computational Stylistics Group's website <https://computationalstylistics.github.io/>, where a reasonable amount of information about the package and related projects are provided.