stylo0.7.5 package

Stylometric Multivariate Analyses

assign.plot.colors

Assign colors to samples

change.encoding

Change character encoding

check.encoding

Check character encoding in corpus folder

classify

Machine-learning supervised classification

crossv

Function to Perform Cross-Validation

define.plot.area

Define area for scatterplots

delete.markup

Delete HTML or XML tags

delete.stop.words

Exclude stop words (e.g. pronouns, particles, etc.) from a dataset

dist.cosine

Cosine Distance

dist.delta

Delta Distance

dist.entropy

Entropy Distance

dist.minmax

Min-Max Distance (aka Ruzicka Distance)

dist.simple

Cosine Distance

dist.wurzburg

Cosine Delta Distance (aka Wurzburg Distance)

gui.classify

GUI for the function classify

gui.oppose

GUI for the function oppose

gui.stylo

GUI for stylo

imposters.optimize

Tuning Parameters for the Imposters Method

imposters

Authorship Verification Classifier Known as the Imposters Method

load.corpus.and.parse

Load text files and perform pre-processing

load.corpus

Load text files

make.frequency.list

Make List of the Most Frequent Elements (e.g. Words)

make.ngrams

Make text n-grams

make.samples

Split text to samples

make.table.of.frequencies

Prepare a table of (relative) word frequencies

oppose

Contrastive analysis of texts

parse.corpus

Perform pre-processing (tokenization, n-gram extracting, etc.)

parse.pos.tags

Extract POS-tags or Words from Annotated Corpora

perform.culling

Exclude variables (e.g. words, n-grams) from a frequency table that ar...

perform.delta

Distance-based classifier

perform.impostors

An Authorship Verification Classifier Known as the Impostors Method. A...

perform.knn

k-Nearest Neighbor classifier

perform.naivebayes

Naive Bayes classifier

perform.nsc

Nearest Shrunken Centroids classifier

perform.svm

Support Vector Machines classifier

performance.measures

Accuracy, Precision, Recall, and the F Measure

plot.sample.size

Plot Classification Accuracy for Short Text Samples

rolling.classify

Sequential machine-learning classification

rolling.delta

Sequential stylometric analysis

samplesize.penalize

Determining Minimal Sample Size for Text Classification

stylo.default.settings

Setting variables for the package stylo

stylo.network

Bootstrap consensus networks, with D3 visualization

stylo.pronouns

List of pronouns

stylo

Stylometric multidimensional analyses

txt.to.features

Split string of words or other countable features

txt.to.words.ext

Split text into words: extended version

txt.to.words

Split text into words

zeta.chisquare

Compare two subcorpora using a home-brew variant of Craig's Zeta

zeta.craig

Compare two subcorpora using Craig's Zeta

zeta.eder

Compare two subcorpora using Eder's Zeta

Supervised and unsupervised multivariate methods, supplemented by GUI and some visualizations, to perform various analyses in the field of computational stylistics, authorship attribution, etc. For further reference, see Eder et al. (2016), <https://journal.r-project.org/archive/2016/RJ-2016-007/index.html>. You are also encouraged to visit the Computational Stylistics Group's website <https://computationalstylistics.github.io/>, where a reasonable amount of information about the package and related projects are provided.