Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity
Readability: Automated Readability Index (ARI)
List available language packages
Readability: Bormuth's Mean Cloze and Grade Placement
Lexical diversity: Herdan's C
Transform text into cloze test format
Readability: Coleman-Liau Index
Readability: Coleman's Formulas
Methods to correct koRpus objects
Transform text into C-Test-like format
Lexical diversity: Carroll's corrected TTR (CTTR)
Readability: Dale-Chall Readability Formula
Readability: Danielson-Bryan
Readability: Dickes-Steiwer Handformel
Generate a document-term matrix
Readability: Degrees of Reading Power (DRP)
Readability: Fang's Easy Listening Formula (ELF)
Readability: Farr-Jenkins-Paterson Index
Remove word classes
Readability: Flesch-Kincaid Grade Level
Readability: Flesch Readability Ease
Readability: Gunning FOG Index
Readability: FORCAST Index
Analyze word frequencies
Readability: Fucks' Stilcharakteristik
Get koRpus session settings
Guess language a text is written in
Readability: Gutiérrez Fórmula de comprensibilidad
Readability: Harris-Jacobson indices
Lexical diversity: HD-D (vocd-d)
Automatic hyphenation
Install language support packages
Produce jumbled words
Lexical diversity: Yule's K
Deprecated object classes
tools:::Rd_package_title("koRpus")
Work in (early) progress. Probably don't even look at it. Consider it ...
S4 Class kRp.corp.freq
S4 Class kRp.lang
Get elaborated word tag definitions
S4 Class kRp.readability
S4 Class kRp.text
Getter/setter methods for koRpus objects
S4 Class kRp.TTR
Analyze lexical diversity
Calculate lexical diversity
Readability: Linsear Write Index
Readability: Bj"ornsson's L"asbarhetsindex (LIX)
Lexical diversity: Maas' indices
Lexical diversity: Moving-Average Type-Token Ratio (MATTR)
Lexical diversity: Mean Segmental Type-Token Ratio (MSTTR)
Lexical diversity: Measure of Textual Lexical Diversity (MTLD)
Readability: Neue Wiener Sachtextformeln
Paste koRpus objects
Plot method for objects of class kRp.text
A method to get information out of koRpus objects
Lexical diversity: Guiraud's R
Import BAWL-R data
Import Celex data
Import custom corpus data
Import LCC data
Measure readability
Calculate readability
Import already tagged texts
Readability: Anderson's Readability Index (RIX)
Lexical diversity: Summer's S
A function to optimize MSTTR segment sizes
A function to set information on your koRpus environment
Add support for new languages
Show methods for koRpus objects
Readability: Simple Measure of Gobbledygook (SMOG)
Readability: Spache Formula
Turn a multi-document kRp.text object into a list of kRp.text objects
Readability: Strain Index
Summary methods for koRpus objects
Extract text features for authorship analysis
Letter case transformation
A simple tokenizer
Readability: Traenkle-Bailer Formeln
A method to call TreeTagger
Readability: Kuntzsch's Text-Redundanz-Index
Lexical diversity: Type-Token Ratio
Readability: Tuldava's Text Difficulty Formula
Get types and tokens of a given text
Lexical diversity: Uber Index (U)
Readability: Wheeler-Smith Score
A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Note: For full functionality a local installation of TreeTagger is recommended. It is also recommended to not load this package directly, but by loading one of the available language support packages from the 'l10n' repository <https://undocumeantit.github.io/repos/l10n/>. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list (<https://korpusml.reaktanz.de>).