Tools for Language Data Analysis
Calculate the dispersion measure for a term-document matrix
Calculate the dispersion measure
Calculate the dispersion measure for a term-document matrix
Calculate the dispersion measure
Calculate Gries's deviation of proportions for a term-document mat...
Calculate Gries's deviation of proportions
Calculate the dispersion measure 'range' for a term-document matrix
Calculate the dispersion measure 'range'
Calculate the dispersion measure for a term-document matrix
Calculate the dispersion measure
Calculate parts-based dispersion measures for a term-document matrix
Calculate parts-based dispersion measures
Find the maximally dispersed distribution of each item in a term-docum...
Find the maximally dispersed distribution of an item across corpus par...
Find the minimally dispersed distribution of each item in a term-docum...
Find the minimally dispersed distribution of an item across corpus par...
Support functions and datasets to facilitate the analysis of linguistic data. The current focus is on the calculation of corpus-linguistic dispersion measures as described in Gries (2021) <doi:10.1007/978-3-030-46216-1_5> and Soenning (2025) <doi:10.3366/cor.2025.0326>. The most commonly used parts-based indices are implemented, including different formulas and modifications that are found in the literature, with the additional option to obtain frequency-adjusted scores. Dispersion scores can be computed based on individual count variables or a term-document matrix.