Graphical Integrated Text Mining Solution
Correspondence analysis helper functions
Show term co-occurrences
Correspondence analysis from a tm corpus
Hierarchical clustering of a tm corpus
Cross-Dissimilarity Table
Cut hierarchical clustering tree into clusters
Documents/Variables Dissimilarity Table
List most frequent terms of a corpus
List most frequent terms of a corpus
Class "GDf"
Import a corpus and process it
Inspect corpus
Output results to HTML file
Plotting 2D maps in correspondence analysis of a corpus
Recode Date/Time Variable
Select or exclude terms
Correspondence analysis from a tm corpus
Set corpus variables
Save the name of the last table and give it a title
Show a correspondence analysis from a tm corpus
List terms specific to a document or level
List terms specific to a document or level
Subset Corpus by Terms
Subset Corpus by Levels of a Variable
Show co-occurring terms
Term frequencies in the corpus
Frequency of chosen terms in the corpus
Dictionary of terms found in a corpus
Temporal Evolution of Occurrences
Two-way table of corpus meta-data variables
One-way table of a corpus meta-data variable
Corpus Temporal Evolution
Vocabulary Summary
Vocabulary summary table
An 'R Commander' plug-in providing an integrated solution for a series of text mining tasks: importing and cleaning a corpus, and running analyses such as term and document counts, vocabulary tables, term co-occurrences and document similarity measures, time series analysis, correspondence analysis and hierarchical clustering. Corpora can be imported from spreadsheet-like files, directories of raw text files and 'Twitter' queries, as well as from 'Dow Jones Factiva', 'LexisNexis', 'Europresse' and 'Alceste' files.