Text Mining Package
Data Frame Source
Tokenizers
Transformations
Parallelized lapply
Inspect Objects
Term-Document Matrix
Metadata Management
Remove Words from a Text Document
Strip Whitespace from a Text Document
Combine Corpora, Documents, Term-Document Matrices, and Term Frequency...
Content Transformers
Corpora
Directory Source
Access Document IDs and Terms
Find Associations in a Term-Document Matrix
Find Frequent Terms
Find Most Frequent Terms
Read Document-Term Matrices
Permanent Corpora
Plain Text Documents
Visualize a Term-Document Matrix
Read In a Text Document from a Data Frame
Read In a MS Word Document
Readers
Read In a PDF Document
Read In a Text Document
Read In a Reuters Corpus Volume 1 Document
Read In a Reuters-21578 XML Document
Read In a POS-Tagged Word Text Document
Read In an XML Document
Remove Numbers from a Text Document
Remove Punctuation Marks from a Text Document
Remove Sparse Terms from a Term-Document Matrix
Simple Corpora
Sources
Complete Stems
Stem Words
Stopwords
Term Frequency Vector
Text Documents
Filter and Index Functions on Corpora
Transformations on Corpora
Combine Transformations
Compute Score for Matching Terms
Tokenizers
Uniform Resource Identifier Source
Volatile Corpora
Vector Source
Weight Binary
Weighting Function
Explore Corpus Term Frequency Characteristics
SMART Weightings
Weight by Term Frequency
Weight by Term Frequency - Inverse Document Frequency
Write a Corpus to Disk
XML Source
XML Text Documents
ZIP File Source
A framework for text mining applications within R.