A Simple General Purpose N-Gram Tokenizer
Wrappers to TermDocumentMatrix and DocumentTermMatrix to use n-gram to...
General purpose n-gram tokenizer
A simple n-gram tokenizer (an n-gram is a contiguous sequence of n items from a given sequence of text) for use with the 'tm' package, with no 'rJava' or 'RWeka' dependency.
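
To make the definition concrete, the following is a minimal base R sketch of word-level n-gram tokenization. It is not the package's implementation: the function name `word_ngrams` and its arguments are hypothetical and used here only for illustration, and it assumes tokens are separated by whitespace and that n is at least 1.

```r
# Hypothetical helper for illustration only; not part of the package's API.
# Returns all contiguous runs of n whitespace-separated words in `text`.
word_ngrams <- function(text, n = 2) {
  # Split on runs of whitespace and drop any empty strings.
  words <- strsplit(text, "[[:space:]]+")[[1]]
  words <- words[nzchar(words)]
  # Fewer than n words means there is no n-gram to return.
  if (length(words) < n) return(character(0))
  # Each n-gram starts at position i and spans n consecutive words.
  starts <- seq_len(length(words) - n + 1)
  vapply(
    starts,
    function(i) paste(words[i:(i + n - 1)], collapse = " "),
    character(1)
  )
}

bigrams <- word_ngrams("the quick brown fox", n = 2)
print(bigrams)
```

For the four-word input above, the bigrams are "the quick", "quick brown", and "brown fox". A function of this shape is the kind of tokenizer that 'tm' accepts when building a term matrix, which is why no Java-based dependency is needed.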