RTextTools1.4.3 package

Automatic Text Classification via Supervised Learning

analytics-class

an S4 class containing the analytics for a classified set of documents...

analytics_virgin-class

an S4 class containing the analytics for a classified set of documents...

as.compressed.matrix

converts a tm DocumentTermMatrix or TermDocumentMatrix into a matrix.c...

classify_model

makes predictions from a train_model() object.

classify_models

makes predictions from a train_models() object.

create_analytics

creates an object of class analytics given classification results.

create_container

creates a container for training, classifying, and analyzing documents...

create_ensembleSummary

creates a summary with ensemble coverage and precision.

create_matrix

creates a document-term matrix to be passed into create_container().

create_precisionRecallSummary

creates a summary with precision, recall, and F1 scores.

create_scoreSummary

creates a summary with the best label for each document.

cross_validate

used for cross-validation of various algorithms.

getStemLanguages

Query the languages supported in this package

matrix_container-class

an S4 class containing the training and classification matrices.

NYTimes

a sample dataset containing labeled headlines from The New York Times.

print_algorithms

prints available algorithms for train_model() and train_models().

read_data

reads data from files into an R data frame.

recall_accuracy

calculates the recall accuracy of the classified data.

summary.analytics

summarizes the analytics-class class

summary.analytics_virgin

summarizes the analytics_virgin-class class

train_model

makes a model object using the specified algorithm.

train_models

makes a model object using the specified algorithms.

USCongress

a sample dataset containing labeled bills from the United State Congre...

wordStem

Get the common root/stem of words

A machine learning package for automatic text classification that makes it simple for novice users to get started with machine learning, while allowing experienced users to easily experiment with different settings and algorithm combinations. The package includes eight algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks), comprehensive analytics, and thorough documentation.