textmineR3.0.6 package

Functions for Text Mining and Topic Modeling

CalcGamma

Calculate a matrix whose rows represent P(topic_i|tokens)

CalcHellingerDist

Calculate Hellinger Distance

CalcJSDivergence

Calculate Jensen-Shannon Divergence

CalcLikelihood

Calculate the log likelihood of a document term matrix given a topic m...

CalcProbCoherence

Probabilistic coherence of topics

CalcTopicModelR2

Calculate the R-squared of a topic model.

Cluster2TopicModel

Represent a document clustering as a topic model

CreateDtm

Convert a character vector to a document term matrix.

CreateTcm

Convert a character vector to a term co-occurrence matrix.

Dtm2Docs

Convert a DTM to a Character Vector of documents

Dtm2Lexicon

Turn a document term matrix into a list for LDA Gibbs sampling

Dtm2Tcm

Turn a document term matrix into a term co-occurrence matrix

FitCtmModel

Fit a Correlated Topic Model

FitLdaModel

Fit a Latent Dirichlet Allocation topic model

FitLsaModel

Fit a topic model using Latent Semantic Analysis

GetProbableTerms

Get cluster labels using a "more probable" method of terms

GetTopTerms

Get Top Terms for each topic from a topic model

InternalFunctions

Internal helper functions for textmineR

LabelTopics

Get some topic labels using a "more probable" method of terms

nih

Abstracts and metadata from NIH research grants awarded in 2014

posterior.lda_topic_model

Draw from the posterior of an LDA topic model

posterior

Posterior methods for topic models

predict.ctm_topic_model

Predict method for Correlated topic models (CTM)

predict.lda_topic_model

Get predictions from a Latent Dirichlet Allocation model

predict.lsa_topic_model

Predict method for LSA topic models

SummarizeTopics

Summarize topics in a topic model

TermDocFreq

Get term frequencies and document frequencies from a document term mat...

textmineR-deprecated

Deprecated functions in package textmineR.

textmineR

textmineR

TmParallelApply

An OS-independent parallel version of lapply

update.lda_topic_model

Update a Latent Dirichlet Allocation topic model with new data

update

Update methods for topic models

An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly-formatted input and give similarly-formatted output. Has additional functionality for analyzing and diagnostics for topic models.

  • Maintainer: Tommy Jones
  • License: MIT + file LICENSE
  • Last published: 2025-10-18