text R package [Documentation]

find_textrpp_env

Find text required python packages env

textCentrality

Semantic similarity score between single words' and an aggregated word...

textCentralityPlot

Plots words from textCentrality()

textClean

Cleans text from standard personal information

textCleanNonASCII

Clean non-ASCII characters

textDescriptives

Compute descriptive statistics of character variables.

textDiagnostics

Run diagnostics for the text package

textDimName

Change dimension names

textDistance

Semantic distance

textDistanceMatrix

Semantic distance across multiple word embeddings

textDistanceNorm

Semantic distance between a text variable and a word norm

textDomainCompare

Compare two language domains

textEmbed

textEmbed() extracts layers and aggregate them to word embeddings, for...

textEmbedLayerAggregation

Aggregate layers

textEmbedRawLayers

Extract layers of hidden states

textEmbedReduce

Pre-trained dimension reduction (experimental)

textEmbedStatic

Apply static word embeddings

textExamples

Identify language examples.

textFindNonASCII

Detect non-ASCII characters

textFineTuneDomain

Domain Adapted Pre-Training (EXPERIMENTAL - under development)

textFineTuneTask

Task Adapted Pre-Training (EXPERIMENTAL - under development)

textGeneration

Text generation

textLBAM

The LBAM library

textModelLayers

Number of layers

textModels

Check downloaded, available models.

textModelsRemove

Delete a specified model

textNER

Named Entity Recognition. (experimental)

textPCA

textPCA()

textPCAPlot

textPlot

Plot words

textPredict

textPredict, textAssess and textClassify

textPredictAll

Predict from several models, selecting the correct input

textPredictTest

Significance testing for model prediction performance

textProjection

Supervised Dimension Projection

textProjectionPlot

Plot Supervised Dimension Projection

textQA

Question Answering. (experimental)

textrpp_initialize

Initialize text required python packages

textrpp_install

Install text required python packages in conda or virtualenv environme...

textrpp_uninstall

Uninstall textrpp conda environment

textSimilarity

Semantic Similarity

textSimilarityMatrix

Semantic similarity across multiple word embeddings

textSimilarityNorm

Semantic similarity between a text variable and a word norm

textSum

Summarize texts. (experimental)

textTokenize

Tokenize text-variables

textTokenizeAndCount

Tokenize and count

textTopics

BERTopics

textTopicsReduce

textTopicsReduce (EXPERIMENTAL)

textTopicsTest

Wrapper for topicsTest function from the topics package

textTopicsTree

textTopicsTest (EXPERIMENTAL) to get the hierarchical topic tree

textTopicsWordcloud

Plot word clouds

textTrain

Trains word embeddings

textTrainLists

Train lists of word embeddings

textTrainN

Cross-validated accuracies across sample-sizes

textTrainNPlot

Plot cross-validated accuracies across sample sizes

textTrainRandomForest

Trains word embeddings usig random forest

textTrainRegression

Train word embeddings to a numeric variable.

textTranslate

Translation. (experimental)

textZeroShot

Zero Shot Classification (Experimental)

Download source package Read PDF manual

Link R with Transformers from Hugging Face to transform text variables to word embeddings; where the word embeddings are used to statistically test the mean difference between set of texts, compute semantic similarity scores between texts, predict numerical variables, and visual statistically significant words according to various dimensions etc. For more information see <https://www.r-text.org>.

Maintainer: Oscar Kjell
License: GPL-3
Last published: 2025-09-01

Useful links

text1.7.0 package

Functions

Readme

Datasets

Dependencies

Imports

Versions

News