Natural Language Processing in R [CRAN Task View]

boilerpipeR

Interface to the Boilerpipe Java Library

Version 1.3.2

BTM

Biterm Topic Models for Short Text

Version 0.3.8

corpora

Statistics and Data Sets for Corpus Frequency Data

Version 0.7

text2vec

Modern Text Mining Framework for R

Version 0.6.6

tokenizers.bpe

Byte Pair Encoding Text Tokenization

Version 0.1.4

keyperm

Keyword Analysis Using Permutation Tests

Version 0.1.1

gsubfn

Utilities for Strings and Function Arguments

Version 0.7

lda

Collapsed Gibbs Sampling Methods for Topic Models

Version 1.5.2

lsa

Latent Semantic Analysis

Version 0.73.4

movMF

Mixtures of von Mises-Fisher Distributions

Version 0.2-9

mscstexta4r

R Client for the Microsoft Cognitive Services Text Analytics REST API

Version 0.1.2

openNLP

Apache OpenNLP Tools Interface

Version 0.2-7

stringdist

Approximate String Matching, Fuzzy Text Search, and String Distance Fu...

Version 0.9.17

topicmodels

Topic Models

Version 0.2-17

stm

Estimation of the Structural Topic Model

Version 1.3.8

tau

Text Analysis Utilities

Version 0.0-26

mscsweblm4r

R Client for the Microsoft Cognitive Services Web Language Model REST ...

Version 0.1.2

SnowballC

Snowball Stemmers Based on the C 'libstemmer' UTF-8 Library

Version 0.7.1

RWeka

R/Weka Interface

Version 0.4-46

textcat

N-Gram Based Text Categorization

Version 1.0-9

tokenizers

Fast, Consistent Tokenization of Natural Language Text

Version 0.3.0

topicdoc

Topic-Specific Diagnostics for LDA and CTM Topic Models

Version 0.1.1

sentometrics

An Integrated Framework for Textual Sentiment Time Series Aggregation ...

Version 1.0.1

tm.plugin.dc

Text Mining Distributed Corpus Plug-in

Version 0.2-10

textrank

Summarize Text by Ranking Sentences and Finding Keywords

Version 0.3.1

zipfR

Statistical Models for Word Frequency Distributions

Version 0.6-70

textreuse

Detect Text Reuse and Document Similarity

Version 0.1.5

tm.plugin.lexisnexis

Import Articles from 'LexisNexis' Using the 'tm' Text Mining Framework

Version 1.4.2

wordcloud

Word Clouds

Version 2.6

textir

Inverse Regression for Text Analysis

Version 2.0-5

RKEA

R/KEA Interface

Version 0.0-6

textplot

Text Plots

Version 0.2.3

qdap

Bridging the Gap Between Qualitative Data and Quantitative Analysis

Version 2.4.6.1

sentiment.ai

Simple Sentiment Analysis Using Deep Learning

Version 0.1.1

udpipe

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Pa...

Version 0.8.16

phonics

Phonetic Spelling Algorithms

Version 1.3.10

wordnet

WordNet Interface

Version 0.1-17

koRpus

Text Analysis with Emphasis on POS Tagging, Readability, and Lexical D...

Version 0.13-9

skmeans

Spherical k-Means Clustering

Version 0.2-19

stringi

Fast and Portable Character String Processing Facilities

Version 1.8.7

svs

Tools for Semantic Vector Spaces

Version 3.1.1

tesseract

Open Source OCR Engine

Version 5.2.5

kernlab

Kernel-Based Machine Learning Lab

Version 0.9-33

tm

Text Mining Package

Version 0.7-17

tm.plugin.europresse

Import Articles from 'Europresse' Using the 'tm' Text Mining Framework

Version 1.4.1

tm.plugin.mail

Text Mining E-Mail Plug-in

Version 0.3-1

hunspell

High-Performance Stemmer, Tokenizer, and Spell Checker

Version 3.0.6

quanteda

Quantitative Analysis of Textual Data

Version 4.3.1

languageR

Data Sets and Functions with Analyzing Linguistic Data: A Practical In...

Version 1.6

corporaexplorer

A 'Shiny' App for Exploration of Text Collections

Version 0.9.0

tm.plugin.alceste

Import Texts from Files in the 'Alceste' Format Using the 'tm' Text Mi...

Version 1.1.2

ore

An R Interface to the Onigmo Regular Expression Library

Version 1.7.5.1

tm.plugin.factiva

Import Articles from 'Factiva' Using the 'tm' Text Mining Framework

Version 1.8.1

RcmdrPlugin.temis

Graphical Integrated Text Mining Solution

Version 0.7.12

tidytext

Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools

Version 0.4.3

sentencepiece

Text Tokenization using Byte Pair Encoding and Unigram Modelling

Version 0.2.5

word2vec

Distributed Representations of Words

Version 0.4.1
