cleanNLP R package [Documentation]

cleanNLP-package

cleanNLP: A Tidy Data Model for Natural Language Processing

cnlp_annotate

Run the annotation pipeline on a set of documents

cnlp_download_spacy

Download model files needed for spacy

cnlp_init_spacy

Interface for initializing the spacy backend

cnlp_init_stringi

Interface for initializing the standard R backend

cnlp_init_udpipe

Interface for initializing the udpipe backend

cnlp_utils_pca

Compute Principal Components and store as a Data Frame

cnlp_utils_tfidf

Construct the TF-IDF Matrix from Annotation or Data Frame

Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may use the 'udpipe' back end, which has no external dependencies, or a Python back end built on 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part-of-speech tagging, named entity recognition, and dependency parsing.
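
The workflow described above can be sketched roughly as follows. This is a minimal illustration, not taken from the package manual: it assumes the cleanNLP 3.x interface, in which cnlp_annotate() accepts a data frame with doc_id and text columns and returns a list containing token and document tables. It uses the stringi back end because that one needs no model download. The example corpus and the argument values passed to cnlp_utils_tfidf() (min_df, max_df, token_var) are assumptions chosen so that a three-document toy corpus is not filtered down to nothing.

```r
library(cleanNLP)

# Toy corpus (made up for illustration): one row per document.
docs <- data.frame(
  doc_id = c("d1", "d2", "d3"),
  text = c(
    "The cat sat on the mat.",
    "Dogs chase cats in the park.",
    "A tidy table has one row per token."
  ),
  stringsAsFactors = FALSE
)

# Initialize the standard R (stringi) back end; no external models required.
cnlp_init_stringi()

# Run the annotation pipeline. The result is assumed to be a list of tables.
anno <- cnlp_annotate(docs, verbose = FALSE)

# One row per token, keyed by document id.
head(anno$token)

# One row per document, carrying over any metadata columns from the input.
anno$document

# Build a document-by-term TF-IDF matrix from the raw tokens.
# min_df = 0 and max_df = 1 switch off document-frequency filtering,
# which would otherwise drop most terms in such a small corpus.
tfidf <- cnlp_utils_tfidf(
  anno$token,
  min_df = 0,
  max_df = 1,
  token_var = "token"
)
dim(tfidf)
```

Swapping cnlp_init_stringi() for cnlp_init_udpipe() or cnlp_init_spacy() should leave the rest of the sketch unchanged, though those back ends need their model files available first (for spaCy, see cnlp_download_spacy).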

Maintainer: Taylor B. Arnold
License: LGPL-2
Last published: 2024-05-20

Useful links

https://github.com/statsmaths/cleanNLP/issues
https://statsmaths.github.io/cleanNLP/

cleanNLP 3.1.0 package
