wordpredictor: Develop Text Prediction Models Based on N-Grams

The package provides the following components:

- Generates n-grams from text files
- Generates transition probabilities for n-grams
- Generates data samples from text files
- Allows managing the test environment
- Represents n-gram models
- Evaluates the performance of n-gram models
- Generates n-gram models from a text file
- Allows predicting text and calculating word probabilities and perplexity
- Provides a base class for all other classes
- Analyzes input text files and n-gram token files
- Provides data cleaning functionality
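The core idea behind these components — extracting n-grams from text and converting their counts into transition probabilities for prediction — can be sketched in a few lines of base R. The function below is purely illustrative and is not part of the package's API; it computes bigram maximum-likelihood probabilities P(w2 | w1) = count(w1 w2) / count(w1).

```r
# Illustrative sketch (not the package's API): bigram transition
# probabilities estimated from a single cleaned text string.
make_bigram_probs <- function(text) {
  words <- tolower(unlist(strsplit(text, "\\s+")))
  # Pair each word with its successor to form bigrams
  bigrams <- paste(head(words, -1), tail(words, -1))
  counts <- table(bigrams)
  # Occurrence count of each bigram prefix (the first word)
  prefixes <- table(head(words, -1))
  first <- sub(" .*", "", names(counts))
  # P(w2 | w1) = count(w1 w2) / count(w1)
  probs <- as.numeric(counts) / as.numeric(prefixes[first])
  setNames(probs, names(counts))
}

p <- make_bigram_probs("the cat sat on the mat")
p["the cat"]  # 0.5: "the" is followed by "cat" in one of its two occurrences
```

Predicting the next word then amounts to picking the bigram with the highest probability among those whose prefix matches the last word typed; the package generalizes this to higher-order n-grams with back-off.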
A framework for developing n-gram models for text prediction. It provides data cleaning, data sampling, token extraction from text, model generation, model evaluation and word prediction. For background on how n-gram models work, we referred to "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https://web.stanford.edu/~jurafsky/slp3/3.pdf>. For optimizing R code and using R6 classes, we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions, we referred to "R Packages" <https://r-pkgs.org/index.html>.
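One of the evaluation quantities mentioned above is perplexity: the geometric-mean inverse probability the model assigns to a test text, so lower values mean the model predicts the text better. A minimal sketch of the computation (illustrative only, not the package's API):

```r
# Illustrative sketch (not the package's API): perplexity from the
# per-word probabilities a model assigned to a test sequence.
perplexity <- function(word_probs) {
  # exp of the negative mean log-probability, i.e. the
  # geometric-mean inverse probability of the sequence
  exp(-mean(log(word_probs)))
}

perplexity(c(0.25, 0.25, 0.25, 0.25))  # 4: a uniform 4-way guess
```

A model that always assigned probability 1 would reach the minimum perplexity of 1; random guessing over a vocabulary of size V gives perplexity V.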