audubon R package [Documentation]

audubon-package

audubon: Japanese Text Processing Tools

default_format

Default Japanese date format

label_date_jp

Japanese date labeller for ggplot2

label_wrap_jp

Japanese word-wrapping labeller for ggplot2

read_rewrite_def

Read rewrite definition file

strj_fill_iter_mark

Fill Japanese iteration marks

strj_normalize

Convert text following the rules of 'NEologd'

strj_parse_date

Parse Japanese calendar dates

strj_rewrite_as_def

Rewrite Japanese text using normalization rules

strj_romanize

Romanize Japanese text

strj_tokenize

Tokenize Japanese text

strj_transcribe_num

Transcribe integers into Japanese kanji numerals

strj-hira-kana

Convert Japanese kana characters

Download source package Read PDF manual

A collection of Japanese text processing tools for filling Japanese iteration marks, Japanese character type conversions, segmentation by phrase, and text normalization which is based on rules for the 'Sudachi' morphological analyzer and the 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).

Maintainer: Akiru Kato
License: Apache License (>= 2)
Last published: 2026-01-09

Useful links

audubon0.6.2 package

Functions

Readme

Datasets

Dependencies

Imports

Versions

News