audubon: Japanese Text Processing Tools
Default Japanese date format
Japanese date labeller for ggplot2
Japanese word-wrapping labeller for ggplot2
Read rewrite definition file
Fill Japanese iteration marks
Convert text following the rules of 'NEologd'
Parse Japanese calendar dates
Rewrite Japanese text using normalization rules
Romanize Japanese text
Tokenize Japanese text
Transcribe integers into Japanese kanji numerals
Convert Japanese kana characters
A collection of Japanese text processing tools for filling Japanese iteration marks, converting between Japanese character types, segmenting text by phrase, and normalizing text according to rules for the 'Sudachi' morphological analyzer and 'NEologd' (the neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).