tipitaka_long dataset

Tipitaka in "long" form

Tipitaka in "long" form

Every word of every volume of the Tipitaka, with one word per volume per line. data

Format

A tibble with the variables:

  • word: Pali word
  • n: Number of time this word appears in this book
  • total: Ttal number of words in this book
  • freq: Frequency with which this word appears in this book
  • book: Abbreviated book name

Source

Vipassana Research Institute, CST4, April 2020

tipitaka_long
  • Maintainer: Dan Zigmond
  • License: CC0
  • Last published: 2021-03-30

About the dataset

  • Number of rows: 334276
  • Number of columns: 5
  • Class: data.frame

Column names and types

  • book:character
  • word:character
  • n:integer
  • total:integer
  • freq:numeric