hiroba dataset

Tokenized full text of 'Porano no Hiroba' by Miyazawa Kenji, sourced from Aozora Bunko

  • Maintainer: Akiru Kato
  • License: Apache License (>= 2)
  • Last published: 2024-04-27

About the dataset

  • Number of rows: 26849
  • Number of columns: 5
  • Class: data.frame

Column names and types

  • doc_id: factor
  • sentence_id: integer
  • token_id: integer
  • token: character
  • feature: character
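
A minimal sketch of how the five columns above could be checked and used, written in Python with the standard library only. It assumes that token_id gives the order of tokens within a sentence, which the listing above does not state. The names COLUMNS, check_row and tokens_by_sentence are hypothetical helpers, and the sample rows are made-up placeholders, not values from the dataset.

```python
from collections import defaultdict

# Column names as listed above. The Python types are rough stand-ins
# for the R types (factor is represented here as str).
COLUMNS = {
    "doc_id": str,
    "sentence_id": int,
    "token_id": int,
    "token": str,
    "feature": str,
}


def check_row(row):
    """Return True if the row has exactly the expected columns and types."""
    return set(row) == set(COLUMNS) and all(
        isinstance(row[name], typ) for name, typ in COLUMNS.items()
    )


def tokens_by_sentence(rows):
    """Group tokens by (doc_id, sentence_id).

    Assumption: token_id orders the tokens within each sentence.
    """
    ordered = sorted(
        rows, key=lambda r: (r["doc_id"], r["sentence_id"], r["token_id"])
    )
    grouped = defaultdict(list)
    for row in ordered:
        grouped[(row["doc_id"], row["sentence_id"])].append(row["token"])
    return dict(grouped)


# Made-up placeholder rows, NOT taken from the dataset.
rows = [
    {"doc_id": "1", "sentence_id": 1, "token_id": 2, "token": "b", "feature": "x"},
    {"doc_id": "1", "sentence_id": 1, "token_id": 1, "token": "a", "feature": "x"},
    {"doc_id": "1", "sentence_id": 2, "token_id": 1, "token": "c", "feature": "x"},
]

sentences = tokens_by_sentence(rows)
```

With rows exported from the real data frame, the same grouping would give one token list per sentence, keyed by document and sentence identifier.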