theoffice dataset

Complete script transcriptions from The Office

  • Maintainer: Brad Lindblad
  • License: MIT + file LICENSE
  • Last published: 2022-09-29

About the dataset

  • Number of rows: 55130
  • Number of columns: 12
  • Class: tbl_df, tbl, data.frame

Column names and types (first 10 of 12)

  • index:integer
  • season:integer
  • episode:integer
  • episode_name:character
  • director:character
  • writer:character
  • character:character
  • text:character
  • text_w_direction:character
  • imdb_rating:numeric
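
The column listing above can be treated as a lightweight schema. The following is a minimal sketch, not part of the dataset or its package, of checking one row against the ten documented columns. The names `EXPECTED_COLUMNS`, `schema_problems`, and `PLACEHOLDER_ROW` are hypothetical, the mapping from R types to Python types (integer to int, character to str, numeric to float) is an assumption, and the placeholder row contains made-up filler values rather than real data. Missing values (NA) are not handled.

```python
# Documented columns (first 10 of 12), mapped from R types to Python types.
# The type mapping is an assumption made for this sketch.
EXPECTED_COLUMNS = {
    "index": int,
    "season": int,
    "episode": int,
    "episode_name": str,
    "director": str,
    "writer": str,
    "character": str,
    "text": str,
    "text_w_direction": str,
    "imdb_rating": float,
}


def _matches(value, expected):
    """Return True if value is acceptable for the expected Python type."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool):
        return False
    # R's numeric holds whole numbers too, so accept int for float columns.
    if expected is float:
        return isinstance(value, (int, float))
    return isinstance(value, expected)


def schema_problems(row):
    """Return a list of problems found in one row, given as a dict."""
    problems = []
    for name, expected in EXPECTED_COLUMNS.items():
        if name not in row:
            problems.append(f"{name}: missing")
        elif not _matches(row[name], expected):
            got = type(row[name]).__name__
            problems.append(f"{name}: expected {expected.__name__}, got {got}")
    return problems


# Hypothetical filler values, NOT taken from the dataset.
PLACEHOLDER_ROW = {
    "index": 1,
    "season": 1,
    "episode": 1,
    "episode_name": "placeholder title",
    "director": "placeholder director",
    "writer": "placeholder writer",
    "character": "placeholder character",
    "text": "placeholder line",
    "text_w_direction": "placeholder line [placeholder direction]",
    "imdb_rating": 0.0,
}

print(schema_problems(PLACEHOLDER_ROW))
```

A row exported from the data frame (for example as JSON or CSV) could be passed to `schema_problems` in the same way. The two columns not shown in the listing are ignored because their names and types are not given here.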