strip function

Strip Text

Strip Text

Strip text of unwanted characters.

strip( x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE ) ## S3 method for class 'character' strip( x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE ) ## S3 method for class 'factor' strip( x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE ) ## Default S3 method: strip( x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE ) ## S3 method for class 'list' strip( x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE )

Arguments

  • x: The text variable.
  • char.keep: A character vector of symbols (i.e., punctuation) that strip should keep. The default is to strip every symbol except apostrophes and a double tilde "~~". The double tilde "~~" is included for a convenient means of keeping word groups together in functions that split text apart based on spaces. To remove double tildes "~~" set char.keep to NULL.
  • digit.remove: logical. If TRUE strips digits from the text.
  • apostrophe.remove: logical. If TRUE removes apostrophes from the output.
  • lower.case: logical. If TRUE forces all alpha characters to lower case.

Returns

Returns a vector of text that has been stripped of unwanted characters.

Examples

## Not run: DATA$state #no strip applied strip(DATA$state) strip(DATA$state, apostrophe.remove=FALSE) strip(DATA$state, char.keep = c("?", ".")) ## End(Not run)

See Also

rm_stopwords

  • Maintainer: Tyler Rinker
  • License: GPL-2
  • Last published: 2023-05-11