trim: logical. If TRUE removes leading and trailing white spaces.
clean: trim logical. If TRUE extra white spaces and escaped character will be removed.
pattern: A character string containing a regular expression (or character string for fixed = TRUE) to be matched in the given character vector. Default, @rm_non_words uses the rm_non_words regex from the regular expression dictionary from the dictionary argument.
replacement: Replacement for matched pattern (Note: default is " ", whereas most qdapRegex functions replace with "").
extract: logical. If TRUE the non-words are extracted into a list of vectors.
dictionary: A dictionary of canned regular expressions to search within if pattern begins with "@rm_".
...: Other arguments passed to gsub.
Returns
Returns a character string with non-words removed.
Note
Setting the argument extract = TRUE is not very useful. Use the following setup instead (see Examples for a demonstration).
x <- c("I like 56 dogs!","It's seventy-two feet from the px290.",NA,"What","that1is2a3way4to5go6.","What do you*% want? For real%; I think you'll see.","Oh some <html>code</html> to remove")rm_non_words(x)ex_non_words(x)