tld_extract function

Extract TLDs

tld_extract extracts the top-level domain (TLD) from each element of a vector of domain names. A TLD is distinct from a suffix, which is extracted with suffix_extract: a TLD is the final, top-level label of a domain (such as .uk), while a suffix is any domain under which internet users can publicly register names (such as .org.uk).
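To make the TLD/suffix distinction concrete, here is a minimal base R sketch. It is not the package's implementation: naive_tld, naive_suffix, known_tlds and known_suffixes are hypothetical names, and the two lookup vectors are tiny hand-picked lists standing in for the real datasets.

```r
# Hypothetical, hand-picked lists for illustration only; real lists are far longer.
known_tlds     <- c("com", "org", "uk")
known_suffixes <- c("com", "org", "uk", "co.uk", "org.uk")

# TLD: the final dot-separated label, kept only if it appears in the TLD list.
naive_tld <- function(domains, tlds = known_tlds) {
  labels <- strsplit(tolower(domains), ".", fixed = TRUE)
  last <- vapply(
    labels,
    function(x) if (length(x) == 0) NA_character_ else x[length(x)],
    character(1)
  )
  last[!last %in% tlds] <- NA_character_
  last
}

# Suffix: the longest listed suffix that the domain ends with.
naive_suffix <- function(domains, suffixes = known_suffixes) {
  vapply(tolower(domains), function(d) {
    hits <- suffixes[endsWith(d, paste0(".", suffixes))]
    if (length(hits) == 0) NA_character_ else hits[which.max(nchar(hits))]
  }, character(1), USE.NAMES = FALSE)
}

domains <- c("example.org.uk", "en.wikipedia.org")
naive_tld(domains)     # "uk"     "org"
naive_suffix(domains)  # "org.uk" "org"
```

For example.org.uk the two answers differ (uk versus org.uk), while for en.wikipedia.org they coincide, since .org is both a TLD and a public suffix.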

tld_extract(domains, tlds = NULL)

Arguments

  • domains: a vector of domains, retrieved through url_parse or domain.
  • tlds: a dataset of TLDs. If NULL (the default), tld_extract relies on urltools' tld_dataset; otherwise, you can pass in the result of tld_refresh.

Returns

A data.frame with two columns: domain, containing the original domain names, and tld, containing the TLD identified in each domain.

Examples

# Using the inbuilt dataset
domains <- domain("https://en.wikipedia.org/wiki/Main_Page")
tld_extract(domains)

# Using a refreshed one
tld_extract(domains, tld_refresh())
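As a further illustration, the sketch below runs tld_extract and suffix_extract on the same input so the two results can be compared side by side. It assumes urltools is installed and that the inbuilt datasets are used; the second domain, www.bbc.co.uk, is an illustrative choice and does not come from the package documentation.

```r
library(urltools)

domains <- c("en.wikipedia.org", "www.bbc.co.uk")

# TLDs: only the final, top-level label of each domain is reported.
tlds <- tld_extract(domains)
tlds$tld

# Suffixes: the publicly registrable suffix, which may span several labels.
suffixes <- suffix_extract(domains)
suffixes$suffix
```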

See Also

suffix_extract for retrieving suffixes (distinct from TLDs).

  • Maintainer: Os Keyes
  • License: MIT + file LICENSE
  • Last published: 2019-04-14