host_extract extracts the host from each element of a vector of domain names. A host is not the same as a domain: if the domain has one or more subdomains, the host can be the subdomain instead. The host of en.wikipedia.org
is en, while the host of wikipedia.org is wikipedia.
host_extract(domains)
Arguments
domains: a vector of domains, retrieved through url_parse or domain.
Returns
a data.frame of two columns: domain, with the original domain names, and host, the identified host from the domain.
Examples
# With subdomains
has_subdomain <- domain("https://en.wikipedia.org/wiki/Main_Page")
host_extract(has_subdomain)

# Without
no_subdomain <- domain("https://ironholds.org/projects/r_shiny/")
host_extract(no_subdomain)
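
As a rough illustration of the host/domain distinction described at the top of this page, the base R sketch below treats the host as the label before the first dot and returns the two-column layout listed under Returns. The helper name first_label is hypothetical and is not part of the package. Taking the first label is an assumption that matches the two examples given above, and it may not reflect how host_extract is actually implemented or how it treats domains with several subdomains.

```r
# Hypothetical helper, NOT part of the package: a sketch that assumes the
# host is whatever precedes the first dot in the domain name.
first_label <- function(domains) {
  # Drop the first dot and everything after it.
  host <- sub("\\..*$", "", domains)
  # Mirror the documented return shape: original domain plus identified host.
  data.frame(domain = domains, host = host, stringsAsFactors = FALSE)
}

sketch <- first_label(c("en.wikipedia.org", "wikipedia.org"))
print(sketch)
```

The sketch uses only base R, so it can be run without the package installed to see the expected shape of the result.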