getXMLErrors function

Get XML/HTML document parse errors

Get XML/HTML document parse errors

This function is intended to be a convenience for finding all the errors in an XML or HTML document due to being malformed, i.e. missing quotes on attributes, non-terminated elements/nodes, incorrectly terminated nodes, missing entities, etc. The document is parsed and a list of the errors is returned along with information about the file, line and column number.

getXMLErrors(filename, parse = xmlParse, ...)

Arguments

  • filename: the identifier for the document to be parsed, one of a local file name, a URL or the XML/HTML content itself
  • parse: the function to use to parse the document, usually either xmlTreeParse or htmlTreeParse.
  • ...: additional arguments passed to the function given by parse

Returns

A list of S3-style XMLError objects.

References

libxml2 (http://xmlsoft.org)

Author(s)

Duncan Temple Lang

See Also

error argument for xmlTreeParse and related functions.

Examples

# Get the "errors" in the HTML that was generated from this Rd file getXMLErrors(system.file("html", "getXMLErrors.html", package = "XML")) ## Not run: getXMLErrors("https://www.omegahat.net/index.html") ## End(Not run)