getNodeSet {XML} | R Documentation |
This function provides a way to find XML nodes that match a particular criterion. It uses the XPath syntax and allows quite powerful expressions for identifying nodes. The XPath language requires some knowledge, but tutorials are available.
getNodeSet(doc, path, namespaces = character())
doc |
an object of class XMLInternalDocument |
path |
a string (character vector of length 1) giving the XPath expression to evaluate. |
namespaces |
a named character vector giving the namespace prefix and URI pairs that are to be used in the XPath expression and matching of nodes. One only needs to specify the namespaces in the XPath expression and for the nodes of interest rather than requiring all the namespaces for the entire document. |
This calls the libxml routine xmlXPathEval
.
The results can currently be different based on the returned value from the XPath expression evaluation:
list |
a node set |
numeric |
a number |
logical |
a boolean |
character |
a string, i.e. a single character element. |
More of the XPath functionality provided by libxml can and may be made available to the R package. Facilities such as compiled XPath expressions, functions, ordered node information,
Please send requests to the maintainer.
Duncan Temple Lang <duncan@wald.ucdavis.edu>
http://xmlsoft.org, http://www.w3.org/xml http://www.w3.org/TR/xpath http://www.omegahat.org/RSXML
xmlTreeParse
with useInternalNodes
as TRUE
.
doc = xmlTreeParse(system.file("exampleData", "tagnames.xml", package = "XML"), useInternalNodes = TRUE) getNodeSet(doc, "/doc//b[@status]") getNodeSet(doc, "/doc//b[@status='foo']") els = getNodeSet(doc, "/doc//a[@status]") sapply(els, function(el) xmlGetAttr(el, "status"))