Remove namespaces for easier Xpath queries
Xpath can get strange when namespaces are involved
> doc = Nokogiri::XML(<<-XML)
<a xmlns='http://www.example.com/xhtml'>
<b>
<c></c>
</b>
</a>
XML
> doc.xpath("/a")
[] # returns empty array
> doc.xpath("/*[name()='a']")
# returns the first node
The [name()='a']
isn't really clear and will become less clear as the query
looks for elements deeper in the document. When the namespace doesn't provide
any value, for instance by helping to avoid collisions, or helping to validate
the given xml document, then removing the namespace is entirely acceptable.
In Nokogiri you can remove namespaces from the document with, remove_namespaces!
.
> doc.remove_namespaces!
> doc.xpath('/a')
[<Node a>]
Now traversing the document with xpath will be significantly more straightforward.
Tweet