Skip to content Skip to sidebar Skip to footer

Python Xml Query Get Parent

I have a big xml document that looks like this: 3

Solution 1:

Unfortunately, when using the ElementTree API, each Element object has no reference back to its parent, so you cannot go up the tree from a known point. Instead, you have to find the possible parent objects and filter the ones you want.

This is commonly done with XPath expressions. However, ElementTree only supports a subset of XPath (see the docs), the most useful parts of which were only added in ElementTree 1.3, which only comes with Python 2.7+ or 3.2+.

And even, ElementTree's XPath it cannot work with your file as is - there is no way to select based on the text of a node, only its attributes (or attribute values).

My experimentation has only found two ways you can proceed with ElementTree. If you are using Python 2.7+ (or are able to download and install a newer version of ElementTree to work with older Python versions), and you can modify the format of the XML file to put the numbers as attributes, like so

<Valname="number"><fval="265456" /></Val>

then the following Python code will pull out the nodes of interest:

import xml.etree.ElementTree as ETree
tree = ETree.ElementTree(file='sample.xml')
nodes = tree.findall(".//Node/Val[@name='number']/f[@val='265456']....")

For older Pythons, or if you cannot modify the XML format, you will have to filter the invalid nodes manually. The following worked for me:

import xml.etree.ElementTree as ETree
tree = ETree.ElementTree(file='sample.xml')
all = tree.findall(".//Node")
nodes = []

# Filter matching nodes and put them in the nodes variable.for node inall:
    for val in node.getchildren():
        if val.attrib['name'] == 'number'and val.getchildren()[0].text =='265456':
            nodes.append(node)

Neither of these solutions is what I would call ideal, but they're the only ones I have been able to make work with the ElementTree library (since that is what you mentioned using). You might be better off using a third-party library rather than using the built-in ones; see the Python wiki entry on XML for a list of options. lxml is the Python bindings for the widely-used libxml2 library, and would be the one I would suggest looking at first. It has XPath support so you should be able to use the queries from the other answers.

Solution 2:

This XPath:

/Node/Node[Val[@name='number']/f='265456']/@name

Outputs:

1764466544

Solution 3:

The following function has helped me in similar cases. As the docstring explains, it doesn't work in the general case, but if your nodes are unique, it should help.

def get_element_ancestry(root, element):
'''Return a list of ancestor Elements for the given element.If both root and element are of type xml.etree.ElementTree.Element, andif
the given root contains the given element as a descendent, thenreturn a
list of direct xml.etree.ElementTree.Element ancestors, starting with root
and ending with element. Otherwise, return an empty list.

The xml.etree.ElementTree module offers no functiontoreturn the parent of
a given Element, presumably because an Element may be in more than one tree,
or even multiple times within a given tree, so its parent depends on the
context. This function provides a solution in the specific cases where the
caller either knows that the given element appears just once within the
tree oris satisfied with the first branch to reference the given element.
'''
result = []
xet = xml.etree.ElementTree
ifnot xet.iselement(root) ornot xet.iselement(element):
    return result
xpath = './/' + element.tag \
    + ''.join(["[@%s='%s']" % a for a in element.items()])
parent = root
while parent != None:
    result.append(parent)
    for child in parent.findall('*'):if child == element:
            result.append(element)
            return result
        if child.findall(xpath).count(element):
            parent = child
            break
    else:
        return []
return result

Solution 4:

Usually

node.parentNode 

will return a potiner to the parent node (when using a DOM parser).

For XPath see

http://www.tizag.com/xmlTutorial/xpathparent.php

Post a Comment for "Python Xml Query Get Parent"