Python XML Parsing without root

python xml parsing python-2.7 elementtree

ElementTree.fromstringlist accepts any iterable that yields strings, so the file's lines can be wrapped in a synthetic root element without editing the file.

Using it with itertools.chain:

    import itertools
    import xml.etree.ElementTree as ET
    # import xml.etree.cElementTree as ET

    with open('xml-like-file.xml') as f:
        it = itertools.chain('<root>', f, '</root>')
        root = ET.fromstringlist(it)

    # Do something with `root`
    root.find('.//tag3')
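To try the same idea without a file on disk, here is a minimal sketch that uses an in-memory stream in place of the file. The sample markup and the names `rootless`, `chunks` and `top_level_tags` are made up for illustration; only `itertools.chain` and `ET.fromstringlist` come from the answer above.

```python
import io
import itertools
import xml.etree.ElementTree as ET

# Hypothetical sample data standing in for a file with no single root element.
rootless = io.StringIO(
    "<tag1>\n <tag2/>\n</tag1>\n"
    "<tag1>\n <tag3/>\n</tag1>\n"
)

# Wrap the line iterator in a synthetic <root> element. The one-item lists
# pass each wrapper tag as a single chunk rather than character by character.
chunks = itertools.chain(["<root>"], rootless, ["</root>"])
root = ET.fromstringlist(chunks)

# The original top-level elements are now children of the synthetic root.
top_level_tags = [child.tag for child in root]
found = root.find(".//tag3")
print(top_level_tags, found.tag)
```

Because the stream is consumed line by line, the whole input never has to be joined into one string before parsing.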


Instead of editing the file, you could wrap its contents in a root element at parse time:

    import xml.etree.ElementTree as ET

    # open() works on both Python 2 and 3; the file() builtin is Python 2 only
    with open("xml-file.xml") as f:
        xml_object = ET.fromstringlist(["<root>", f.read(), "</root>"])


lxml.html can parse fragments:

    from lxml import html

    s = """<tag1>
     <tag2>
     </tag2>
    </tag1>
    <tag1>
     <tag3/>
    </tag1>"""

    doc = html.fromstring(s)
    for thing in doc:
        print(thing)
        for other in thing:
            print(other)

    """
    >>>
    <Element tag1 at 0x3411a80>
    <Element tag2 at 0x3428990>
    <Element tag1 at 0x3428930>
    <Element tag3 at 0x3411a80>
    >>>
    """

Courtesy of this SO answer.

And if there is more than one level of nesting:

    def flatten(nested):
        """recursively flatten nested elements

        yields individual elements
        """
        for thing in nested:
            yield thing
            for other in flatten(thing):
                yield other

    doc = html.fromstring(s)
    for thing in flatten(doc):
        print(thing)
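The same generator also works on a standard-library ElementTree tree, which makes it easy to check without lxml installed. The sketch below is illustrative only: the fragment, the synthetic `<root>` wrapper and the names `flat_tags` and `iter_tags` are assumptions, not part of the answer above. It also compares the result with the built-in `Element.iter()`, which walks the same document order but yields the starting element first.

```python
import xml.etree.ElementTree as ET

def flatten(nested):
    """Yield every descendant of `nested` in document order."""
    for thing in nested:
        yield thing
        for other in flatten(thing):
            yield other

# Hypothetical fragment with two levels of nesting, wrapped in a synthetic
# <root> so the stdlib parser accepts it.
fragment = "<tag1><tag2><tag4/></tag2></tag1><tag1><tag3/></tag1>"
root = ET.fromstring("<root>" + fragment + "</root>")

flat_tags = [el.tag for el in flatten(root)]

# Element.iter() yields the root itself first, so drop that first entry
# before comparing.
iter_tags = [el.tag for el in root.iter()][1:]
print(flat_tags)
```

If only the traversal is needed, `iter()` may be enough on its own; the hand-written generator is mainly useful when the starting element should be excluded or the walk needs customizing.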

Similarly, lxml.etree.HTML will parse this. It adds html and body tags:

    from lxml import etree

    d = etree.HTML(s)
    for thing in d.iter():
        print(thing)

    """
    <Element html at 0x3233198>
    <Element body at 0x322fcb0>
    <Element tag1 at 0x3233260>
    <Element tag2 at 0x32332b0>
    <Element tag1 at 0x322fcb0>
    <Element tag3 at 0x3233148>
    """
