[lxml-dev] Simple doctypes not in docinfo.doctype
F Wolff
friedel at translate.org.za
Mon Oct 27 16:52:25 CET 2008
On Sa, 2008-10-25 at 20:16 +0200, Stefan Behnel wrote:
> Hi,
>
> Stefan Behnel wrote:
> > F Wolff wrote:
> >> I've tried this with an old (1.3.2) and newer (2.0.6) lxml version.
> >>
> >> (this example is roughly based on the code at
> >> http://codespeak.net/lxml/tutorial.html)
> >>
> >> from lxml import etree
> >> from StringIO import StringIO
> >> tree = etree.parse(StringIO("""<!DOCTYPE TS><TS></TS>"""))
> >> tree.docinfo.doctype
> >> ''
> >>
> >> From my understanding this DOCTYPE declaration is valid (and occurring
> >> in the wild in Qt .ts files). My real issue is round-trip problems in a
> >> reading-writing cycle where the DOCTYPE is lost, but I guess not being
> >> able to use .docinfo.doctype is already a problem.
> >
> > I agree that better handling is desirable here. Could you file a bug report so
> > that this doesn't get lost?
>
> Ok, I fixed it anyway. Here's a patch.
>
> Stefan
Thank you Stefan! I haven't even gotten round to the bug report yet, and
you already have it fixed!
At the time I implemented a workaround, but I hope to test this issue
with your proper fix soon.
Thank you again.
Friedel
--
Recently on my blog:
http://translate.org.za/blogs/friedel/en/content/its-easyer-with-kulula
More information about the lxml-dev
mailing list