[lxml-dev] Proposal: Better html5lib Support

Stefan Behnel stefan_ml at behnel.de
Sun Jul 13 17:12:32 CEST 2008



Stefan Behnel wrote:
> Armin Ronacher wrote:
>> This document::
>>
>>     <!doctype html>
>>     <title>foo</title>
>>     <p>blah
>>
>> Comes out as (lxml.etree.tostring)::
>>
>>     <!DOCTYPE html PUBLIC "" "">
>>     ...
> 
> I'm not sure if <!DOCTYPE html> is actually allowed in SGML, didn't find
> anything on that so far.

http://xml.coverpages.org//sgmlsyn/sgmlsyn.htm#P110

Looks like that's the right thing to do, so I committed a fixed version of the
patch to the trunk.

Stefan



More information about the lxml-dev mailing list