[lxml-dev] html5lib tree builder in lxml 2.2

Geoffrey Sneddon foolistbar at googlemail.com
Sat Mar 28 21:09:36 CET 2009


On 25 Mar 2009, at 21:44, Stefan Behnel wrote:

>> Would it be possible to get a
>> patch for html5lib that would fix these issues (this'll need to be  
>> under
>> the MIT license)?
>
> It's mainly about stuff that ET doesn't support, such as the  
> DOCTYPE, or
> top-level comments. I don't know if the html5lib project is  
> interested in
> that, but it shouldn't be too hard to add some conditional lxml  
> specifics
> to their code.

There is already a whole separate lxml treebuilder in html5lib. I'm in  
part wondering why that wasn't used verbatim, and if there are any  
issues with it fixed in lxml 2.2's treebuilder that a patch be made  
available under licensing terms acceptable to html5lib (I'd probably  
look more closely to see quite what was changed if I could actually  
copy changes safely with the licensing being such).


--
Geoffrey Sneddon
<http://gsnedders.com/>



More information about the lxml-dev mailing list