[lxml-dev] html5lib tree builder in lxml 2.2
Geoffrey Sneddon
foolistbar at googlemail.com
Sat Mar 28 21:09:36 CET 2009
On 25 Mar 2009, at 21:44, Stefan Behnel wrote:
>> Would it be possible to get a
>> patch for html5lib that would fix these issues (this'll need to be
>> under
>> the MIT license)?
>
> It's mainly about stuff that ET doesn't support, such as the
> DOCTYPE, or
> top-level comments. I don't know if the html5lib project is
> interested in
> that, but it shouldn't be too hard to add some conditional lxml
> specifics
> to their code.
There is already a whole separate lxml treebuilder in html5lib. I'm in
part wondering why that wasn't used verbatim, and if there are any
issues with it fixed in lxml 2.2's treebuilder that a patch be made
available under licensing terms acceptable to html5lib (I'd probably
look more closely to see quite what was changed if I could actually
copy changes safely with the licensing being such).
--
Geoffrey Sneddon
<http://gsnedders.com/>
More information about the lxml-dev
mailing list