[lxml-dev] lxml 2.0alpha4 released

Stefan Behnel stefan_ml at behnel.de
Sun Oct 7 07:14:33 CEST 2007


Hi all,

I just released a 4th alpha version of lxml 2.0 to PyPI. It hopefully sets an
end to the tag name validation problems by distinguishing between HTML tags
and XML tags based on the associated parser, i.e. either the one that parsed
it or the one that created the element through its "makeelement" method. Note
that the Element factory of lxml.etree uses the XMLParser by default, while
the factory in lxml.html uses the HTMLParser, and thus allows HTML tag names.

Everyone who bumped into and/or reported problems with this, please verify
that this provides a viable solution to you.

Have fun,
Stefan


2.0alpha4 (2007-10-07)

Features added
Bugs fixed

    * AttributeError in feed parser on parse errors

Other changes

    * Tag name validation in lxml.etree (and lxml.html) now distinguishes
      between HTML tags and XML tags based on the parser that was used to
      parse or create them. HTML tags no longer reject any non-ASCII
      characters in tag names but only spaces and the special characters
      <>&/'"


More information about the lxml-dev mailing list