[lxml-dev] lxml.html, now with ignored namespaces!

Thomas Weigel seasong at chantofwaves.com
Fri Jun 26 23:48:01 CEST 2009


Hello,

Stefan Behnel wrote:
> Thomas Weigel wrote:
>> I am using lxml to parse HTML documents, which include a custom 
>> namespace (for example, "<p cs:content='fruit'>FRUIT</p>").
> 
> You forgot to mention which versions of libxml2 you are using on both
> systems. That's likely the reason for the difference.

Thank you for being kind.

> http://codespeak.net/lxml/FAQ.html#i-think-i-have-found-a-bug-in-lxml-what-should-i-do

I have begun investigating down this path. I will not bother you again 
until I have finished there.

In the meantime, I am working around the problem with a regular 
expression to replace 'custom_namespace:' with 'custom_namespace_', 
depending on whether or not lxml deletes the custom namespace.

Thank you for your time.

Thomas Weigel


More information about the lxml-dev mailing list