[lxml-dev] lxml.html, now with ignored namespaces!
Thomas Weigel
seasong at chantofwaves.com
Fri Jun 26 23:48:01 CEST 2009
Hello,
Stefan Behnel wrote:
> Thomas Weigel wrote:
>> I am using lxml to parse HTML documents, which include a custom
>> namespace (for example, "<p cs:content='fruit'>FRUIT</p>").
>
> You forgot to mention which versions of libxml2 you are using on both
> systems. That's likely the reason for the difference.
Thank you for being kind.
> http://codespeak.net/lxml/FAQ.html#i-think-i-have-found-a-bug-in-lxml-what-should-i-do
I have begun investigating down this path. I will not bother you again
until I have finished there.
In the meantime, I am working around the problem with a regular
expression to replace 'custom_namespace:' with 'custom_namespace_',
depending on whether or not lxml deletes the custom namespace.
Thank you for your time.
Thomas Weigel
More information about the lxml-dev
mailing list