[lxml-dev] problem about lxml encoding

Geoffrey Sneddon foolistbar at googlemail.com
Sun May 31 14:28:31 CEST 2009


On 31 May 2009, at 10:29, Stefan Behnel wrote:

> qhlonline wrote:
>> Since I am processing Chinese Webs. There are instances that some  
>> Webs
>> are not regular.

If you need support for deployed web content, you'll probably have  
better success with html5lib than you will with libxml2's HTML parser.


--
Geoffrey Sneddon
<http://gsnedders.com/>



More information about the lxml-dev mailing list