[lxml-dev] clean_html

Francesco cattafra at hotmail.com
Wed Jun 24 15:30:48 CEST 2009


Thank you very much!

I need now a way to find out the encoding of my data... Because it is a webpage 
there must be a way to extract that information...

Should I look for something like charset=XXXXXXX?

Is there a way to extract that info easily after a call to urlopen?
html = urlopen(webpage).read()

Thanks,

Francesco




More information about the lxml-dev mailing list