Thank you very much! I need now a way to find out the encoding of my data... Because it is a webpage there must be a way to extract that information... Should I look for something like charset=XXXXXXX? Is there a way to extract that info easily after a call to urlopen? html = urlopen(webpage).read() Thanks, Francesco