[Lxml-checkins] r43962 - lxml/branch/html/src/lxml/html

ianb at codespeak.net ianb at codespeak.net
Fri Jun 1 07:10:51 CEST 2007


Author: ianb
Date: Fri Jun  1 07:10:51 2007
New Revision: 43962

Modified:
   lxml/branch/html/src/lxml/html/clean.py
Log:
added some more tests for basic functionality

Modified: lxml/branch/html/src/lxml/html/clean.py
==============================================================================
--- lxml/branch/html/src/lxml/html/clean.py	(original)
+++ lxml/branch/html/src/lxml/html/clean.py	Fri Jun  1 07:10:51 2007
@@ -97,6 +97,10 @@
 
     This modifies the document *in place*.
     """
+    # First, handle a case that IE treats <image> like <img>, and that can
+    # confuse either this step or later steps.
+    for el in doc.xpath('descendant-or-self::image'):
+        el.tag = 'img'
     kill_tags = []
     remove_tags = list(remove_tags or [])
     if scripts:


More information about the lxml-checkins mailing list