[Lxml-checkins] r44786 - lxml/branch/html/doc
scoder at codespeak.net
scoder at codespeak.net
Fri Jul 6 15:26:25 CEST 2007
Author: scoder
Date: Fri Jul 6 15:26:24 2007
New Revision: 44786
Modified:
lxml/branch/html/doc/lxmlhtml.txt
Log:
small doc fixes
Modified: lxml/branch/html/doc/lxmlhtml.txt
==============================================================================
--- lxml/branch/html/doc/lxmlhtml.txt (original)
+++ lxml/branch/html/doc/lxmlhtml.txt Fri Jul 6 15:26:24 2007
@@ -66,6 +66,7 @@
Returns the text content of the element, including the text
content of its children, with no markup.
+
Running HTML doctests
=====================
@@ -164,6 +165,7 @@
serialized as ``<script src="..." />``, which completely confuses
browsers.
+
Working with links
==================
@@ -214,6 +216,7 @@
If you want access to the context of the link, you should use
``.iter_links()`` instead.
+
Functions
---------
@@ -228,6 +231,7 @@
new HTML as a string. If you pass in a document, the document will be
copied, the method performed, and the new document returned.
+
Cleaning up HTML
================
@@ -266,7 +270,7 @@
... </html>'''
To remove the all suspicious content from this unparsed document, use the
-``clean_html`` function.::
+``clean_html`` function::
>>> from lxml.html.clean import clean_html
@@ -336,6 +340,7 @@
See the docstring of ``Cleaner`` for the details of what can be
cleaned.
+
autolink
--------
@@ -351,7 +356,7 @@
Links in the elements ``<textarea>``, ``<pre>``, ``<code>``,
anything in the head of the document. You can pass in a list of
-elements to avoid in ``avoid_elements=['textarea', ...]```.
+elements to avoid in ``avoid_elements=['textarea', ...]``.
Links to some hosts can be avoided. By default links to
``localhost*``, ``example.*`` and ``127.0.0.1`` are not
@@ -364,6 +369,7 @@
The ``autolink_html()`` version of the function parses the HTML
string first, and returns a string.
+
wordwrap
--------
More information about the lxml-checkins
mailing list