summaryrefslogtreecommitdiff
path: root/textproc/py-lxml
AgeCommit message (Collapse)AuthorFilesLines
2011-04-22recursive bump from gettext-lib shlib bump.obache1-1/+2
2010-05-02Update to 2.2.4:wiz2-6/+6
2.2.6 (2010-03-02) ================== Bugs fixed ---------- * Fixed several Python 3 regressions by building with Cython 0.11.3. 2.2.5 (2010-02-28) ================== Features added -------------- * Support for running XSLT extension elements on the input root node (e.g. in a template matching on "/"). Bugs fixed ---------- * Crash in XPath evaluation when reading smart strings from a document other than the original context document. * Support recent versions of html5lib by not requiring its ``XHTMLParser`` in ``htmlparser.py`` anymore. * Manually instantiating the custom element classes in ``lxml.objectify`` could crash. * Invalid XML text characters were not rejected by the API when they appeared in unicode strings directly after non-ASCII characters. * lxml.html.open_http_urllib() did not work in Python 3. * The functions ``strip_tags()`` and ``strip_elements()`` in ``lxml.etree`` did not remove all occurrences of a tag in all cases. * Crash in XSLT extension elements when the XSLT context node is not an element.
2009-11-17Update to 2.2.4:wiz2-6/+6
2.2.4 (2009-11-11) ================== Bugs fixed ---------- * Static build of libxml2/libxslt was broken.
2009-11-01Update to py-lxml-2.2.3:joerg2-6/+8
* The resolve_entities option did not work in the incremental feed parser. * Looking up and deleting attributes without a namespace could hit a namespaced attribute of the same name instead. * Late errors during calls to SubElement() (e.g. attribute related ones) could leave a partially initialised element in the tree. * Modifying trees that contain parsed entity references could result in an infinite loop. * ObjectifiedElement.__setattr__ created an empty-string child element when the attribute value was rejected as a non-unicode/ non-ascii string * Syntax errors in lxml.cssselect could result in misleading error messages. * Invalid syntax in CSS expressions could lead to an infinite loop in the parser of lxml.cssselect. * CSS special character escapes were not properly handled in lxml.cssselect. * CSS Unicode escapes were not properly decoded in lxml.cssselect. * Select options in HTML forms that had no explicit value attribute were not handled correctly. The HTML standard dictates that their value is defined by their text content. This is now supported by lxml.html. * XPath raised a TypeError when finding CDATA sections. This is now fully supported. * Calling help(lxml.objectify) didn't work at the prompt. * The ElementMaker in lxml.objectify no longer defines the default namespaces when annotation is disabled. * Feed parser failed to honout the 'recover' option on parse errors. * Diverting the error logging to Python's logging system was broken. Require setuptools-0.6c11 as it fails at least with newer Python 2.6 otherwise.
2009-08-09Update to 2.2.2:wiz3-8/+13
2.2.2 (2009-06-21) ================== Features added -------------- * New helper functions ``strip_attributes()``, ``strip_elements()``, ``strip_tags()`` in lxml.etree to remove attributes/subtrees/tags from a subtree. Bugs fixed ---------- * Namespace cleanup on subtree insertions could result in missing namespace declarations (and potentially crashes) if the element defining a namespace was deleted and the namespace was not used by the top element of the inserted subtree but only in deeper subtrees. * Raising an exception from a parser target callback didn't always terminate the parser. * Only {true, false, 1, 0} are accepted as the lexical representation for BoolElement ({True, False, T, F, t, f} not any more), restoring lxml <= 2.0 behaviour. 2.2.1 (2009-06-02) ================== Features added -------------- * Injecting default attributes into a document during XML Schema validation (also at parse time). * Pass ``huge_tree`` parser option to disable parser security restrictions imposed by libxml2 2.7. Bugs fixed ---------- * The script for statically building libxml2 and libxslt didn't work in Py3. * ``XMLSchema()`` also passes invalid schema documents on to libxml2 for parsing (which could lead to a crash before release 2.6.24). 2.2 (2009-03-21) ================ Features added -------------- * Support for ``standalone`` flag in XML declaration through ``tree.docinfo.standalone`` and by passing ``standalone=True/False`` on serialisation. Bugs fixed ---------- * Crash when parsing an XML Schema with external imports from a filename. 2.2beta4 (2009-02-27) ===================== Features added -------------- * Support strings and instantiable Element classes as child arguments to the constructor of custom Element classes. * GZip compression support for serialisation to files and file-like objects. Bugs fixed ---------- * Deep-copying an ElementTree copied neither its sibling PIs and comments nor its internal/external DTD subsets. * Soupparser failed on broken attributes without values. * Crash in XSLT when overwriting an already defined attribute using ``xsl:attribute``. * Crash bug in exception handling code under Python 3. This was due to a problem in Cython, not lxml itself. * ``lxml.html.FormElement._name()`` failed for non top-level forms. * ``TAG`` special attribute in constructor of custom Element classes was evaluated incorrectly. Other changes ------------- * Official support for Python 3.0.1. * ``Element.findtext()`` now returns an empty string instead of None for Elements without text content. 2.2beta3 (2009-02-17) ===================== Features added -------------- * ``XSLT.strparam()`` class method to wrap quoted string parameters that require escaping. Bugs fixed ---------- * Memory leak in XPath evaluators. * Crash when parsing indented XML in one thread and merging it with other documents parsed in another thread. * Setting the ``base`` attribute in ``lxml.objectify`` from a unicode string failed. * Fixes following changes in Python 3.0.1. * Minor fixes for Python 3. Other changes ------------- * The global error log (which is copied into the exception log) is now local to a thread, which fixes some race conditions. * More robust error handling on serialisation. 2.2beta2 (2009-01-25) ===================== Bugs fixed ---------- * Potential memory leak on exception handling. This was due to a problem in Cython, not lxml itself. * ``iter_links`` (and related link-rewriting functions) in ``lxml.html`` would interpret CSS like ``url("link")`` incorrectly (treating the quotation marks as part of the link). * Failing import on systems that have an ``io`` module. 2.1.5 (2009-01-06) ================== Bugs fixed ---------- * Potential memory leak on exception handling. This was due to a problem in Cython, not lxml itself. * Failing import on systems that have an ``io`` module. 2.2beta1 (2008-12-12) ===================== Features added -------------- * Allow ``lxml.html.diff.htmldiff`` to accept Element objects, not just HTML strings. Bugs fixed ---------- * Crash when using an XPath evaluator in multiple threads. * Fixed missing whitespace before ``Link:...`` in ``lxml.html.diff``. Other changes ------------- * Export ``lxml.html.parse``. 2.1.4 (2008-12-12) ================== Bugs fixed ---------- * Crash when using an XPath evaluator in multiple threads. 2.0.11 (2008-12-12) =================== Bugs fixed ---------- * Crash when using an XPath evaluator in multiple threads. 2.2alpha1 (2008-11-23) ====================== Features added -------------- * Support for XSLT result tree fragments in XPath/XSLT extension functions. * QName objects have new properties ``namespace`` and ``localname``. * New options for exclusive C14N and C14N without comments. * Instantiating a custom Element classes creates a new Element. Bugs fixed ---------- * XSLT didn't inherit the parse options of the input document. * 0-bytes could slip through the API when used inside of Unicode strings. * With ``lxml.html.clean.autolink``, links with balanced parenthesis, that end in a parenthesis, will be linked in their entirety (typical with Wikipedia links).
2009-06-14Remove @dirrm entries from PLISTsjoerg1-4/+1
2009-02-17Fix native_libs.txt issue. Bump revision.joerg2-4/+3
2009-02-16Use Python Egg framework. DESTDIR support. Bump revision.joerg2-4/+13
2008-11-18Initial import of py-lxml:wiz4-0/+99
lxml is a Pythonic binding for the libxml2 and libxslt libraries. It is unique in that it combines the speed and feature completeness of these libraries with the simplicity of a native Python API, mostly compatible but superior to the well-known ElementTree API.