From ed70512c28c36ce9c6779bd9b9c7db72740a3a85 Mon Sep 17 00:00:00 2001 From: heinz Date: Mon, 25 Aug 2003 00:00:08 +0000 Subject: Update to 3.31. Better compatibility with Mozilla/MSIE behaviour. ==== Changes since 3.27 ==== 2003-08-19 Gisle Aas Release 3.31 The -DDEBUGGING fix in 3.30 was not really there :-( 2003-08-17 Gisle Aas Release 3.30 The previous release failed to compile on a -DDEBUGGING perl like the one provided by Redhat 9. Got rid of references to perl-5.7. Further fixes to avoid warnings from Visual C. Patch by Steve Hay . 2003-08-14 Gisle Aas Release 3.29 Setting xml_mode now implies strict_names also for end tags. Avoid warning from Visual C. Patch by . 64-bit fix from Doug Larrick http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=195500 Try to parse similar to Mozilla/MSIE in certain edge cases. All these are outside of the official definition of HTML but HTML spam often tries to take advantage of these. - New configuration attribute 'strict_end'. Unless enabled we will allow end tags to contain extra words or stuff that look like attributes before the '>'. This means that tags like these: " ignored> are now all parsed as a 'foo' end tag instead of text. Even if the extra stuff looks like attributes they will not be reported if requested via the 'attr' or 'tokens' argspecs for the 'end' handler. - Parse '' and '' as comments unless strict_comment is enabled. Previous versions of the parser would report these as text. If these comments contain quoted words prefixed by space or '=' these words can contain '>' without terminating the comment. - Parse '" foo>' as comment containing ' "<>" foo'. Previous versions of the parser would terminate the comment at the first '>' and report the rest as text. - Legacy comment mode: Parse with comments terminated with a lone '>' if no '-->' is found before eof. - Incomplete tag at eof is reported as a 'comment' instead of 'text' unless strict_comment is enabled. 2003-04-16 Gisle Aas Release 3.28 When 'strict_comment' is off (which it is by default) treat anything that matches a comment. Should now be more efficient on threaded perls. --- www/p5-HTML-Parser/Makefile | 5 +++-- www/p5-HTML-Parser/distinfo | 8 ++++---- www/p5-HTML-Parser/patches/patch-aa | 10 +++++----- 3 files changed, 12 insertions(+), 11 deletions(-) (limited to 'www/p5-HTML-Parser') diff --git a/www/p5-HTML-Parser/Makefile b/www/p5-HTML-Parser/Makefile index 758c290ce28..11e6141c7eb 100644 --- a/www/p5-HTML-Parser/Makefile +++ b/www/p5-HTML-Parser/Makefile @@ -1,13 +1,14 @@ -# $NetBSD: Makefile,v 1.21 2003/07/22 04:14:31 martti Exp $ +# $NetBSD: Makefile,v 1.22 2003/08/25 00:00:08 heinz Exp $ # -DISTNAME= HTML-Parser-3.27 +DISTNAME= HTML-Parser-3.31 PKGNAME= p5-${DISTNAME} SVR4_PKGNAME= p5hpa CATEGORIES= www perl5 MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=HTML/} MAINTAINER= jlam@NetBSD.org +HOMEPAGE= http://search.cpan.org/author/GAAS/HTML-Parser/ COMMENT= Perl5 module to parse HTML text documents DEPENDS+= p5-HTML-Tagset>=3.0:../../www/p5-HTML-Tagset diff --git a/www/p5-HTML-Parser/distinfo b/www/p5-HTML-Parser/distinfo index 7908049ae71..d07189d2f00 100644 --- a/www/p5-HTML-Parser/distinfo +++ b/www/p5-HTML-Parser/distinfo @@ -1,5 +1,5 @@ -$NetBSD: distinfo,v 1.5 2003/04/12 15:40:38 martti Exp $ +$NetBSD: distinfo,v 1.6 2003/08/25 00:00:09 heinz Exp $ -SHA1 (HTML-Parser-3.27.tar.gz) = 6b7ee2266c93377b930910b0098d5fae5b305ee6 -Size (HTML-Parser-3.27.tar.gz) = 70891 bytes -SHA1 (patch-aa) = 6c3aecb398c078f9823c4f4ef0da34fedc84b6c1 +SHA1 (HTML-Parser-3.31.tar.gz) = 85a88d3179e11e90dfb54b33a24b1e59fd161b57 +Size (HTML-Parser-3.31.tar.gz) = 73132 bytes +SHA1 (patch-aa) = 2db44b7ffb783264f0fd2db79449d1408745bcee diff --git a/www/p5-HTML-Parser/patches/patch-aa b/www/p5-HTML-Parser/patches/patch-aa index 0cc13c3e668..9c3da1a50f3 100644 --- a/www/p5-HTML-Parser/patches/patch-aa +++ b/www/p5-HTML-Parser/patches/patch-aa @@ -1,9 +1,9 @@ -$NetBSD: patch-aa,v 1.1 2000/10/15 02:19:39 jlam Exp $ +$NetBSD: patch-aa,v 1.2 2003/08/25 00:00:09 heinz Exp $ ---- Makefile.PL.orig Sat Sep 16 21:40:09 2000 -+++ Makefile.PL Sun Sep 24 19:40:27 2000 -@@ -19,7 +19,7 @@ - only entities in the Latin-1 range is decoded. +--- Makefile.PL.orig Fri Aug 15 17:32:56 2003 ++++ Makefile.PL +@@ -19,7 +19,7 @@ the question below such entities will be + in the Latin-1 range is decoded. EOT - my $ans = prompt("Do you want decoding on unicode entities?", "no"); -- cgit v1.2.3