summaryrefslogtreecommitdiff
path: root/www/p5-HTML-Parser
diff options
context:
space:
mode:
authorheinz <heinz@pkgsrc.org>2003-08-25 00:00:08 +0000
committerheinz <heinz@pkgsrc.org>2003-08-25 00:00:08 +0000
commit1841c35e67321a1ad5ea960c92715f003bc799d0 (patch)
tree322f08a6d32de978056400fe028bd76438158240 /www/p5-HTML-Parser
parent8444d93977ea92e7e522edac3d36dd4ae9f21543 (diff)
downloadpkgsrc-1841c35e67321a1ad5ea960c92715f003bc799d0.tar.gz
Update to 3.31.
Better compatibility with Mozilla/MSIE behaviour. ==== Changes since 3.27 ==== 2003-08-19 Gisle Aas <gisle@ActiveState.com> Release 3.31 The -DDEBUGGING fix in 3.30 was not really there :-( 2003-08-17 Gisle Aas <gisle@ActiveState.com> Release 3.30 The previous release failed to compile on a -DDEBUGGING perl like the one provided by Redhat 9. Got rid of references to perl-5.7. Further fixes to avoid warnings from Visual C. Patch by Steve Hay <steve.hay@uk.radan.com>. 2003-08-14 Gisle Aas <gisle@ActiveState.com> Release 3.29 Setting xml_mode now implies strict_names also for end tags. Avoid warning from Visual C. Patch by <gsar@activestate.com>. 64-bit fix from Doug Larrick <doug@ties.org> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=195500 Try to parse similar to Mozilla/MSIE in certain edge cases. All these are outside of the official definition of HTML but HTML spam often tries to take advantage of these. - New configuration attribute 'strict_end'. Unless enabled we will allow end tags to contain extra words or stuff that look like attributes before the '>'. This means that tags like these: </foo foo="<ignored>"> </foo ignored> </foo ">" ignored> are now all parsed as a 'foo' end tag instead of text. Even if the extra stuff looks like attributes they will not be reported if requested via the 'attr' or 'tokens' argspecs for the 'end' handler. - Parse '</:comment>' and '</ comment>' as comments unless strict_comment is enabled. Previous versions of the parser would report these as text. If these comments contain quoted words prefixed by space or '=' these words can contain '>' without terminating the comment. - Parse '<! "<>" foo>' as comment containing ' "<>" foo'. Previous versions of the parser would terminate the comment at the first '>' and report the rest as text. - Legacy comment mode: Parse with comments terminated with a lone '>' if no '-->' is found before eof. - Incomplete tag at eof is reported as a 'comment' instead of 'text' unless strict_comment is enabled. 2003-04-16 Gisle Aas <gisle@ActiveState.com> Release 3.28 When 'strict_comment' is off (which it is by default) treat anything that matches <!...> a comment. Should now be more efficient on threaded perls.
Diffstat (limited to 'www/p5-HTML-Parser')
-rw-r--r--www/p5-HTML-Parser/Makefile5
-rw-r--r--www/p5-HTML-Parser/distinfo8
-rw-r--r--www/p5-HTML-Parser/patches/patch-aa10
3 files changed, 12 insertions, 11 deletions
diff --git a/www/p5-HTML-Parser/Makefile b/www/p5-HTML-Parser/Makefile
index 758c290ce28..11e6141c7eb 100644
--- a/www/p5-HTML-Parser/Makefile
+++ b/www/p5-HTML-Parser/Makefile
@@ -1,13 +1,14 @@
-# $NetBSD: Makefile,v 1.21 2003/07/22 04:14:31 martti Exp $
+# $NetBSD: Makefile,v 1.22 2003/08/25 00:00:08 heinz Exp $
#
-DISTNAME= HTML-Parser-3.27
+DISTNAME= HTML-Parser-3.31
PKGNAME= p5-${DISTNAME}
SVR4_PKGNAME= p5hpa
CATEGORIES= www perl5
MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=HTML/}
MAINTAINER= jlam@NetBSD.org
+HOMEPAGE= http://search.cpan.org/author/GAAS/HTML-Parser/
COMMENT= Perl5 module to parse HTML text documents
DEPENDS+= p5-HTML-Tagset>=3.0:../../www/p5-HTML-Tagset
diff --git a/www/p5-HTML-Parser/distinfo b/www/p5-HTML-Parser/distinfo
index 7908049ae71..d07189d2f00 100644
--- a/www/p5-HTML-Parser/distinfo
+++ b/www/p5-HTML-Parser/distinfo
@@ -1,5 +1,5 @@
-$NetBSD: distinfo,v 1.5 2003/04/12 15:40:38 martti Exp $
+$NetBSD: distinfo,v 1.6 2003/08/25 00:00:09 heinz Exp $
-SHA1 (HTML-Parser-3.27.tar.gz) = 6b7ee2266c93377b930910b0098d5fae5b305ee6
-Size (HTML-Parser-3.27.tar.gz) = 70891 bytes
-SHA1 (patch-aa) = 6c3aecb398c078f9823c4f4ef0da34fedc84b6c1
+SHA1 (HTML-Parser-3.31.tar.gz) = 85a88d3179e11e90dfb54b33a24b1e59fd161b57
+Size (HTML-Parser-3.31.tar.gz) = 73132 bytes
+SHA1 (patch-aa) = 2db44b7ffb783264f0fd2db79449d1408745bcee
diff --git a/www/p5-HTML-Parser/patches/patch-aa b/www/p5-HTML-Parser/patches/patch-aa
index 0cc13c3e668..9c3da1a50f3 100644
--- a/www/p5-HTML-Parser/patches/patch-aa
+++ b/www/p5-HTML-Parser/patches/patch-aa
@@ -1,9 +1,9 @@
-$NetBSD: patch-aa,v 1.1 2000/10/15 02:19:39 jlam Exp $
+$NetBSD: patch-aa,v 1.2 2003/08/25 00:00:09 heinz Exp $
---- Makefile.PL.orig Sat Sep 16 21:40:09 2000
-+++ Makefile.PL Sun Sep 24 19:40:27 2000
-@@ -19,7 +19,7 @@
- only entities in the Latin-1 range is decoded.
+--- Makefile.PL.orig Fri Aug 15 17:32:56 2003
++++ Makefile.PL
+@@ -19,7 +19,7 @@ the question below such entities will be
+ in the Latin-1 range is decoded.
EOT
- my $ans = prompt("Do you want decoding on unicode entities?", "no");