summaryrefslogtreecommitdiff
path: root/www/p5-HTML-Parser
diff options
context:
space:
mode:
authorhe <he@pkgsrc.org>2004-12-05 18:38:58 +0000
committerhe <he@pkgsrc.org>2004-12-05 18:38:58 +0000
commitd9cbe8c612e6deefdb07ac837d27525b7f827c02 (patch)
treee55fd7e0987771d659630216811319982e8d5037 /www/p5-HTML-Parser
parent100e297b55bd9422a6807132fc81d34c08b3c98a (diff)
downloadpkgsrc-d9cbe8c612e6deefdb07ac837d27525b7f827c02.tar.gz
Update p5-HTML-Parser from version 3.35 to 3.42.
Change HOMEPAGE to author-independent link on search.cpan.org. Change log: 2004-12-04 Gisle Aas <gisle@ActiveState.com> Release 3.42 Avoid sv_catpvn_utf8_upgrade() as that macro was not available in perl-5.8.0. Patch by Reed Russell <Russell.Reed@acxiom.com>. Add casts to suppress compilation warnings for char/U8 mismatches. HTML::HeadParser will always push new header values. This make sure we never loose old header values. 2004-11-30 Gisle Aas <gisle@ActiveState.com> Release 3.41 Fix unresolved symbol error with perl-5.005. 2004-11-29 Gisle Aas <gisle@ActiveState.com> Release 3.40 Make utf8_mode only available on perl-5.8 or better. It produced garbage with older versions of perl. Emit warning if entities are decoded and something in the first chunk looks like hibit UTF-8. Previously this warning was only triggered for documents with BOM. 2004-11-23 Gisle Aas <gisle@ActiveState.com> Release 3.39_92 More documentation of the Unicode issues. Moved around HTML::Parser documentation a bit. New boolean option; $p->utf8_mode to allow parsing of raw UTF-8. Documented that HTML::Entities::decode_entities() can take multiple arguments. Unterminated entities are now decoded in text (compatibility with MSIE misfeature). Document HTML::Entities::_decode_entities(); this variation of the decode_entities() function has been available for a long time, but have not been documented until now. HTML::Entities::_decode_entities() can now be told to try to expand unterminated entities. Simplified Makefile.PL 2004-11-23 Gisle Aas <gisle@ActiveState.com> Release 3.39_91 The HTML::HeadParser will skip Unicode BOM. Previously it would consider the <head> section done when it saw the BOM. The parser will look for Unicode BOM and give appropriate warnings if the form found indicate trouble. If no matching end tag is found for <script>, <style>, <xmp> <title>, <textarea> then generate one where the next tag starts. For <script> and <style> recognize quoted strings and don't consider end element if the corresponding end tag is found inside such a string. 2004-11-17 Gisle Aas <gisle@ActiveState.com> Release 3.39_90 The <title> element is now parsed in literal mode, which means that other tags are not recognized until </title> has been seen. Unicode support for perl-5.8 and better. Decoding Unicode entities always enabled; no longer a compile time option. Propagation of UTF8 state on strings. Patch contributed by John Gardiner Myers <jgmyers@proofpoint.com>. Calculate offsets and lengths in chars for Unicode strings. Fixed link typo in the HTML::TokeParser documentation. 2004-11-11 Gisle Aas <gisle@ActiveState.com> Release 3.38 New boolean option; $p->closing_plaintext Contributed by Alex Kapranoff <alex@kapranoff.ru> 2004-11-10 Gisle Aas <gisle@ActiveState.com> Release 3.37 Improved handling of HTML encoded surrogate pairs and illegally endoded Unicode; <http://rt.cpan.org/Ticket/Display.html?id=7785>. Patch by John Gardiner Myers <jgmyers@proofpoint.com>. Avoid generating bad UTF8 strings when decoding entities representing chars beyond #255 in 8-bit strings. Such bad UTF8 sometimes made perl-5.8.5 and older segfault. Undocument v2 style subclassing in synopsis section. Internal cleanup: Make 'gcc -Wall' happier. Avoid modification of PVs during parsing of attrspec. Another patch by John Gardiner Myers. 2004-04-01 Gisle Aas <gisle@ActiveState.com> Release 3.36 Improved MSIE/Mozilla compatibility. If the same attribute name repeats for a start tag, use the first value instead of the last. Patch by Nick Duffek <html-parser@duffek.com>. <https://rt.cpan.org/Ticket/Display.html?id=5472>
Diffstat (limited to 'www/p5-HTML-Parser')
-rw-r--r--www/p5-HTML-Parser/Makefile6
-rw-r--r--www/p5-HTML-Parser/distinfo7
-rw-r--r--www/p5-HTML-Parser/patches/patch-aa13
3 files changed, 6 insertions, 20 deletions
diff --git a/www/p5-HTML-Parser/Makefile b/www/p5-HTML-Parser/Makefile
index 594d4c5fd48..930dc357830 100644
--- a/www/p5-HTML-Parser/Makefile
+++ b/www/p5-HTML-Parser/Makefile
@@ -1,14 +1,14 @@
-# $NetBSD: Makefile,v 1.26 2004/04/24 22:46:13 jlam Exp $
+# $NetBSD: Makefile,v 1.27 2004/12/05 18:38:58 he Exp $
#
-DISTNAME= HTML-Parser-3.35
+DISTNAME= HTML-Parser-3.42
PKGNAME= p5-${DISTNAME}
SVR4_PKGNAME= p5hpa
CATEGORIES= www perl5
MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=HTML/}
MAINTAINER= tech-pkg@NetBSD.org
-HOMEPAGE= http://search.cpan.org/author/GAAS/HTML-Parser/
+HOMEPAGE= http://search.cpan.org/dist/HTML-Parser/
COMMENT= Perl5 module to parse HTML text documents
DEPENDS+= p5-HTML-Tagset>=3.0:../../www/p5-HTML-Tagset
diff --git a/www/p5-HTML-Parser/distinfo b/www/p5-HTML-Parser/distinfo
index 6cd6ebd056f..e43f394ed18 100644
--- a/www/p5-HTML-Parser/distinfo
+++ b/www/p5-HTML-Parser/distinfo
@@ -1,5 +1,4 @@
-$NetBSD: distinfo,v 1.8 2004/01/20 13:41:19 abs Exp $
+$NetBSD: distinfo,v 1.9 2004/12/05 18:38:58 he Exp $
-SHA1 (HTML-Parser-3.35.tar.gz) = 2bdcb02bbd80d49d77d9ed086ff2659043758473
-Size (HTML-Parser-3.35.tar.gz) = 75266 bytes
-SHA1 (patch-aa) = 2db44b7ffb783264f0fd2db79449d1408745bcee
+SHA1 (HTML-Parser-3.42.tar.gz) = 2d540a74cbc6375c00e69986e6628b1da0e4a40b
+Size (HTML-Parser-3.42.tar.gz) = 81771 bytes
diff --git a/www/p5-HTML-Parser/patches/patch-aa b/www/p5-HTML-Parser/patches/patch-aa
deleted file mode 100644
index 9c3da1a50f3..00000000000
--- a/www/p5-HTML-Parser/patches/patch-aa
+++ /dev/null
@@ -1,13 +0,0 @@
-$NetBSD: patch-aa,v 1.2 2003/08/25 00:00:09 heinz Exp $
-
---- Makefile.PL.orig Fri Aug 15 17:32:56 2003
-+++ Makefile.PL
-@@ -19,7 +19,7 @@ the question below such entities will be
- in the Latin-1 range is decoded.
-
- EOT
-- my $ans = prompt("Do you want decoding on unicode entities?", "no");
-+ my $ans = "y"; #prompt("Do you want decoding on unicode entities?", "no");
- if ($ans =~ /^y(es)?$/i) {
- push(@define, "-DUNICODE_ENTITIES");
- }