diff options
author | jlam <jlam@pkgsrc.org> | 2001-10-01 01:58:26 +0000 |
---|---|---|
committer | jlam <jlam@pkgsrc.org> | 2001-10-01 01:58:26 +0000 |
commit | d6e0950c71393e5f46f8888bb45a04c3b53fb9d9 (patch) | |
tree | 2bd8fb80f1d0df06901bb5e271264e0aae7aa970 /www/p5-HTML-SimpleParse | |
parent | 5d68119a1f6d9df32896008a45bdd0c8652efa89 (diff) | |
download | pkgsrc-d6e0950c71393e5f46f8888bb45a04c3b53fb9d9.tar.gz |
www/p5-HTML-Parser:
This module is a bare-bones HTML parser. It is similar in concept to
HTML::Parser, but it differs in a couple of important ways.
First, HTML::SimpleParse just finds tags and text in the HTML you give it;
it does not care about the specific content of these tags (though it does
distinguish between different _types_ of tags, such as comments, starting
tags like <b>, ending tags like </b>, and so on).
Second, HTML::SimpleParse does not create a hierarchical tree of HTML
content, but rather a simple linear list. It does not pay any attention to
balancing start tags with corresponding end tags, or which pairs of tags
are inside other pairs of tags.
Because of these characteristics, you can make a very effective HTML filter
by sub-classing HTML::SimpleParse.
Diffstat (limited to 'www/p5-HTML-SimpleParse')
-rw-r--r-- | www/p5-HTML-SimpleParse/Makefile | 17 | ||||
-rw-r--r-- | www/p5-HTML-SimpleParse/distinfo | 4 | ||||
-rw-r--r-- | www/p5-HTML-SimpleParse/pkg/DESCR | 15 | ||||
-rw-r--r-- | www/p5-HTML-SimpleParse/pkg/PLIST | 1 |
4 files changed, 37 insertions, 0 deletions
diff --git a/www/p5-HTML-SimpleParse/Makefile b/www/p5-HTML-SimpleParse/Makefile new file mode 100644 index 00000000000..1961b1c03ec --- /dev/null +++ b/www/p5-HTML-SimpleParse/Makefile @@ -0,0 +1,17 @@ +# $NetBSD: Makefile,v 1.1.1.1 2001/10/01 01:58:26 jlam Exp $ + +DISTNAME= HTML-SimpleParse-0.10 +PKGNAME= p5-${DISTNAME} +CATEGORIES= www perl5 +MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=HTML/} + +MAINTAINER= packages@netbsd.org +COMMENT= perl5 module implementing a bare-bones HTML parser + +USE_PERL5= YES +PERL5_PACKLIST= ${PERL5_SITEARCH}/auto/HTML/SimpleParse/.packlist + +do-configure: + @cd ${WRKSRC}; ${SETENV} ${MAKE_ENV} ${PERL5} Makefile.PL + +.include "../../mk/bsd.pkg.mk" diff --git a/www/p5-HTML-SimpleParse/distinfo b/www/p5-HTML-SimpleParse/distinfo new file mode 100644 index 00000000000..79c516b1289 --- /dev/null +++ b/www/p5-HTML-SimpleParse/distinfo @@ -0,0 +1,4 @@ +$NetBSD: distinfo,v 1.1.1.1 2001/10/01 01:58:26 jlam Exp $ + +SHA1 (HTML-SimpleParse-0.10.tar.gz) = 2b12b7a45b8bd01f952ca78e08b2364cb986b681 +Size (HTML-SimpleParse-0.10.tar.gz) = 8162 bytes diff --git a/www/p5-HTML-SimpleParse/pkg/DESCR b/www/p5-HTML-SimpleParse/pkg/DESCR new file mode 100644 index 00000000000..a5db83b581c --- /dev/null +++ b/www/p5-HTML-SimpleParse/pkg/DESCR @@ -0,0 +1,15 @@ +This module is a bare-bones HTML parser. It is similar in concept to +HTML::Parser, but it differs in a couple of important ways. + +First, HTML::SimpleParse just finds tags and text in the HTML you give it; +it does not care about the specific content of these tags (though it does +distinguish between different _types_ of tags, such as comments, starting +tags like <b>, ending tags like </b>, and so on). + +Second, HTML::SimpleParse does not create a hierarchical tree of HTML +content, but rather a simple linear list. It does not pay any attention to +balancing start tags with corresponding end tags, or which pairs of tags +are inside other pairs of tags. + +Because of these characteristics, you can make a very effective HTML filter +by sub-classing HTML::SimpleParse. diff --git a/www/p5-HTML-SimpleParse/pkg/PLIST b/www/p5-HTML-SimpleParse/pkg/PLIST new file mode 100644 index 00000000000..f13f4460359 --- /dev/null +++ b/www/p5-HTML-SimpleParse/pkg/PLIST @@ -0,0 +1 @@ +@comment $NetBSD: PLIST,v 1.1.1.1 2001/10/01 01:58:26 jlam Exp $ |