path: root/www/p5-HTML-TableExtract
Age | Commit message | Author | Files | Lines
2006-05-08 | Update to 2.08: | gson | 2 | -6/+6
2.08 Wed May 3 17:17:33 EDT 2006
- Implemented new rasterizer for grid mapping. Thanks to Roland Schar for a tortuous example of span issues.
- Regular extraction and TREE mode are using the same rasterizer now.
- Fixed HTML stripping for a header matching bug on single word text in keep_html mode (thanks to Michael S. Muegel for pointing the bug out)

2.07 Sun Feb 19 13:40:44 EST 2006
- Fixed subtable slicing bug
- Fixed hrow() attachment bug
- Added tests
2005-10-25 | Update to 2.06: | wiz | 2 | -7/+8
2.06 Tue Oct 18 13:13:52 EDT 2005
- Tightened up element interactions in TREE() mode when examining rows, columns, cells, etc. Was running into trouble with dereferencing scalars vs objects.
- Documented space() H::TE::T method, added tests
- Added POD tests
- Documentation updates and fixes

2.05 Tue Oct 4 16:00:02 EDT 2005
- Fixed a TREE() definition bug and class method assignments
- Fixed a 'row above header' bug, added tests

2.04 Wed Aug 3 14:42:23 EDT 2005
- Fixed some conditional optional dependency tests in order to avoid failure assertions on some test boxes.

2.03 Wed Jul 20 12:45:56 EDT 2005
- Fixed greedy attribute bug (non qualifying tables were being selected under certain circumstances)
- Moved more completely to File::Spec operations in testload.pm in order to make windows boxes happy.

2.02 Thu Jun 23 12:42:44 EDT 2005
- squelched TREE() creation warnings for subclasses
- fixed a rows() bug involving keep_headers

2.01 Tue Jun 21 22:05:53 EDT 2005
- fixed some test changes

2.00 Fri Jun 17 17:28:10 EDT 2005
- Can now return parsed tables as HTML::TableElement objects within an HTML::Element tree structure (via HTML::TreeBuilder) for such purposes as in-line editing of table content within documents. Invoked via 'use HTML::TableExtract qw(tree);'.
- Added columns(), row(), column(), and cell() methods.
- Added some handy reporting methods: tables_report() and tables_dump(). These are almost always handy while first analyzing a new HTML document for table content.
- Debugging and error output can now be assigned to arbitrary file handles.
! Old 'table_state' methods are now merely 'table' methods, though the old table_state style is still supported.
! Chains have been dropped. Though interesting (think xpath), they needlessly complicated matters as they were nearly universally unused.
2005-08-06 | Bump the PKGREVISIONs of all (638) packages that hardcode the locations | jlam | 1 | -1/+2
of Perl files to deal with the perl-5.8.7 update that moved all pkgsrc-installed Perl files into the "vendor" directories.
2005-07-13 | Turn PERL5_PACKLIST into a relative path instead of an absolute path. | jlam | 1 | -2/+2
These paths are now relative to PERL5_PACKLIST_DIR, which currently defaults to ${PERL5_SITEARCH}. There is no change to the binary packages.
2005-04-11 | Remove USE_BUILDLINK3 and NO_BUILDLINK; these are no longer used. | tv | 1 | -2/+1
2005-03-24 | Update HTML-TableExtract to v1.10. v1.09 appears to have disappeared from | chris | 2 | -6/+6
cpan mirrors. Changes:

1.10 Mon Mar 7 10:45:54 EST 2005
- Minor fix to resets during calls to parse(). Resets must be requested manually.
- Docs update to reflect impending arrival of 2.x series.
2005-02-27 | Update to 1.09: | wiz | 2 | -7/+6
1.09 Fri Feb 25 17:49:00 EST 2005
- Tables can now be selected by table tag attributes
- lineage() method now returns row and column information, as well as depth and count, for each ancestor (potential backwards incompatibility: entries are now 4-element arrays rather than 2)
- header matching and column retention enhancements
- header retention
- old-style procedures deprecated in preparation for them to become methods
- various bug fixes
2005-02-24 | Add RMD160 checksums. | wiz | 1 | -1/+2
2004-12-20 | since perl is now built with threads on most platforms, the perl archlib | grant | 1 | -1/+2
module directory has changed (eg. "darwin-2level" vs. "darwin-thread-multi-2level"). binary packages of perl modules need to be distinguishable between being built against threaded perl and unthreaded perl, so bump the PKGREVISION of all perl module packages and introduce BUILDLINK_RECOMMENDED for perl as perl>=5.8.5nb5 so the correct dependencies are registered and the binary packages are distinct. addresses PR pkg/28619 from H. Todd Fujinaka.
2004-04-27 | Convert to buildlink3. | snj | 1 | -2/+2
2003-07-25 | Update to 1.08: | jmmv | 2 | -5/+5
1.08 Thu Apr 4 11:26:27 CST 2002
- Added some more crufty HTML tolerance -- not PC (puristically correct) but HTML correctness is probably of no interest to those merely trying to extract information *out* of HTML.
- Fixed a mapback problem with the legacy methods

1.07 Wed Aug 22 06:14:24 CDT 2001
- Added keep_html option for HTML retention
- bug fix for depth/count targets
2003-07-22 | COMMENT should start with a capital letter. | martti | 1 | -2/+2
2003-07-17 | s/netbsd.org/NetBSD.org/ | grant | 1 | -2/+2
2002-10-20 | Use buildlink2. Use perl5/module.mk. | seb | 1 | -6/+3
2001-11-26 | Buildlinkify, in the sense that only the perl headers are found in | jlam | 1 | -2/+3
${PREFIX} -- everything else is picked up from ${BUILDLINK_DIR}.
2001-11-01 | Move pkg/ files into package's toplevel directory | zuntum | 3 | -1/+1
2001-10-30 | Initial Version | damon | 4 | -0/+31