summaryrefslogtreecommitdiff
path: root/textproc
AgeCommit message (Collapse)AuthorFilesLines
2006-04-17Add p5-Text-Context-EitherSide.minskim1-1/+2
2006-04-17Import p5-Text-Context-EitherSide from pkgsrc-wip. Packaged by Roman Kulik.minskim4-0/+42
Suppose you have a large piece of text - typically, say, a web page or a mail message. And now suppose you've done some kind of full-text search on that text for a bunch of keywords, and you want to display the context in which you found the keywords inside the body of the text. A simple-minded way to do that would be just to get the two words either side of each keyword. But hey, don't be too simple minded, because you've got to make sure that the list doesn't overlap. If you have the quick brown fox jumped over the lazy dog and you extract two words either side of ``fox'', ``jumped'' and ``dog'', you really don't want to end up with quick brown fox jumped over brown fox jumped over the the lazy dog so you need a small amount of smarts. This module has a small amount of smarts.
2006-04-17Replace hardcoded /usr/local/include with ${BUILDLINK_PREFIX.iconv}/includeminskim1-2/+3
to avoid picking up a wrong iconv library. Patch provided by Bel Woolley in private email.
2006-04-17Bump BUILDLINK_ABI_DEPENDS.png and PKGREVISION for png-1.2.9nb2 update.wiz10-19/+20
2006-04-17Remove share/html2text directory on uninstall.schmonz1-1/+2
2006-04-17Strip ${PKGLOCALEDIR} from PLISTs of packages that already obeyjlam10-588/+588
PKGLOCALEDIR and which install their locale files directly under ${PREFIX}/${PKGLOCALEDIR} and sort the PLIST file entries. From now on, pkgsrc/mk/plist/plist-locale.awk will automatically handle transforming the PLIST to refer to the correct locale directory.
2006-04-15Add p5-XML-Clean.minskim1-1/+2
2006-04-15Import p5-XML-Clean from pkgsrc-wip. Packaged by Romak Kulik.minskim4-0/+31
The ultimate quest of this module is to produce from non-XML text text, that will will most probably pass through any XML parser one could find. Basic cleaning is just XML tag matching (for every opening tag there will be closing tag as well, and they will form a tree structure). When you add some extra parameters, you will receive complete XML text, including XML head and root element (if none were defined in text, then some will be added).
2006-04-14Added appropriate TEST_TARGET.heinz1-1/+3
2006-04-14Add p5-Lingua-Stem-Snowball.minskim1-1/+2
2006-04-14Import p5-Lingua-Stem-Snowball from pkgsrc-wip. Packaged by Roman Kulik.minskim4-0/+27
This module provides unified perl interface to Snowball stemmers (http://snowball.tartarus.org) and virtually supports various languages. It is written using C for high performance and provides OO and plain interfaces.
2006-04-13Add p5-Encode.minskim1-1/+2
2006-04-13Import p5-Encode from pkgsrc-wip. Packaged by Roman Kulik.minskim4-0/+28
The "Encode" module provides the interfaces between Perl's strings and the rest of the system. Perl strings are sequences of characters.
2006-04-13BUILD_USE_MSGFMT and USE_MSGFMT_PLURALS are obsolete. Replace withjlam5-15/+10
USE_TOOLS+=msgfmt.
2006-04-13Sorted PLIST.rillig1-126/+126
2006-04-12The documentation is installed into share/doc/sablotron instead ofrillig2-215/+216
share/doc/html. Bumped PKGREVISION.
2006-04-12The documentation files are installed into share/doc instead ofrillig4-106/+97
share/doc/html. Bumped PKGREVISION.
2006-04-12Aligned the last line of the buildlink3.mk files with the first line, sorillig32-64/+64
that they look nicer.
2006-04-11Update to 1.2.1, provided by OBATA Akio in PR 33239:wiz3-13/+17
2006-04-10 Mikio Hirabayashi * estraier.c (est_db_put_doc): dispensable trailer was purged from record. * estraier.c (est_db_add_attr_index): new function. * estraier.c (est_aidx_seq_put, est_aidx_seq_out, est_aidx_seq_narrow): new functions. * estraier.c (est_aidx_attr_put, est_aidx_attr_out, est_aidx_attr_narrow): new functions. * estnode.c (est_url_shuttle): "limsize" parameter was added. * mastermod.c (nmgr_clear): new function. * estcmd.c (runcreate, proccreate): new functions. - The Java binding and the Ruby binding was to support attribute indexes. - The Java binding and the Ruby binding was to support feedback of scores. - Release: 1.2.1
2006-04-09Update to 4.0. From the changelog:schmonz2-8/+10
- Requires XML::Elemental 2.0 for SAX-based parsing. - Switched internal storage of element and attribute names to Clarkian notation. - Too many bug fixes and minor enhancements to list. - Added atom, itunes, openSerch, feedburner and creativeCommons namespaces to default set. - Moved XPath namespace and prefixes storage from Element to the Parser module. - Added register_ns_prefix, prefix and namespace methods to parser for OO access to the internal hashes. - Introduced use of Class::ErroHandler for more quieter error handling. - We have tests. - Introduction of Util module. Moved as_xml and encode_xml methods from Element to Util.
2006-04-08Remove work directory references in the ndtpdaily script.jlam4-3/+35
2006-04-08Update rubber to 1.1tonio4-38/+12
Remove patch-ab, as it has been applied upstream (no dependency over gmake) Version 1.1 (2006-03-17) * Features - Support for user-defined conversion rules. - Support for xindy and sam2p. - Improved fig2dev support. - Experimental cache system. * Bugfixes - Better parsing of log files. - Improved error reporting (including for BibTeX and Metapost). - Better handling of verbatim environments. - Many other fixes.
2006-04-08Add missing file to the PLIST when using xemacs. Bump the PKGREVISIONjlam2-4/+5
to 3.
2006-04-08* List info files directly in the PLIST and honor PKG{INFO,MAN}DIR.jlam2-6/+16
* Fix PLIST by adding man page entries. Bump the PKGREVISION to 3.
2006-04-07List info files directly in the PLIST and honor PKG{INFO,MAN}DIR.jlam15-25/+31
2006-04-07* List info files directly in the PLIST, and install them intojlam2-12/+16
${EMACS_INFOPREFIX}, which is correct for both emacs and xemacs. * Fix PLIST to be correct on xemacs. * Depend on xemacs-packages when building with xemacs. Bump the PKGREVISION.
2006-04-07This package installs locale files that conflict with grep.jlam1-1/+3
2006-04-07Link against -liconv when using the pkgsrc iconv implementation. Wejlam1-1/+2
need to force this to happen since the configure script inexplicably just assumes that iconv() is available without needing to check for any libraries or anything silly like that.
2006-04-07Properly locate -liconv when linking ebinfo in the case where we usejlam3-5/+27
the pkgsrc libiconv implementation.
2006-04-07Remove patch made unnecessary by makeinfo replacement script.jlam2-15/+1
2006-04-07Use teTeX/module.mk to run mktexlsr for texmf-local only.minskim4-6/+25
2006-04-07Use texmf-dirs to install into texmf-local. Bump PKGREVISION.minskim2-4/+6
2006-04-06Over 1200 files touched but no revisions bumped :)reed46-121/+121
RECOMMENDED is removed. It becomes ABI_DEPENDS. BUILDLINK_RECOMMENDED.foo becomes BUILDLINK_ABI_DEPENDS.foo. BUILDLINK_DEPENDS.foo becomes BUILDLINK_API_DEPENDS.foo. BUILDLINK_DEPENDS does not change. IGNORE_RECOMMENDED (which defaulted to "no") becomes USE_ABI_DEPENDS which defaults to "yes". Added to obsolete.mk checking for IGNORE_RECOMMENDED. I did not manually go through and fix any aesthetic tab/spacing issues. I have tested the above patch on DragonFly building and packaging subversion and pkglint and their many dependencies. I have also tested USE_ABI_DEPENDS=no on my NetBSD workstation (where I have used IGNORE_RECOMMENDED for a long time). I have been an active user of IGNORE_RECOMMENDED since it was available. As suggested, I removed the documentation sentences suggesting bumping for "security" issues. As discussed on tech-pkg. I will commit to revbump, pkglint, pkg_install, createbuildlink separately. Note that if you use wip, it will fail! I will commit to pkgsrc-wip later (within day).
2006-04-02Update to 1.1.6, provided by OBATA Akio in PR 33168:wiz3-12/+22
From ChangeLog: 2006-03-12 Mikio Hirabayashi * estraier.c (est_weight_keywords): weighting algorithm was enhanced. * estraier.c (est_db_set_wildmax): new function. * estraier.c (est_narrow_scores): disjunction of attribute conditions was added. * estraier.c (est_doc_dup, est_cond_dup): new functions. * estcmd.c (rungather, doputfile): default value of "-lf" options was changed. * estseek.c (realmain): "wildmax" parameter was added. * estmaster.c (main): "wildmax" parameter was added. - A bug about the skip parameter in the pure Java API was fixed. - The Java binding and the Ruby binding was to support similarity eclipse. - Release: 1.1.6 pkgsrc changes: From 1.1.3, compression with LZO and BZIP2 was added and auto detected from configure, so add those libraries. Also add libraries used directly.
2006-04-01Require a docbook-xml package that has correct rewriteSystem entries in itsjmmv1-3/+3
catalog files so that registering some documents (e.g., those from totem) works correctly. Bump PKGREVISION to 10.
2006-04-01Add appropriate rewriteSystem entries to the catalog so that system-stylejmmv1-2/+6
queries (URLs) work. This removes several warnings when, e.g., registering some documents in the scrollkeeper database. Bump PKGREVISION to 4.
2006-03-31List the info pages directly in the PLIST and ensure that we honorjlam2-4/+5
PKGINFODIR.
2006-03-31Register the installed documentation in the scrollkeeper database.jmmv2-3/+13
This is to add an explicit dependency on this utility so that gtk-doc does not install unmanaged files when scrollkeeper is already installed. Bump PKGREVISION to 2.
2006-03-30Update to 0.50:jmmv4-17/+117
0.50 Alastair McKinstry <mckinstry@debian.org> [ Alastair McKinstry ] * iso_639: - Added new code: srn Sranan Tongo - Move to DH_COMPAT=5 - Split iso_639_3 into a separate domain, to be kind to translators. * iso_4217: - de.po: proof-read patch by Christian Stimming. * Updated debian/copyright to point to current source of iso-codes, namely the CVS repository. [ Christian Perrier ] * iso_639: - tt.po updated by Albert Fazlí - da.po updated by Claus Hindsgaul - Added new codes (see http://www.loc.gov/standards/iso639-2/codechanges.html) - Karelian - Frisian renamed to Western Frisian - Eastern Frisian - Northern Frisian - Angika - Swiss German - Aromanian - update ISO 639-3 to the 20050910 version of SIL * iso_3166: - tt.po added by Albert Fazlí - bn.po added by Progga - et.po updated by Hasso Tepper - pa_IN.po added by Amanpreet Singh Alam - irrelevant pa.po removed [ Tobias Toedter ] * Removed double space from package description. Closes: #338190 * Removed watch file. There's currently no real homepage for the tarball. * iso_4217: - ru.po updated by Yuri Kozlov. Closes: #346189 [ Konstantinos Margaritis] * Updated iso_3166/el.po [ Claus Hindsgaul ] * Updated iso_3166/da.po and iso_3166_3/da.po 0.49 Alastair McKinstry <mckinstry@debian.org> [ Alastair McKinstry ] * iso_639: - vi.po updated by Clytie Siddall. - sr.po updated by Danilo Segan. - eo.po updated by Edmund GRIMLEY EVANS. - sl.po updated by Primož Peterlin. - ja.po updated by IIDA Yosiaki. * iso_3166: - sl.po updated by Primož Peterlin - ku.po by Erdal Ronahî. Closes: #335234. Remove byte-order-mark bytes from file ku.po: Closes: #335975. * iso_3166_2: - corrected spelling errors; thanks to Theppitak Karoonboonyanan. Closes: #331071. * iso_639.tab: Don't sort headers. Closes: #329549. * Allow iso-codes to build without pyxml. Patch thanks to James Henstridge. Closes: #331357, #328152. * Include draft iso_639_3.xml table so that translators see the translations needed. This generates a new.larger iso_639.pot. Closes: #325276. * Correct README file, mentioning XML rather than tab formats. Closes: #334744. [ Tobias Toedter ] * iso_4217: - th.po translated by Theppitak Karoonboonyanan. Closes: #333076 * iso_3166: - th.po updated by Theppitak Karoonboonyanan. Closes: #333080 * iso_3166_2: - th.po updated by Theppitak Karoonboonyanan. Closes: #333093 * iso_639: - th.po updated by Theppitak Karoonboonyanan. Closes: #333094
2006-03-30Avoid extra stat() calls by not repeatedly checking whether a filejlam1-3/+3
exists on the disk -- we can just check whether a variable defined by find-files.mk is "__nonexistent__" or not.
2006-03-30* Honor PKGINFODIR.jlam2-4/+4
* List the info files directly in the PLIST.
2006-03-22Use "tv@NetBSD.org" as my MAINTAINER address; makes identifying me as atv2-4/+4
PR-responsible person (such as I am ;) a little easier.
2006-03-21Make this build on Darwin.yyamano5-1/+66
2006-03-21Make this build on Darwin.yyamano2-11/+18
2006-03-21Bump PKGREVISION for all packages that had broken INSTALL/DEINSTALLjlam20-38/+40
scripts prior to revision 1.43 of bsd.pkginstall.mk.
2006-03-20* Nuke all references to and definitions of INFO_DIR in package Makefilesjlam2-8/+12
and replace with appropriate references to PKGINFODIR instead. * Properly account for split info files during installation. * Move info file listings directly into the package PLISTs. This fixes info-file-related PLIST problems.
2006-03-20Remove patch made unnecessary because the tools framework hides thejlam2-29/+1
GNU auto* tools automatically.
2006-03-18Depend on p5-File-MMagic>=1.25, from ISIHARA Takanori in PR 33099.wiz1-2/+2
No PKGREVISION bump since older versions broke configure (-> no binary pkg).
2006-03-17bl3 against netpbm. Fixes build failure. PKGREVISION++gdt1-2/+3
2006-03-16Consistently use NetBSD.org address for shell, shellhung.org does not exist.wiz1-2/+2