summaryrefslogtreecommitdiff
path: root/textproc
AgeCommit message (Collapse)AuthorFilesLines
2013-07-02+py-anyjsonrodent1-1/+2
2013-07-02Import py27-anyjson-0.3.3 as textproc/py-anyjson.rodent4-0/+33
Anyjson loads whichever is the fastest JSON module installed and provides a uniform API regardless of which JSON implementation is used
2013-06-28+LICENSE= gnu-gpl-v2 OR gnu-lgpl-v2.1 OR mpl-1.1rodent1-1/+2
2013-06-28Updated to latest release, 4.6.0. MASTER_SITES changed, but other than thatrodent2-7/+8
nothing else relative to pkgsrc. There are hundred of additions in doc/ChangeLog. Most have to do with adding more words and cleanup/fixes.
2013-06-28Updated to latest release, 0.3.2. ${SORT} and ${CAT} are no longer usedrodent3-22/+19
by package. Updated patch to reflect changes in source file. From CHANGELOG: 0.3.2 * zdpo (en): small words update, improved "make clean" * verkarsky (sk): integration od 3 letters Slovak words * zdpo (sk): drobné aktualizácie, vylepąený príkaz "make clean" * verkarsky (sk): integrácia 3-písmenkových slovenských výrazov do slovníka 2005.09.06 0.3.1 * fred: remove install.sh * zdpo: words words words 2005.09.01 0.3.0 * zdpo: Licence change/extension to GPL(v2), LGPL (v2.1) and MPL (1.1). Details can be found at http://www.opensource.org/licenses/. * zdpo (sk): Roząírenie licencie na GPL(v2), LGPL (v2.1) a MPL (1.1). Anglické znenie licencií nájdete na http://www.opensource.org/licenses/. Preklady GPL a LGPL nájdete na www.gnu.sk a www.gnu.cz
2013-06-24lang/mono-basic, not lang/mono2-basic.dholland1-2/+2
2013-06-22Updating package for Perl Module Text::CSV_XS in similar named CPANsno2-6/+6
distribution from 0.99 to 1.01 because of serious upstream isse fixed. Upstream changes since 0.99: 1.01 - 2013-06-16, H.Merijn Brand - Cache not re-read on getline_all (RT#86155) 1.00 - 2013-06-13, H.Merijn Brand * Fix automatic UTF-8 in getline/parse for SV's with \0
2013-06-18Fix perl interpreter path in installed files. Bump PKGREVISION.wiz1-5/+19
2013-06-18Fix build with perl-5.18.wiz2-1/+17
2013-06-17Fix python interpreter path in installed files.wiz1-3/+5
2013-06-17Make mono packages depend on lang/mono2 instead of lang/mono.wiz4-13/+13
The latter contains the unstable version 3 of mono, which breaks many mono packages. Bump dependencies and PKGREVISIONs. Ok during freeze: gdt@
2013-06-16Update ruby-diff-lcs to 1.2.4.taca3-8/+8
=== 1.2.4 / 2013-04-20 * Bugs fixed: * A bug was introduced after 1.1.3 when pruning common sequences at the start of comparison. Paul Kunysch (@pck) fixed this in pull request 18. Thanks! https://github.com/halostatue/diff-lcs/pull/18 * The Rubinius (1.9 mode) bug in rubinius/rubinius#2268 has been fixed by the Rubinius team two days after it was filed. Thanks for fixing this so quickly! https://github.com/rubinius/rubinius/issues/2268 * Switching to Raggi's hoe-gemspec2 for gemspec generation. == 1.2.3 / 2013-04-11 * Bugs Fixed: * The new encoding detection for diff output generation (added in 1.2.2) introduced a bug if the left side of the comparison was the empty set. Originally found in rspec/rspec-expectations#238 and rspec/rspec-expectations#239. Jon Rowe developed a reasonable heuristic (left side, right side, empty string literal) to avoid this bug. https://github.com/rspec/rspec-expectations/pull/238 https://github.com/rspec/rspec-expectations/pull/239 * There is a known issue with Rubinius in 1.9 mode reported in rubinius/rubinius#2268 and demonstrated in the Travis CI builds. For all other tested platforms, diff-lcs is considered stable. As soon as a suitably small test-case can be created for the Rubinius team to examine, this will be added to the Rubinius issue around this. https://github.com/rubinius/rubinius/issues/2268 https://travis-ci.org/halostatue/diff-lcs/jobs/6241195 == 1.2.2 / 2013-03-30 * Bugs Fixed: * Diff::LCS::Hunk could not properly generate a difference for comparison sets that are not US-ASCII-compatible because of the use of literal regular expressions and strings. Jon Rowe (JonRowe) found this in rspec/rspec-expectations#219 and provided a first pass implementation in diff-lcs#15. I've reworked it because of test failures in Rubinius when running in Ruby 1.9 mode. This coerces the added values to the encoding of the old dataset (as determined by the first piece of the old dataset). https://github.com/rspec/rspec-expectations/issues/219 https://github.com/halostatue/diff-lcs/pull/15 * Adding Travis CI testing for Ruby 2.0.
2013-06-16- Updated package to also install the cmxs (dynamic link) file.jaapb4-9/+29
2013-06-16Fix PKGNAME.ryoon1-2/+2
2013-06-16Update to 0.94.4ryoon11-48/+107
* HOMEPAGE and MASTER_SITES are changed to Debian's alioth. Changelog: unknown
2013-06-15Added new ocaml-csv package to Makefilejaapb1-1/+2
2013-06-15Added new package, textproc/ocaml-csv. A pure OCaml library that supportsjaapb4-0/+58
comma separated values.
2013-06-14Install url_handler.sh, from Dario Niedermann on pkgsrc-users.wiz2-3/+9
Fix bash call in script and add run-time-dependency on bash for the script. Bump PKGREVISION.
2013-06-13Update ruby-json-pure to 1.8.0obache2-6/+6
2013-05-13 (1.8.0) * Fix https://github.com/flori/json/issues/162 reported by Marc-Andre Lafortune <github_rocks@marc-andre.ca>. Thanks! * Applied patches by Yui NARUSE <naruse@airemix.jp> to suppress warning with -Wchar-subscripts and better validate UTF-8 strings. * Applied patch by ginriki@github to remove unnecessary if. * Add load/dump interface to JSON::GenericObject to make serialize :some_attribute, JSON::GenericObject work in Rails active models for convenient SomeModel#some_attribute.foo.bar access to serialised JSON data.
2013-06-13Update ruby-json to 1.8.0.obache2-6/+6
2013-05-13 (1.8.0) * Fix https://github.com/flori/json/issues/162 reported by Marc-Andre Lafortune <github_rocks@marc-andre.ca>. Thanks! * Applied patches by Yui NARUSE <naruse@airemix.jp> to suppress warning with -Wchar-subscripts and better validate UTF-8 strings. * Applied patch by ginriki@github to remove unnecessary if. * Add load/dump interface to JSON::GenericObject to make serialize :some_attribute, JSON::GenericObject work in Rails active models for convenient SomeModel#some_attribute.foo.bar access to serialised JSON data.
2013-06-12Remove GNU_CONFIGURE_LIBDIR.wiz1-2/+2
Rationale: undocumented, easily replaced with "CONFIGURE_ARGS+=--libdir=" and only used by 10 packages.
2013-06-11- Patched package to compile with new version of net/ocamlnetjaapb3-6/+21
2013-06-11Update to 1.12ryoon2-6/+6
Changelog: 1.12 (2012/12/22) * Fix error handling (Eric Leblond) * Put operation in try pass. (Eric Leblond) * Don't ignore header files (Frédéric Yhuel) * Don't rm temporary .cocci file in verbose mode (Frédéric Yhuel)
2013-06-11Update to 1.30wen2-8/+7
Update DEPENDS Upstream changes: 1.30 Tue Jun 11 00:06:02 2013 - catch up Text::CSV_XS 0.99 (except for diag_verbose and allow_unquoted_escape) * TODO: Documents and tests in the next version
2013-06-11Update to 0.99wen2-7/+6
Upstream changes: 0.99 - 2013-06-05, H.Merijn Brand * Documents return value of bind_columns without arguments * Fix automatic UTF-8 in getline/parse 0.98 - 2013-06-03, H.Merijn Brand * Clarify eol documentation * Move error_input to XS
2013-06-09Fix hardcoded ${INSTALL} path with SUBST. Likely fixes install failure onrodent1-1/+2
SunOS.
2013-06-09gmake is required for SunOS.obache1-1/+7
2013-06-09Update to 1.2.1ryoon3-7/+9
Changelog: Version 1.2.1 - March 2013, by Martin von Gagern Added Esperanto translation. Updated Czech, German, Spanish, Finnish, Galician, Italian, Dutch, Polish, Slovenian, Serbian, Swedish, Ukrainian and Vietnamese translations. Updated gnulib. Recreated build system using recent versions of autotools. This will avoid security issues in "make distcheck" target. (CVE-2012-3386)
2013-06-06Bump PKGREVISION for libXft changes for NetBSD native X support onwiz15-30/+30
NetBSD 6, requested by tron.
2013-06-06The revived rss2email (not yet in pkgsrc) requires Python 3.2 orschmonz1-1/+3
higher, and depends on this, so this must work with Python 3.2 or higher.
2013-06-06Update to 3.200.3. No changelog provided. Summarized highlightsschmonz3-29/+21
from the git log: * allow module to be imported * set UTF-8 as default encoding * better handling of Google Docs HTML * better handling of more edge-case inputs * nitpicky bugfixes to whitespace, emphasis, etc. * new config options
2013-06-04Try to fix the fallout caused by the fix for PR pkg/47882. Part 3:tron16-32/+32
Recursively bump package revisions again after the "freetype2" and "fontconfig" handling was fixed.
2013-06-04Update to 1.2.15.0. From the changelog:schmonz2-7/+6
1.2.15.0 Tue Apr 16 23:43:24 UTC 2013 [Changes contributed by Olly Betts] - Remove superfluous duplicate method wrappers from WritableDatabase for methods wrapped in Database parent class. - Improve test coverage. - Fix minor typo in POD documentation. 1.2.14.0 Thu Mar 14 23:12:38 UTC 2013 [Changes contributed by Olly Betts] - Perl 5.16.1 adds a '.' after "at foo line 123" so adjust regexp in testcase t/10query.t to allow an optional '.' there. (ticket#610) 1.2.13.0 Wed Jan 9 11:19:20 UTC 2013 [Changes contributed by Adam Sjøgren] - Wrap 2 and 3 parameter forms of StringValueRangeProcessor constructor so prefixes and suffixes can be specified. (ticket#607) [Changes contributed by Olly Betts] - Wrap the zero argument Query constructor. - Add wrappers for Query::MatchAll and Query::MatchNothing. - Don't pointlessly reinitialise default std::string parameters to an empty string (performance and code size micro-optimisation). 1.2.12.0 Wed Jun 27 12:17:26 UTC 2012 - No change except for bumping the version to indicate compatibility with Xapian 1.2.12. 1.2.11.0 Tue Jun 26 12:13:39 UTC 2012 - No change except for bumping the version to indicate compatibility with Xapian 1.2.11. 1.2.10.0 Wed May 9 10:45:51 UTC 2012 [Changes contributed by Olly Betts] - Wrap Database::close() (was previously only wrapped for WritableDatabase). - Suppress warnings about "not a known MakeMaker parameter name" in a way which also works for newer versions of Perl. 1.2.9.0 Thu Mar 8 07:19:27 UTC 2012 [Changes contributed by Olly Betts] - Wrap Document::get_docid() method. - Fix "Use of qw(...) as parentheses is deprecated" warnings in tests with Perl 5.14. - Improve test coverage of TermGenerator (backported from trunk).
2013-06-04Update to 1.2.15. From the changelog:schmonz4-25/+25
Omega 1.2.15 (2013-04-16): omega: * Don't pointlessly link utf8convert.o into the omega CGI. Omega 1.2.14 (2013-03-14): indexers: * omindex: + Correct "max" -> "min" when reserving space for shared strings in .xlsx files. This just means we now reserve a more appropriate amount of space to start with. + Ignore .com files by default. Omega 1.2.13 (2013-01-09): indexers: * omindex: + Extracting text using external filters now works for filenames containing a newline character - previously the newline got lost during escaping for the shell. + Fix segfault when -F option without a ':' is passed. + Skip a file if we get a read error while calculating the MD5 checksum (used for duplicate detection) - previously we used a checksum of the file up to that point. + Avoid rereading SVG and Atom files when we calculate their MD5 checksums. + Improvement --help output and man page, most notably: - Say explicitly that --sample-size accepts the same formats as --max-size. - Note default size limit on files to index is unlimited. + When generating a sample for a CSV file, limit the size we pre-allocate to the CSV file size if that's smaller than the requested sample size, in case the user sets that limit very high. omega: * Fix to decode %-encoded character at the end of the query string. Omega 1.2.12 (2012-06-27): No changes since 1.2.11 except to bump the version - this release was made to fix an incorrect library version information update in xapian-core 1.2.11. Omega 1.2.11 (2012-06-26): indexers: * Change HTML parser's handling of multiple <body> tags and of text outside of <body> to match the behaviour of modern web browsers. (ticket#599) * omindex: + Add command line option to control the size of the document sample stored. Patch from Mihai Bivol. + Rework .xlsx parsing to substitute the shared strings into the positions they are used in, so that the sample actually matches what appears in the spreadsheet, and to index calculated cell contents. + Improve handling of headers and footers in OpenDocument documents. + pdftotext outputs a formfeed between each page, which messes up our "empty body" check, so trim any trailing formfeeds before this check. Omega 1.2.10 (2012-05-09): indexers: * Add support for CDATA to HTML/XML parser. * omindex: + Add --max-size option, based on patch from ndaley in ticket#587. + Add support for atom feed files, patch from Mihai Bivol in ticket#595. + If the document with the highest existing docid before the run was updated, we were reporting it as "added", but now we correctly report it as "updated". (Backported from 1.3.0). + Catch and report std::exception explicitly, so failing to allocate memory is no longer reported as "Unknown exception". (Backported from 1.3.0). Omega 1.2.9 (2012-03-08): documentation: * docs/overview.html: + Document that libmagic is used to determine the MIME type if the extension isn't known. Partly addresses ticket#569. + We now limit time as well as CPU and memory for external filters. indexers: * Our HTML parser now ignores sections bracketed by <!--UdmComment--> and <!--/UdmComment-->, like we already do for <!--htdig_noindex-->. * omindex: Add more extensions to the default ignore list: bin dat db fon jar lnk pyc pyd pyo sqlite sqlite3 sqlite-journal tmp ttf
2013-06-04Update to 1.2.15. From the changelog:schmonz3-10/+10
Xapian-core 1.2.15 (2013-04-16): API: * QueryParser/TermGenerator: Don't include CJK codepoints which are punctuation in N-grams. * TermGenerator: Fix bug where we failed to generate the first bigram from the second sequence of N-grammable CJK characters in a piece of text. brass backend: * Call fdatasync()/fsync() when creating the "iambrass" file. chert backend: * Call fdatasync()/fsync() when creating the "iamchert" file. flint backend: * Call fdatasync()/fsync() when creating the "iamflint" file. tools: * delve: If -v is specified more than once, show even more info in some cases. Xapian-core 1.2.14 (2013-03-14): API: * MSet::get_document(): Don't cache retrieved Document objects unless they were requested with fetch(). This avoids using a lot of memory when many MSet entries are retrieved. (Fixes #604) matcher: * Check if a candidate document has at least the minimum weight needed before checking positional information, which speeds up slow phrase searches (partly addresses #394). brass backend: * Fix multipass compaction not to damage document values, and to merge the database stats correctly. (fixes #615) chert backend: * Fix multipass compaction not to damage document values, and to merge the database stats correctly. (fixes #615) flint backend: * Fix multipass compaction bug. (fixes #615) tools: * xapian-replicate: + Fix handling of delays between replication events - the subtraction of the target time and the current time was reversed, so we wouldn't sleep when before the deadline, but would sleep after it for the amount we'd missed it by. + On Microsoft Windows, we no longer sleep for more than 43 years if the target time for a replication event had already passed. (Fixes #472) Xapian-core 1.2.13 (2013-01-09): API: * TermGenerator: Add new method TermGenerator::set_max_word_length() to allow this limit to be adjusted by the user. * QueryParser: Implicitly close any unclosed brackets at the end of the query string. Patch from Sehaj Singh Kalra. * DateValueRangeProcessor: Add extra constructor overloaded form so that in DateValueRangeProcessor(1, "date:"), the const char * gets interpreted as std::string rather than bool. matcher: * Improved fix for #590 - count all matching LeafPostList objects with a Weight object rather than trying to prune at the MultiAndPostList level based on max_wt (if wdf is always zero for a term, BM25 gives max_wt of 0, which lead to us never counting that subquery. * Fix calculation of 0.0/0.0 in some cases. This then got used as a minimum weight, but it seems this gives -nan (at least on x86-64 Linux) so it may have been harmless in practice. * We no longer use the highest weighted MSet entry to calculate percentages, so remove code which finds it. brass backend: * Close excess file handles before we get the fcntl lock, which avoids the lock being released again if one is open on the lock file. Notably this avoids a situation where multiple threads in the same process could succeed in locking a database concurrently. chert backend: * Close excess file handles before we get the fcntl lock, which avoids the lock being released again if one is open on the lock file. Notably this avoids a situation where multiple threads in the same process could succeed in locking a database concurrently. flint backend: * Close excess file handles before we get the fcntl lock, which avoids the lock being released again if one is open on the lock file. Notably this avoids a situation where multiple threads in the same process could succeed in locking a database concurrently. remote backend: * Improve the UnimplementedError message for a MatchSpy subclass which doesn't implement name() so it's clearer that it is this particular subclass which can't be used remotely, rather than all MatchSpy objects. documentation: * valueranges.html: Update documentation to reflect change in Xapian 1.1.2 - DateValueRangeProcessor and StringValueRangeProcessor now support a prefix or suffix. * Clarify that the "reverse" parameter of set_sort_by_relevance_then_value() and set_sort_by_relevance_then_key() only affects the ordering of the value/key part of the sort. * docs/quickstart.html: Fix seriously outdated statement that Xapian doesn't create the database directory - that changed in 0.7.2 (released 2003-07-11). * HACKING: Try to make it clearer we're looking for a dual-licence on submitted patches. tools: * xapian-replicate: + Add a --full-copy option to force a full copy to be sent. (ticket#436) + Add --quiet option, and be a little more verbose by default. + Allow files > 32G to be be copied by replication. + Fix "if (fd > 0)" tests in some replication code to be "if (fd >= 0)". In practice this is unlikely to actually have caused problems since stdin is typically still open and using fd 0. + Simplify how we open the .DB file on the replication slave to just call open() once with O_CREAT, rather than once without, than stat() if that fails, and then again with O_CREAT|O_TRUNC if stat() doesn't show an ordinary file exists. examples: * quest: + New --flags command line option to allow setting arbitrary QueryParser flags. + Align option descriptions in --help output, and make the initial letter of such descriptions consistently lowercase.
2013-06-04Changes 51.2:adam3-7/+8
Bug fixes: * fix for enumset.h not being installed on Windows * zOS pkgdata fix * Test fixes * Region enumeration fix * make stable sort faster * host failures for DateFormatTest * LayoutEngine security patches (see above) * ubrk fix for word_POSIX infinite loop * fix memory leak/crash in LayoutEngine * fix header guard typo in layout/TibetanReordering.h
2013-06-03Set as BROKEN on Linux due to non-ASCII PLIST and thus problem withrodent1-1/+6
libarchive on glibc systems.
2013-06-03Set as BROKEN on Linux due to non-ASCII PLIST and thus a problem withrodent1-1/+6
libarchive on glibc systems.
2013-06-03Bump freetype2 and fontconfig dependencies to current pkgsrc versions,wiz16-31/+32
to address issues with NetBSD-6(and earlier)'s fontconfig not being new enough for pango. While doing that, also bump freetype2 dependency to current pkgsrc version. Suggested by tron in PR 47882
2013-06-02Remove "my" before $_, since perl 5.18 complains about it.wiz2-8/+9
2013-05-31Bump all packages for perl-5.18, thatwiz268-505/+537
a) refer 'perl' in their Makefile, or b) have a directory name of p5-*, or c) have any dependency on any p5-* package Like last time, where this caused no complaints.
2013-05-29Recursive revbump because poppler updated its shlib major for awiz1-2/+2
x.y.z+1 change, again... Noted by tron.
2013-05-29Return end() when iteration fails due to node being NULL.joerg3-3/+19
Bump revision.
2013-05-28fix for Python-2.6drochner2-1/+24
2013-05-26Updated textproc/py-libxml2 to 2.9.1adam2-6/+6
2013-05-26Changes 2.9.1:adam11-274/+26
Features: Support for Python3, Add xmlXPathSetContextNode and xmlXPathNodeEval Documentation: Add documentation for xmllint --xpath Fix the URL of the SAX documentation from James Fix spelling of "length" Portability: Fix python bindings with versions older than 2.7 rebuild docs:Makefile.am elfgcchack.h after rebuild in doc elfgcchack for buf module Fix a uneeded and wrong extra link parameter Few cleanup patches for Windows Fix rpmbuild --nocheck Fix for win32/configure.js and WITH_THREAD_ALLOC Fix Broken multi-arch support in xml2-config Fix a portability issue for GCC < 3.4.0 Windows build fixes Fix a thread portability problem Downgrade autoconf requirement to 2.63 Bug Fixes: Fix a linking error for python bindings Fix a couple of return without value Improve the hashing functions Improve handling of xmlStopParser() Remove risk of lockup in dictionary initialization Activate detection of encoding in external subset Fix an output buffer flushing conversion bug Fix an old bug in xmlSchemaValidateOneElement Fix configure cannot remove messages fix schema validation in combination with xsi:nil xmlCtxtReadFile doesn't work with literal IPv6 URLs Fix a few problems with setEntityLoader Detect excessive entities expansion upon replacement Fix the flushing out of raw buffers on encoding conversions Fix some buffer conversion issues When calling xmlNodeDump make sure we grow the buffer quickly Fix an error in the progressive DTD parsing code xmllint should not load DTD by default when using the reader Try IBM-037 when looking for EBCDIC handlers Fix potential out of bound access Fix large parse of file from memory Fix a bug in the nsclean option of the parser Fix a regression in 2.9.0 breaking validation while streaming Remove potential calls to exit() Improvements: Regenerated API, and testapi, rebuild documentation Fix tree iterators broken by 2to3 script update all tests for Python3 and Python2 A few more fixes for python 3 affecting libxml2.py Fix compilation on Python3 Converting apibuild.py to python3 First pass at starting porting to python3 updated configure.in for python3 Add support for xpathRegisterVariable in Python Added a regression tests from bug 694228 data Cache presence of '<' in entities content Avoid extra processing on entities Python binding for xmlRegisterInputCallback Python bindings: DOM casts everything to xmlNode Define LIBXML_THREAD_ALLOC_ENABLED via xmlversion.h Adding streaming validation to runtest checks Add a --pushsmall option to xmllint Cleanups: Switched comment in file to UTF-8 encoding Extend gitignore Silent the new python test on input Cleanup of a duplicate test Cleanup on duplicate test expressions Fix compiler warning after 153cf15905cf4ec080612ada6703757d10caba1e Spec cleanups and a fix for multiarch support Silence a clang warning Cleanup the Copyright to be pure MIT Licence wording rand_seed should be static in dict.c Fix typos in parser comments
2013-05-25The dictionary is named british, not en_GB. Add missing (n)cursesjoerg4-6/+58
dependency. Don't use getline as function name. Bump revision.
2013-05-24Fix build with Python 3.2joerg1-1/+6
2013-05-23Fix Python 3.x build.joerg3-1/+78
2013-05-23Fix include order.joerg2-1/+25