path: root/textproc
Age | Commit message | Author | Files | Lines
2009-02-14 | Fix patchsum. | joerg | 1 | -2/+2
2009-02-12 | DESTDIR support. | joerg | 2 | -5/+34
2009-02-11 | Fix build with Python 2.5. | joerg | 2 | -4/+10
2009-02-11 | add DESTDIR support. | markd | 1 | -2/+4
2009-02-11 | Sort PYTHON_VERSIONS_ACCEPTED. | joerg | 1 | -2/+2
2009-02-10 | 2.07 29 Jan 2009 | rhaen | 2 | -7/+6
- Minor correction to POD from Daniel Leidert <daniel.leidert@wgdd.de>.

2.06 20 Jan 2009
- Fixed make_encmap to avoid spurious output when there is an error with the input file. (Fixes [cpan #41854].)
2009-02-10 | ! Encode.pm | rhaen | 2 | -7/+6
VERSION++ just to make PAUSE happy Message-Id: <877i4anwwt.fsf@k75.linux.bogus> 2.28 Date: 2009/02/01 12:30:18 ! Unicode/Unicode.xs Latest refactoring broke the backward compatibility w/ Perl 5.8.6 and before now restored Message-Id: <1233185156.DABa130.74940@basic2.hostingcompartido.com> Message-Id: <693254b90902010027x277a5d0fm4f5700ba2f276239@mail.gmail.com> ! lib/Encode/MIME/Header.pm Addressed: Split header lines are joined incorrectly http://rt.cpan.org/Ticket/Display.html?id=42902 2.27 2009/01/21 22:55:07 ! lib/Encode/MIME/Header.pm t/mime-header.t Addressed: Encode::MIME::Header MIME-Q encoding truncates trailing zeros in some circumstances http://rt.cpan.org/Ticket/Display.html?id=42627 ! lib/Encode/Alias.pm Added alias: unicode-1-1-utf-7 http://rt.cpan.org/Ticket/Display.html?id=38558 ! Encode.pm Documented: _utf8_on() does not work for tainted values http://rt.cpan.org/Ticket/Display.html?id=41163 ! bin/enc2xs s[oss.software.ibm.com/icu][www.icu-project.org]g http://rt.cpan.org/Ticket/Display.html?id=40245 ! lib/Encode/Guess.pm t/guess.t Addressed:Empty file should produce an error message http://rt.cpan.org/Ticket/Display.html?id=38652 ! Unicode/Unicode.xs AUTHORS Refactored by Alex Davies http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2007-10/msg00745.html Message-Id: <7637669B2E3D46B187591747DA27F4C8@Amelie>
2009-02-09 | Switch to Python 2.5 as default. Bump revision of all packages that have changed runtime dependencies now. | joerg | 8 | -11/+16
2009-02-09 | Remove Python 2.1 support. | joerg | 10 | -43/+6
2009-02-08 | Update to 1.2.8. | ahoka | 4 | -11/+23
No longer needs ncurses (at least on NetBSD 5.0).

Official changelog:

2008-11-01: Hunspell 1.2.8 release:
- Default BREAK feature and better hyphenated word suggestion to accept and fix (compound) words with hyphen characters by spell checker instead of by word breaking code of OpenOffice.org. With this feature it's possible to accept hyphenated compound words, such as "scot-free", where "scot" is not a correct English word.
- ICONV & OCONV: input and output conversion tables for optional character handling or using special inner format. Example:

    # Accepting de facto replacements of the Romanian comma acuted letters
    SET UTF-8
    ICONV 4
    ICONV ş ș
    ICONV ţ ț
    ICONV Ş Ș
    ICONV Ţ Ț

  Typical usage of ICONV/OCONV is to manage an inner format for a segmental writing system, like the Ethiopic script of the Amharic language.
- Extended CHECKCOMPOUNDPATTERN to handle compound word alternations, like sandhi feature of Telugu and other writing systems.
- SIMPLIFIEDTRIPLE compound word feature: allow simplified Swedish and Norwegian compound word forms, like tillåta (till|låta) and bussjåfør (buss|sjåfør)
- wordforms: word generator script for dictionary developers (Hunspell version of unmunch).
- bug fixes

2008-08-15: Hunspell 1.2.7 release:
- FULLSTRIP: new option for affix handling. With FULLSTRIP, affix rules can strip full words, not only one less characters.
- COMPOUNDRULE works with all flag types. (COMPOUNDRULE is for pattern matching. For example, en_US dictionary of OpenOffice.org uses COMPOUNDRULE for ordinal number recognition: 1st, 2nd, 11th, 12th, 22nd, 112th, 1000122nd etc.).
- optimized suggestions:
  - modified 1-character distance suggestion algorithms: search a TRY character in all positions instead of all TRY characters in a character position (it can give more readable suggestion order, also better suggestions in the first positions, when TRY characters are sorted by frequency.) For example, suggestions for "moze": ooze, doze, Roze, maze, more etc. (Hunspell 1.2.6), maze, more, mote, ooze, mole etc. (Hunspell 1.2.7).
  - extended compound word checking for better COMPOUNDRULE related suggestions, for example English ordinal numbers: 121323th -> 121323rd (it needs also a th->rd REP definition).
- bug fixes

2008-07-15: Hunspell 1.2.6 release:
- bug fix release (fix affix rule condition checking of sk_SK dictionary, iconv support in stemming and morphological analysis of the Hunspell utility, see also Changelog)

2008-07-09: Hunspell 1.2.5 release:
- bug fix release (fix affix rule condition checking of en_GB dictionary, also morphological analysis by dictionaries with two-level suffixes)

2008-06-18: Hunspell 1.2.4-2 release:
- fix GCC compiler warnings

2008-06-17: Hunspell 1.2.4 release:
- add free_list() for C, C++ interfaces to deallocate suggestion lists
- bug fixes

2008-06-17: Hunspell 1.2.3 release:
- extended XML interface to use morphological functions by standard spell checking interface, spell() and suggest(). See hunspell.3 manual page.
- default dash suggestions for compound words: newword-> new word and new-word
- new manual pages: hunspell.3, hzip.1, hunzip.1.
- bug fixes
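The reordering of 1-character suggestions described for Hunspell 1.2.7 can be illustrated with a small Python sketch. This is not Hunspell's code: the function names, the six-word dictionary and the TRY string "artold" are assumptions picked only to show how swapping the two loops changes the order of the results.

```python
def suggest_by_position(word, try_chars, dictionary):
    """Older order: at each position, try every TRY character."""
    found = []
    for i in range(len(word)):
        for c in try_chars:
            candidate = word[:i] + c + word[i + 1:]
            if candidate != word and candidate in dictionary and candidate not in found:
                found.append(candidate)
    return found


def suggest_by_char(word, try_chars, dictionary):
    """Newer order: take each TRY character and try it at every position."""
    found = []
    for c in try_chars:
        for i in range(len(word)):
            candidate = word[:i] + c + word[i + 1:]
            if candidate != word and candidate in dictionary and candidate not in found:
                found.append(candidate)
    return found


# Hypothetical data: a tiny word list and a TRY string sorted by assumed frequency.
WORDS = {"ooze", "doze", "maze", "more", "mote", "mole"}
TRY = "artold"

print(suggest_by_position("moze", TRY, WORDS))
print(suggest_by_char("moze", TRY, WORDS))
```

With a frequency-sorted TRY string, the second function lists replacements using common letters first, regardless of where in the word they occur.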
2009-02-07 | Fix build on newer NetBSD. | joerg | 1 | -4/+2
2009-02-07 | Update to 1.1: | wiz | 2 | -6/+6
New in 1.1
- Make get_lb_prop_lang static and not an exported symbol
- Define is_line_breakable to alias to is_breakable
- Declare get_next_char_utf* will be changed to lb_get_next_char_utf*
- Move the declarations of get_next_char_utf* from linebreak.h to linebreakdef.h
- Add the function documentation comments to the header files
2009-02-02 | Note py-jinja dependency. | markd | 3 | -2/+20
pkgsrc's docutils doesn't install an egg-info, so don't note the requirement in this egg-info as it won't be detected. Bump PKGREVISION.
2009-02-02 | Add py-jinja | markd | 1 | -1/+2
2009-02-02 | Import of py-jinja 1.2 | markd | 5 | -0/+109
Jinja is a sandboxed template engine written in pure Python. It provides a Django like non-XML syntax and compiles templates into executable python code. It's basically a combination of Django templates and python code.
2009-01-31 | add py-sphinx | markd | 1 | -1/+2
2009-01-31 | Import py-sphinx 0.5.1 | markd | 4 | -0/+275
Sphinx is a tool that makes it easy to create intelligent and beautiful documentation for Python projects (or other documents consisting of multiple reStructuredText sources), written by Georg Brandl. It was originally created to translate the new Python documentation, but has now been cleaned up in the hope that it will be useful to many other projects. Sphinx uses reStructuredText as its markup language, and many of its strengths come from the power and straightforwardness of reStructuredText and its parsing and translating suite, the Docutils. Although it is still under constant development, the following features are already present, work fine and can be seen "in action" in the Python docs: * Output formats: HTML (including Windows HTML Help), plain text and LaTeX, for printable PDF versions * Extensive cross-references: semantic markup and automatic links for functions, classes, glossary terms and similar pieces of information * Hierarchical structure: easy definition of a document tree, with automatic links to siblings, parents and children * Automatic indices: general index as well as a module index * Code handling: automatic highlighting using the Pygments highlighter * Various extensions are available, e.g. for automatic testing of snippets and inclusion of appropriately formatted docstrings.
2009-01-29 | Update from version 0.58 to version 0.60. | seb | 2 | -6/+6
Upstream changes: 2009-01-27 0.60 - H.Merijn Brand <h.m.brand@xs4all.nl> * Optimize for threaded perls. (Goro Fuji, RT#42517) Non-threaded perls won't notice 2009-01-23 0.59 - H.Merijn Brand <h.m.brand@xs4all.nl> * Wrong e-mail in META.yml * Missing $IO argument in bind_columns example (docs only) * Upped Copyright notices to 2009 * Added warning for parse () (RT#42261) * Small optimisations (Goro Fuji, RT#42517) * ppport.h updated to 3.15 * Added git clone command to README * ppport.h updated to 3.16-pre * Optimize getline/print method calls (Goro Fuji, RT#42517) * Decode *_char attributes for perl 5.8.2 and up
2009-01-29 | Needs intltool. | joerg | 1 | -2/+2
2009-01-27 | + py-html5lib | joerg | 1 | -1/+2
2009-01-27 | Import py-html5lib-0.11: | joerg | 4 | -0/+159
html5lib is a pure-python library for parsing HTML. The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers. Output is to a tree structure; the current release supports output to DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a simple custom format.
2009-01-27 | Update to soprano 2.2 | markd | 3 | -10/+7
* The AsyncModel now has a "real" asynchronous mode which uses multiple threads to execute the commands.
* New class Soprano::Client::SparqlModel which provides a client to http Sparql endpoints.
* Added method Node::toN3 to easily build SPARQL queries without having to encode nodes all the time.
* New convenience method QueryResultIterator::iterateStatementsFromBindings
* Added convenience methods to handle BackendSettings.
* Added operator[] to QueryResultIterator to easily access bindings
* Added parameters influencing the visibility to onto2vocabularyclass
* Soprano::Node now comes with definitions of qHash for both Node and QUrl
* Sesame2 updated to 2.1.4
* The IndexFilterModel can now optionally index specific URIs (IndexFilterModel::setForceIndexPredicates)
* Added method IndexFilterModel::optimizeIndex which optimizes the clucene index on disk
* Added method IndexFilterModel::encodeUriForLuceneQuery which encodes/escapes a URI so it can be used as a field in a clucene query.
* SPARQL endpoint query support in sopranocmd
* Performance improvements
2009-01-26 | Make package build using Sun Studio. | sketch | 3 | -1/+38
2009-01-23 | Give up maintainership, as I have not so much as looked at pkgsrc in well over a year now. Sorry. :-( | jschauma | 1 | -2/+2
2009-01-21 | Add xapian-flint-backend and xapian-quartz-backend options, allowing rationalisation of backends (and also allows wiring the database to the older database format if desired). | dsainty | 2 | -14/+36
The "suggested options" select support for both formats, which is also the status quo.
2009-01-18 | + liblinebreak. | wiz | 1 | -1/+2
2009-01-18 | Initial import of liblinebreak-1.0: | wiz | 5 | -0/+56
This is liblinebreak, an implementation of the line breaking algorithm as described in Unicode 5.0.0 Standard Annex 14, Revision 19, available at http://www.unicode.org/reports/tr14/tr14-19.html
2009-01-18 | MAKE_JOBS_SAFE=no | obache | 1 | -1/+3
2009-01-12 | Update to 0.14.2: | wiz | 2 | -6/+6
Changes in 0.14.2:
------------------
* Updated translations: et (Priit Laes)
2009-01-12 | Update distinfo for p5-Data-FormValidator-4.63. | taca | 1 | -4/+4
2009-01-11 | Update from version 1.32 to 1.36. | he | 2 | -8/+7
Upstream changes: 1.36 Wed 7 Jan 2009 - Fixing missing feature reported by H.Merijn Brand 1.35 Wed 17 Dec 2008 - Production release for changes in 1.34_01 - No other changes 1.34_01 Wed 19 Nov 2008 - Improving sub-classability by passing the indent level through to the ->_write_scalar method. 1.33 Thu 24 Jul 2008 - Export Load and Dump functions by default.
2009-01-11 | Update from version 0.29 to 0.30. | he | 2 | -6/+7
Pkgsrc changes: o Add PERL5_REQD+= 5.8.3 Upstream changes: version: 0.30 date: Sun Jan 11 12:53:55 IST 2009 changes: - Plug a leak in Load - More efficient intermediate structure allocation
2009-01-11 | Update from version 1.37 to 1.42. | he | 2 | -7/+8
Pkgsrc changes:
 o Change to Module::Build
 o Adjust dependency according to META.yml

Upstream changes:

1.42 - January 3, 2008
 - When transforming elements with attributes into hash-refs, make sure they are placed in the correct namespace. Previously <media:description> overrode the plain <description> (as in Flickr).

1.41 - December 4, 2008
 - Made the default keys for the channel link, description and title undef instead of ''. They become '' once they are opened.
   - http://rt.cpan.org/Public/Bug/Display.html?id=40686
 - Now keeping track of rdf:resource of <image> in RSS 1.0:
   - http://rt.cpan.org/Ticket/Display.html?id=3978
 - Made sure that channel/{link,description,title} are localized to the empty string before rendering, to allow for better backwards compatibility.
   - http://rt.cpan.org/Public/Bug/Display.html?id=40686

1.40 - December 1, 2008
 - Now outputting an arrayref of <skipDays>/<day>'s and <skipHours>/<hours> properly.
   - http://rt.cpan.org/Public/Bug/Display.html?id=40978
 - Now parsing multiple <category>'ies inside an <item> into an array-reference.
   - http://rt.cpan.org/Ticket/Display.html?id=38296
 - Now outputting an arrayref of <item>/<category> properly.
   - http://rt.cpan.org/Ticket/Display.html?id=38296
 - Parsing multiple <channel>/<category> into arrays.
   - http://rt.cpan.org/Public/Bug/Display.html?id=9729
 - Rendering multiple <channel>/<category> elements from array refs.
   - http://rt.cpan.org/Public/Bug/Display.html?id=9729
 - Now parsing multiple dc:items into an array reference (at least inside an <item>):
   - http://rt.cpan.org/Public/Bug/Display.html?id=6000
 - Added the modules_as_arrays option to parse repeatable module elements into an array ref.

1.38 - November 27, 2008
 - Added support for duplicate items of RSS modules.
   - http://rt.cpan.org/Public/Bug/Display.html?id=4495
 - Added support for multiple Dublin Core elements. One can point the field to an array and it will create multiple elements:
   - http://rt.cpan.org/Public/Bug/Display.html?id=6000
 - Now parsing several <skipDays>/<day>s and <skipHours>/<hour>s into an array reference instead of concatenating them together.
   - http://rt.cpan.org/Public/Bug/Display.html?id=40978
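The "parse repeated elements into an array reference" behaviour listed above can be sketched in Python with the standard library. This illustrates the idea only and is not the Perl module's implementation; the function name `item_to_dict` and the sample feed fragment are hypothetical.

```python
import xml.etree.ElementTree as ET


def item_to_dict(item):
    """Collect child elements of an <item>; repeated tags become a list."""
    result = {}
    for child in item:
        text = (child.text or "").strip()
        if child.tag not in result:
            # First occurrence: keep a plain string.
            result[child.tag] = text
        elif isinstance(result[child.tag], list):
            # Third or later occurrence: extend the existing list.
            result[child.tag].append(text)
        else:
            # Second occurrence: promote the string to a list.
            result[child.tag] = [result[child.tag], text]
    return result


# Hypothetical sample data for illustration.
SAMPLE = "<item><title>T</title><category>a</category><category>b</category></item>"
parsed = item_to_dict(ET.fromstring(SAMPLE))
print(parsed)
```

Keeping single values as plain strings mirrors the backward-compatible default described in the changelog, where arrays appear only when an element actually repeats.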
2009-01-11 | Update from version 0.03 to 0.05. | he | 2 | -6/+6
Upstream changes: 0.05 Mon Dec 15 2008 - skip some tests when missing English dictionary - added license 0.04 Sun Dec 14 2008 - patch by Yanick Champoux with three changes : - lang option - cached Text::Aspell object - made suggestions() context-aware
2009-01-11 | Update from version 0.7 to 0.8. | he | 2 | -7/+8
Pkgsrc changes: o Add PERL5_REQD+=5.8.0 Upstream changes: 0.8 2008/12/07 - Added support for ddns-rev-name, ddns-txt and ddns-fwd-name statements, based on patch by Franck Joncourt.
2009-01-11 | Update from version 0.29nb1 to 0.30. | he | 2 | -7/+6
Upstream changes: 0.30 - 2008-11-26 * Remove the "wrapper" export, its name is too generic and clashes with a lot of existing code. It's a negligible amount of sugar.
2009-01-11 | Update from version 0.72 to 0.72.003. | he | 2 | -6/+6
Upstream changes: No change log available (lazy bums!). Judging from the diff they are minor.
2009-01-11 | Update from version 1.60nb1 to 1.61 (distname 1.61a -- someone messed up?) | he | 2 | -8/+10
Pkgsrc changes:
 o Add some kludges to tell pkgsrc this is really version 1.61, not 1.61a (the module version is 1.61)

Upstream changes:

Changes for version 1.61 (Dec 29, 2008)
------------------------
 - Fix bugs in locale operations for format_price (thanks Moritz Onken)
 - Fix documentation in format_bytes (rt # 42036)
 - Enable warning when format_bytes called with numeric precision not hash
2009-01-11 | Update from version 4.61nb1 to 4.63. | he | 1 | -6/+6
Pkgsrc changes: o Adjust dependencies, including reshuffling to match contents and order in META.yml o Require perl version 5.8, also according to META.yml Upstream changes: 4.63 Sat Jan 3 12:46:15 EST 2009 [DOCUMENTATION] - Improve docs for writing your own constraints (girlwithglasses) 4.62 Sat Jan 3 12:10:14 EST 2009 [DOCUMENTATION] - Fix typo in ConstraintsFactory docs (Geraint A. Edwards) - Clarified docs for get_filtered_data (Lisa Hagemann) - Clarify that 'field_filter_regexp_map' can take coderefs as values (Ron Savage) [INTERNALS] - Better diagnostics when load_profiles fails (Thoralf Klein) - Improved portability of t/untaint.t (Alexandr Ciornii) - Make some tests conditional on Date::Calc being installed (Frank Wiles)
2009-01-11 | Fix build with curl-7.19.x. | wiz | 2 | -1/+15
Still builds with this patch and curl-7.18, so no PKGREVISION bump. Addresses PR 40369.
2009-01-11 | Do not add "-lintl" to Makefile dependencies. | minskim | 2 | -5/+5
This addresses one of the issues in PR 30019.
2009-01-11 | openjade needs libintl. Include gettext-lib's buildlink3.mk. | minskim | 1 | -1/+2
2009-01-10 | Update to soprano 2.1.64 | markd | 3 | -9/+16
First beta of soprano 2.2. Required for KDE 4.2
2009-01-10 | + ebook-tools. | wiz | 1 | -1/+2
2009-01-10 | Initial import of ebook-tools-0.1.1, originally packaged for wip. | wiz | 6 | -0/+84
ebook-tools provides tools for accessing and converting various ebook file formats.
2009-01-09 | Remove unneeded patch. | wiz | 1 | -40/+0
2009-01-09 | Update to 20090107: | wiz | 2 | -7/+6
- fix build on the less GNUish systems where wc -l outputs leading whitespace - some words added
2009-01-08 | Buildlink in the required devel/zlib. Fixes build on systems where native zlib (header) is not installed. | dsainty | 1 | -1/+2
2009-01-07 | Update to 1.0.10: | wiz | 3 | -20/+11
Omega 1.0.10 (2008-12-23): build system: * This release now uses newer versions of the autotools (autoconf 2.62 -> 2.63; automake 1.10.1 -> 1.10.2). The newer autoconf fixes a regression in autoconf 2.62 (and so Omega 1.0.7) with detecting the endian-ness of some platforms. Omega 1.0.9 (2008-10-31): documentation: * docs/overview.html: Document HTML parsing a bit, including robots meta and htdig_noindex. omega: * omega: Catch std::exception and report what its what() method returns. * omega: Remove undocumented and non-functional support for numeric sorting via CGI parameter SORT=#<slot> (SORT=<slot> works as before). build system: * configure: Sync warning flag handling changes from xapian-core to eliminate many warnings from GCC 4.3. Omega 1.0.8 (2008-09-04): documentation: * Fix a few typos and improve wording in a few places. indexers: * omindex: + If the character encoding is specified using <meta http-equiv=...> in an HTML document then reparse the document if it isn't the encoding we're already using so that any preceding <title> is converted correctly (bug#292). + Convert text from meta tag parameters to UTF-8 (bug#293). + Handle <meta charset="..."> (new in HTML 5). + Fix bug in HTML tag parameter parsing which was probably just a small performance penalty in real world cases, but could perhaps result in parsing bogus extra parameters in carefully contrived situations. portability: * Add missing <signal.h>, noted on FreeBSD by Henrik Brix Andersen.
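The omindex change in Omega 1.0.8 (reparse the document when a <meta> tag declares a different encoding, including the HTML 5 <meta charset="..."> form) can be sketched in Python with the standard library. This is an illustration of the idea under stated assumptions, not Omega's C++ code: the class and function names are hypothetical, and ISO-8859-1 is only an assumed starting encoding.

```python
from html.parser import HTMLParser


class MetaCharsetFinder(HTMLParser):
    """Record the first charset declared by a <meta> tag."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.charset is not None:
            return
        attr = {name: (value or "") for name, value in attrs}
        if "charset" in attr:
            # HTML 5 form: <meta charset="utf-8">
            self.charset = attr["charset"].strip().lower() or None
        elif attr.get("http-equiv", "").lower() == "content-type":
            # Older form: <meta http-equiv="Content-Type" content="...; charset=...">
            for part in attr.get("content", "").split(";"):
                key, _, value = part.strip().partition("=")
                if key.strip().lower() == "charset" and value:
                    self.charset = value.strip().strip("\"'").lower()


def decode_html(raw, assumed="iso-8859-1"):
    """Decode with an assumed encoding, then redo it if <meta> disagrees."""
    text = raw.decode(assumed, errors="replace")
    finder = MetaCharsetFinder()
    finder.feed(text)
    finder.close()
    declared = finder.charset
    if declared and declared != assumed:
        try:
            # Second pass, so text before the <meta> (e.g. <title>) is decoded correctly.
            text = raw.decode(declared, errors="replace")
        except LookupError:
            pass  # Unknown encoding name: keep the first decoding.
    return text


raw_page = '<title>caf\u00e9</title><meta charset="utf-8">'.encode("utf-8")
print(decode_html(raw_page))
```

Decoding the whole input again is the simplest way to make sure a <title> that appears before the declaration is converted with the right encoding.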
2009-01-07 | Update to 1.0.10: | wiz | 2 | -7/+7
Xapian-core 1.0.10 (2008-12-23): API: * Composing an OP_NEAR query with two non-term subqueries now throws UnimplementedError instead of AssertionError (in a --enable-assertions build) or leading to unexpected results (otherwise). This partly addresses bug#201. * Using a MultiValueSorter with no values set no longer causes a hang or segmentation fault (but it is still rather pointless!) matcher: * If we're using values for sorting and for another purpose, cache the Document::Internal object created to get the value for sorting, like we do between other uses. flint backend: * If the disk became full while flushing database changes to disk, the WritableDatabase object would throw a DatabaseError exception but be left in an inconsistent state such that further use could lead to the database on disk ending up in a "corrupt" state (theoretically fixable, but no tool to fix such a database exists). Now we try to ensure that the object is left in a consistent state, but if doing so throws a further exception, we put the WritableDatabase object in a "closed" state such that further attempts to use it throw an exception. * Create the lockfile "flintlock" with permissions 0666 so that the umask is honoured just like we do for the other files (previously we used 0600). Previously it wasn't possible to lock a database for update if it was owned by another user, even if you otherwise had sufficient permissions via "group" or "other". * Fix garbled exception message when a base file can't be reread. quartz backend: * Fix garbled exception message when a base file can't be reread. remote backend: * xapian-tcpsrv and xapian-progsrv now accept -w as a short form of --writable, as was always intended. build system: * This release now uses newer versions of the autotools (autoconf 2.62 -> 2.63; automake 1.10.1 -> 1.10.2). documentation: * INSTALL: Add new paragraphs about HP's aCC and IRIX (adapted from footnotes in PLATFORMS). 
* PLATFORMS: HP testdrive has been shut down, so all mark all those machines as "no longer available". Update atreus' build report to 1.0.10. * docs/queryparser.html: Add link to valueranges.html. examples: * delve: Add missing "and" to --help output. Report termfreq and collection freq for each term we're asked about. portability: * Fix to build with GCC 4.4 snapshot. Xapian-core 1.0.9 (2008-10-31): API: * Database::get_spelling_suggestion() is now faster (15% speed up for parsing queries with FLAG_SPELLING_CORRECTION set in a test on real world data). * Fix OP_ELITE_SET segmentation fault due to excess floating point precision on x86 Linux (and possibly other platforms). * Database::allterms_begin() over multiple databases now gives a TermIterator with operations O(log(n)) rather than potentially O(n) in the number of databases. * Add new Database methods metadata_keys_begin() and metadata_keys_end() to allow the complete list of metadata in a database to be retrieved (this API addition is needed so that copydatabase can copy database metadata). testsuite: * Remove the cached test databases before running the testsuite. * apitest: Fix cursordelbug1 to work on Microsoft Windows (bug#301). * apitest,queryparsertest: Skip tests which fail because the timer granularity is too coarse to measure how long the test took. In practice, this is only an issue on Microsoft Windows (bug#300 and bug#308). matcher: * Adjust percent cutoff calculations in the matcher in a way which corresponds to the change to percentage calculations made in 1.0.7 to allow for excess precision. * Query::MatchAll no longer gives match results ranked by increasing document length. flint backend: * xapian-compact: Fix crash while compacting spelling table for a single database when built with MSVC, and probably other platforms, though Linux got lucky and happened to work (bug#305). build system: * configure: Disable -Wconversion for now - it's not useful for older GCC and is buggy in GCC 4.3. 
* configure: Set -Wstrict-overflow to 1 instead of 5, to avoid unreasonable warnings under GCC 4.3. documentation: * Minor improvements to API documentation, including documenting the XAPIAN_FLUSH_THRESHOLD environmental variable in WriteableDatabase::flush() (bug#306). * valueranges.html: Fix typos in example code, and drop superfluous empty destructor from ValueRangeProcessor subclass. * HACKING: Several improvements. examples: * copydatabase: Also copy user metadata. Xapian-core 1.0.8 (2008-09-04): API: * Fix output of RSet::get_description testsuite: * Report subtotals per backend, rather than per testgroup per backend to make the output easier to read. flint backend: * Fix WritableDatabase::add_document() and replace_document() not to be O(n*n) in the number of values in the new document. * Fix handling of a table created lazily after the database has had commits, and which is then cursored while still in sequential mode. * Fix failure to remove all the Btree entries in some cases when all the postings for a term are removed. (bug#287) * xapian-inspect: Show the help message on start-up. Correct the documented alias for next from ' ' to ''. Avoid reading outside of input string when it is empty. (bug#286) quartz backend: * Backport fix from flint for WritableDatabase::add_document() and replace_document() not to be O(n*n) in the number of values in the new document. build system: * configure: Report bug report URL in --help output. * xapian-config: Report bug report URL in --help output. * configure: Fix deprecation error for --enable-debug=full to say to instead use '--enable-assertions --enable-log' not '--enable-debug --enable-log'. documentation: * valueranges.html: Expand on some sections. examples: * quest: Fix to catch QueryParserError instead of const char * which QueryParser threw in Xapian < 1.0.0. * copydatabase: Use C++ forms of C headers. Only treat '\' as a directory separator on platforms where it is. 
Update counter every 13 counting up to the end so that the digits all "rotate" and the counter ends up on the exact total. portability: * Eliminate literal top-bit-set characters in testsuite source code.
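The flint change in 1.0.10 that creates "flintlock" with permissions 0666 relies on the usual POSIX rule that the mode requested at creation time is masked by the process umask. A minimal Python sketch of that rule follows. It is not Xapian code, it assumes a POSIX system, and the helper names and the second file name are hypothetical.

```python
import os
import stat
import tempfile


def create_with_mode(path, mode):
    """Create a file with the requested mode and return the mode it really got."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT, mode)
    os.close(fd)
    return stat.S_IMODE(os.stat(path).st_mode)


def compare_modes(umask_value):
    """Create one file with 0666 and one with 0600 under the given umask."""
    previous = os.umask(umask_value)
    try:
        with tempfile.TemporaryDirectory() as directory:
            open_mode = create_with_mode(os.path.join(directory, "flintlock"), 0o666)
            strict_mode = create_with_mode(os.path.join(directory, "flintlock-0600"), 0o600)
    finally:
        os.umask(previous)  # Always restore the caller's umask.
    return open_mode, strict_mode


open_mode, strict_mode = compare_modes(0o002)
print(oct(open_mode), oct(strict_mode))
```

Requesting 0666 lets the umask decide whether group or other users get access, whereas a hard-coded 0600 denies them even when the umask would allow it.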