path: root/textproc
Age  Commit message  Author  Files  Lines
2015-09-22Update p5-XML-Atom-SimpleFeed to 0.863.kleink2-6/+6
0.863 Thu 10 Sep 2015 [INCOMPATIBLE CHANGE] - Datetime object support now via ->epoch method instead of ->strftime. Despite the fact that this is an incompatible change, it should actually be a nonevent for almost all users, because every datetime module I could find that supports ->strftime also supports ->epoch (and vice versa). However, the ->strftime methods of many modules are (subtly or badly) broken in the face of timezones even as their ->epoch methods work right (or else are broken subtly enough to escape notice). But if you have written your own datetime class, and it has a ->strftime method but not an ->epoch method, and you pass an instance of that class to an instance of this module, then the feeds you generate that way will now be broken. On balance, I believe that this change will unbreak vastly more code than it breaks. Therefore I decided to switch.
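The reasoning in this entry can be illustrated outside Perl. The following is a minimal Python sketch, not the module's code: `rfc3339_from_epoch` is a hypothetical helper, and the point is only that epoch seconds carry no timezone, so formatting from them gives the same feed timestamp whichever zone the original object was in.

```python
from datetime import datetime, timedelta, timezone


def rfc3339_from_epoch(epoch_seconds: int) -> str:
    """Format epoch seconds as an RFC 3339 UTC timestamp (hypothetical helper)."""
    dt = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return dt.strftime("%Y-%m-%dT%H:%M:%SZ")


# One instant, expressed in two different zones.
utc_dt = datetime(2015, 9, 10, 12, 0, 0, tzinfo=timezone.utc)
tokyo_dt = utc_dt.astimezone(timezone(timedelta(hours=9)))

# Both objects share one epoch value, so both yield the same string.
print(rfc3339_from_epoch(int(utc_dt.timestamp())))
print(rfc3339_from_epoch(int(tokyo_dt.timestamp())))
```

Calling a zone-unaware `strftime` directly on `tokyo_dt` would instead print 21:00:00, which is the kind of subtle breakage the changelog describes.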
2015-09-21Update textproc/miller to 2.2.0:wiz2-6/+6
Multi-character RS,FS,PS You can process CRLF-terminated DKVP files with mlr --dkvp --rs crlf. You can process LF-terminated CSV files with mlr --csv --rs lf. You can process TSV using mlr --fs tab; you can convert TSV to CSV using mlr --ifs tab --ofs comma. Along with many more possibilities. Please see mlr -h for more information. There is one minor, backward-incompatible change which I felt not worth calling this 3.0.0: default field separator for NIDX format is now space, not comma.
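As a rough illustration of what the TSV-to-CSV conversion mentioned above amounts to, here is a small Python sketch using only the standard `csv` module. It does not call Miller, the function name `tsv_to_csv` is made up for this example, and Miller's own quoting and separator handling may differ.

```python
import csv
import io


def tsv_to_csv(tsv_text: str) -> str:
    """Re-emit tab-separated text as comma-separated text (illustrative only)."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    out = io.StringIO()
    # Use LF line endings so the output is predictable across platforms.
    writer = csv.writer(out, lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()


print(tsv_to_csv("name\tcount\nfoo\t3\n"), end="")
```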
2015-09-19Fix build problem with ruby22.taca2-13/+31
2015-09-17Improve error message when distfile missing, like multimedia/libdvdcss.tnn1-1/+10
2015-09-13Update ruby-redcarpet to 3.3.2.taca2-6/+6
Exact changes are not available.
2015-09-13Update ruby-multi_json to 1.11.2taca2-6/+6
1.11.2 ------ * [Only pass one argument to JrJackson when two is not supported](https://github.com/intridea/multi_json/commit/e798fa517c817fc706982d3f3c61129b6651d601)
2015-09-13Update ruby-kramdown to 1.8.0.taca3-7/+12
kramdown 1.8.0 released This release contains only some minor updates and bug fixes. Changes * 4 minor changes: - The LaTeX converter now uses \texttt instead of \tt for code spans (fixes #257, reported by richard101696) - New option footnote_backlink for changing the backlink of footnotes in the HTML converter (fixes #247, requested by Benjamin Esham) - A quote directly followed by an ellipsis is now converted into an opening quotation mark (fixes #253, requested by Michael Franzl) - Removed warning for self-closing HTML elements that are not self-closed (fixes #262, requested by Gregory Pakosz) * 3 bug fixes: - Fixed #251: The special character sequence \` now works correctly when used in footnotes or headers that appear in the table of contents (reported by Peter Kehl) - Fixed #254: kramdown crashed on encountering a table with multiple consecutive separator lines (reported by Christian Kruse) - Fixed #256: Certain footnote definitions and codeblocks lead to crashes or unneeded backtracking in the regular expression engine - fixed by using atomic grouping (reported by Ali Ok)
2015-09-13Update ruby-haml to 4.0.7.taca4-9/+16
Add pkg_alternative support. # Haml Changelog ## 4.0.7 Released on August 10, 2015 ([diff](https://github.com/haml/haml/compare/4.0.6...4.0.7)). * Significantly improve performance of regexp used to fix whitespace handling in textareas (thanks [Stan Hu](https://github.com/stanhu)).
2015-09-13Revbump all packages using Go after the Go update.bsiegert1-1/+2
2015-09-12cloud.github.com unconditionally redirects to https. Use it directly.tnn2-4/+4
2015-09-12Needs pkg-config.joerg1-1/+2
2015-09-11Add a minimum version to dependencies, it seems that pbulk won't workbouyer2-4/+4
without it.
2015-09-11+p5-XML-Descentbouyer1-1/+3
+p5-XML-TokeParser
2015-09-11Import p5-XML-Descent-1.04 to pkgsrcbouyer3-0/+28
The conventional models for parsing XML are either DOM (a data structure representing the entire document tree is created) or SAX (callbacks are issued for each element in the XML). XML grammar is recursive - so it's nice to be able to write recursive parsers for it. XML::Descent allows such parsers to be created.
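The remark that a recursive grammar invites a recursive parser can be sketched in a few lines. This Python example is an assumption-laden illustration: it walks a tree already built by `xml.etree.ElementTree`, so it shows only the recursive shape of such code, not the XML::Descent API, and `outline` is a name invented here.

```python
import xml.etree.ElementTree as ET


def outline(element, depth=0):
    """Recursively render an element and its children as indented tag names."""
    lines = ["  " * depth + element.tag]
    for child in element:
        # Each child is handled by the same function, one level deeper.
        lines.extend(outline(child, depth + 1))
    return lines


doc = ET.fromstring("<book><chapter><title/></chapter><index/></book>")
print("\n".join(outline(doc)))
```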
2015-09-11Import p5-XML-TokeParser 0.05 to pkgsrcbouyer3-0/+35
XML::TokeParser provides a procedural ("pull mode") interface to XML::Parser in much the same way that Gisle Aas' HTML::TokeParser provides a procedural interface to HTML::Parser. XML::TokeParser splits its XML input up into "tokens," each corresponding to an XML::Parser event. A token is a bless'd reference to an array whose first element is an event-type string and whose last element is the literal text of the XML input that generated the event, with intermediate elements varying according to the event type. Each token is an object of type XML::TokeParser::Token. Read "XML::TokeParser::Token" to learn what methods are available for inspecting the token, and retrieving data from it.
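For readers unfamiliar with the "pull mode" style described above, the following Python sketch shows the same idea with the standard library's `XMLPullParser`: the caller asks for events instead of registering callbacks. It is an analogy only; the `tokens` helper and the tuple layout are assumptions of this example and do not mirror XML::TokeParser's token objects.

```python
import xml.etree.ElementTree as ET


def tokens(xml_text):
    """Return (event, tag) pairs pulled from the parser, in document order."""
    parser = ET.XMLPullParser(events=("start", "end"))
    parser.feed(xml_text)
    parser.close()
    # read_events() hands back whatever events have accumulated so far.
    return [(event, elem.tag) for event, elem in parser.read_events()]


for event, tag in tokens("<a><b/></a>"):
    print(event, tag)
```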
2015-09-10Updated textproc/qprint to 1.1.wiz2-7/+7
Changes not found. Addresses PR 50225 by Dario Niedermann.
2015-09-09Add missing dependencies.gdt1-2/+5
From Jason Bacon via pkgsrc-users.
2015-09-09Update p5-XML-Atom-SimpleFeed to 0.862.kleink2-7/+6
0.862 Tue 08 Sep 2015 - Correct RFC 3339 formatting for datetime objects (Thanks, virii)
2015-09-07Now that _STRIPFLAG_INSTALL is disabled by default on Darwin, remove manualjperkin1-2/+2
settings of INSTALL_UNSTRIPPED=yes for Darwin in individual packages.
2015-09-07Update to 1.04mef2-7/+6
-------------- 1.04 2015-07-04 NEILB - Added [MetaJSON] to dist.ini, so release will include META.json. RT#105628 from ETHER++
2015-09-07Update to 1.31mef2-7/+6
-------------- 1.31 2015-07-04 NEILB - Added [MetaJSON] to dist.ini, so release will include META.json. RT#105631 from ETHER++
2015-09-07Update to 0.29mef2-7/+6
-------------- 0.29 May 25 2015 Fixed RT bug #104637 [PATCH]improve documentation on acronym input Removed redundant call that remained after 104419 patch was applied
2015-09-07Update to 1.55mef2-7/+6
-------------- 1.55 2015-07-26 rurban ---- * Fix t/z_pod-coverage.t with old Test::More by Kent Frederik. RT #106090. * Fix t/tee.t + t/order.t race under parallel testing. RT #105396. Thanks to Kent Frederik * Fix Filter exec refcount, breaking earlier parse exits with __DATA__ RT #101668 Thanks to user42_kevin@yahoo.com.au * Add missing filter_del in exec filter. * Add pod for Filter::Util::Call::unimport to fix t/z_pod-coverage.t
2015-09-07Update 2.73 to 2.76mef2-7/+6
------------------- $Revision: 2.76 $ $Date: 2015/07/31 02:18:28 $ ! ucm/koi8-u.ucm Pulled: Fix 0x95 https://github.com/dankogai/p5-encode/pull/41 2.75 2015/06/30 09:59:53 ! Unicode/Unicode.pm Unicode/Unicode.xs encoding.pm VERSION++'ed to make bleadperl happy Message-Id: <CADED=K4QjMxGFAOLEuZUx3OtN-d-hokhurr4BYBE2E3okoxA7g@mail.gmail.com> 2.74 2015/06/25 ! Unicode/Unicode.xs Applied: #101486: [PATCH] reduce compiler warnings and stderr noise (again) https://rt.cpan.org/Ticket/Display.html?id=101486 ! bin/enc2xs Applied patch: #105471: make Encode build with -pedantic https://rt.cpan.org/Ticket/Display.html?id=105471 ! Byte/Makefile.PL CN/Makefile.PL EBCDIC/Makefile.PL JP/Makefile.PL KR/Makefile.PL Makefile.PL Symbol/Makefile.PL TW/Makefile.PL Applied patch: #102826: non-deterministic Makefiles https://rt.cpan.org/Ticket/Display.html?id=102826
2015-09-06Update 2.1.1 to 2.1.3mef2-6/+6
--------------------- ChangeLog is unknown.
2015-09-06Update to 2.00:wiz2-9/+8
2.00 2015-09-05 NEILB * Bumped version to 2.00 because last release failed to index: some of the earlier releases had a dist version of 0.xx, but various of the modules had version numbers 1.0x. PAUSE is now rightly stricter about monotonically increasing version numbers, so it is easiest just to jump ahead. 0.30 2015-09-04 NEILB * Dropped File::Slurp in favour of File::Slurper. RT#106853 from Henk van Oers. * Don't try to convert text to UTF-8 if it's already tagged as UTF-8. Thanks to Artem Krivopolenov. * Changed all remaining instances of 'use base' to 'use parent'; it was previously using a mixture of parent *and* base. * Added "use warnings" and "require 5.006" to all modules. 0.29_02 2015-08-03 NEILB * Don't try to convert text to UTF-8 if it's already tagged as UTF-8. Thanks to Artem Krivopolenov. * Changed all remaining instances of 'use base' to 'use parent'; it was previously using a mixture of parent *and* base. 0.29_01 2015-08-02 NEILB * Switched to Dist::Zilla * Added "use warnings" and "require 5.006" to all modules. * Moved custom-pom2 script from bin/ to examples/ * Reformatted this file as per CPAN::Changes::Spec
2015-09-06Add following line for make testmef1-1/+4
BUILD_DEPENDS+= p5-Test-Deep-[0-9]*:../../devel/p5-Test-Deep
2015-09-05Add following line for make testmef1-1/+4
BUILD_DEPENDS+= p5-Test-Differences-[0-9]*:../../devel/p5-Test-Differences BUILD_DEPENDS+= p5-Text-Diff-[0-9]*:../../textproc/p5-Text-Diff
2015-09-05Add one more line for make test.mef1-1/+2
BUILD_DEPENDS+= p5-Text-Diff-[0-9]*:../../textproc/p5-Text-Diff
2015-09-04Update miller to 2.1.1.wiz2-6/+8
Changes: v2.1.1 Incremental read-performance increase for CSV format While #51 is still underway, already there is nearly a 2x read-performance increase in v2.1.1 over v2.1.0. v2.1.0 Minor enhancements and bug fixes Highlights: travis-CI integration (thanks @SikhNerd!); hour-minute-second functions; fixed pretty-print alignment of UTF-8 data.
2015-09-03I'm not currently using this module, reset MAINTAINER.dsainty1-2/+2
2015-09-01Update to 2.0122mef2-7/+6
---------------- 2.0122 2015-09-01 - Enable the memory test on cygwin as well as Linux. - https://rt.cpan.org/Ticket/Display.html?id=104666 - Thanks to https://me.yahoo.com/howdidwegetherereally#f714d for the report. - Fix a typo in createElementNS - https://rt.cpan.org/Public/Bug/Display.html?id=106807 - Thanks to Rich for the report.
2015-09-01Update iso-codes 3.61mef2-6/+6
-------------- Dr. Tobias Quathamer <toddy@debian.org> Tue, 1 Sep 2015 [ ISO 639 translations ] * Thai by Theppitak Karoonboonyanan [ ISO 639-3 translations ] * Thai by Theppitak Karoonboonyanan [ ISO 3166 translations ] * Turkish by Atila KOÇ. Closes: #796207 * Hebrew by Lior Kaplan * Korean by Changwoo Ryu (TP) * Danish by Joe Hansen (TP) * Icelandic by Sveinn í Felli (TP) * Slovak by Ivan Masár [ ISO 4217 translations ] * Ukrainian by Yuri Chornoivan (TP) * Swedish by Anders Jonsson (TP) * Norwegian Bokmaal by Hans Fredrik Nordhaug (TP) * Danish by Joe Hansen (TP) * Icelandic by Sveinn í Felli (TP) * Polish by Jakub Bogusz (TP) * Hungarian by Balázs Úr (TP) [ ISO 3166-2 translations ] * Danish by Joe Hansen (TP) * Polish by Jakub Bogusz (TP) [ ISO 15924 translations ] * Swedish by Anders Jonsson (TP)
2015-08-31Unbreak on Bitrig by adding necessary parts to autoconf related filessevan6-21/+62
Add Bitrig to runConfigure script http://bugs.icu-project.org/trac/ticket/11881 http://bugs.icu-project.org/trac/ticket/11882
2015-08-31+otsnros1-1/+2
2015-08-31Import of open text summarizer as textproc/ots to pkgsrc-current.nros6-0/+101
Package in pkgsrc-wip by nros. The Open Text Summarizer is an open source tool for summarizing texts. The program reads a text and decides which sentences are important and which are not. OTS supports many (25+) languages which are configured in XML files. Several academic publications have benchmarked it and praised it.
2015-08-30Update pear-File_Find to 1.3.3.taca2-6/+6
1.3.3: Fix release 1.3.2: QA release * Bug #18779 PHP Deprecated: Assigning the return value of new by reference is deprecated * Bug #19530 Array to string conversion
2015-08-30Update pear-Console_Table to 1.2.1.taca2-6/+6
1.2.1: * Add travis configuration (Christian Weiske). * Try to autoload Console_Color2 first (Jurgen Rutten, PR #11). * Fix Composer definition syntax (Rob Loach, PR #9). 1.2.0: * Make border visibility configurable (Christian Weiske, Request #20186). * Allow to customize all border characters (Christian Weiske, Request #20182). * Fix notice when using setAlign() on other than first column (Christian Weiske, Bug #20181). * Use Console_Color2 to avoid notices from PHP 4 code (Christian Weiske, Bug #20188) 1.1.6: Use line breaks dependent on the current operating system.
2015-08-29Let's assume that the second p5-Sub-Exporter location is just a typo...joerg1-2/+2
2015-08-29Update to 3.3.7wen2-8/+8
Update DEPENDS Upstream changes: 3.3.7 2015-08-28 13:45:00+0900 - Fix for older Perl 5.8.8 or lower(#145) - Enable 5.8 tests again 3.3.6 2015-08-25 13:50:00+0900 - Fix issue 'include' makes stack pointer incorrect(#130) 3.3.5 2015-08-05 18:50:00+0900 - Update Mouse version for Perl 5.22 or higher
2015-08-28Update to 1.24mef2-7/+6
-------------- 2015-08-28 Sean M. Burke sburke@cpan.org * RELEASE 1.24. Fixing a little (BIG) bug that David Cusimano is a superstar for having noticed. Ah, what a difference a ";" vs a "," makes! [https://rt.cpan.org/Public/Bug/Display.html?id=105420] * I'M BACK. After nine months of semi-catastrophic system failures, and after Voyager-style flybys of a dozen project deadlines... and now I can somehow try to get back in the swing of things. * ANOTHER superstar is Mistah Brendan Byrd who said that there are [ https://rt.cpan.org/Public/Bug/Display.html?id=102357 ] many ports of Unidecode to other languages and that I should brag about that fact, and he is very extremely correct, so now the Pod in Unidecode.pm indeed does just that. * (I got my distro-building back up and running. WOLVERIIIINES!) * I'm thinking of having future Unidecode/*.pm data files contain the canonical Unicode character name for every character as a comment. Obviously, this would make the dist pretty big. But the lib/Unidecode/*.pm files is somewhere around a meg. What's a few megs more?... with the benefit of added clarity? Everyone's a winner!
2015-08-28+ miller.wiz1-1/+2
2015-08-28Import miller-2.0.0 as textproc/miller.wiz4-0/+40
Miller is like sed, awk, cut, join, and sort for name-indexed data such as CSV. With Miller, you get to use named fields without needing to count positional indices. This is something the Unix toolkit always could have done, and arguably always should have done. It operates on key-value-pair data while the familiar Unix tools operate on integer-indexed fields: if the natural data structure for the latter is the array, then Miller's natural data structure is the insertion-ordered hash map. This encompasses a variety of data formats, including but not limited to the familiar CSV. (Miller can handle positionally-indexed data as a special case.)
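The contrast between positional fields and named fields can be shown with a short Python sketch. It does not use Miller; `select_fields` is a hypothetical helper, and Python's insertion-ordered `dict` stands in for the insertion-ordered hash map the description mentions.

```python
import csv
import io


def select_fields(csv_text, names):
    """Pick fields by name, in the requested order, from CSV text."""
    records = csv.DictReader(io.StringIO(csv_text))
    # Each record is a mapping from header name to value, so no column
    # positions need to be counted.
    return [{name: record[name] for name in names} for record in records]


rows = select_fields("a,b,c\n1,2,3\n4,5,6\n", ["c", "a"])
print(rows)
```

Because the result is built by name, reordering the columns in the input file would not change the output.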
2015-08-26Update to 1.60. Changes:shattered2-6/+6
+ add configure option --with-man2html + update configure macros + update config.guess, config.sub
2015-08-26Update to 5.2.5:wiz7-34/+400
[ANNOUNCE] Link Grammar version 5.2.0 is now available. This is a major release of the parser, with many important changes in it. The internals of the parser have been re-organized, resulting in a speedup of 2x to 4x for typical English texts. Multiple multi-threading bugs were fixed, and there is now a simple multi-threading unit test. A memory leak was fixed, and a memory over-consumption bug was fixed. These changes were enabled by the final removal of the "fat link" code from the parser. Parser internals work continues apace: it is expected that a version 5.3.0 will follow shortly, featuring a completely re-designed tokenizer. This redesign should enable simpler and better morphology support. The ChangeLog notes other fixes as well: Version 5.2.0 (27 December 2014) * y'all, ain't, gonna, y'gotta: Beverly Hillbillies basilect. * Permanent removal of the fat-link code. * Remove deprecated constituent tree code. * Windows: add terminal screen resizing support. * Windows: a build fix. * reign, rule, run, leave, come: can take predicative adjective. * Rework costs for many verb-derived adjectives. * Handle (predicative) adjectival modifiers for assorted perfect verbs. * Fixes for various color names. * Fixes for various affirmative answers. * Add 100 missing verbs. * Add preliminary lxc-docker (docker.io) support. * Remove MSVC6 support. * Fix memleak introduced in version 5.1.0 * Speedup of 1.7x to 4x (depending on text) from linkage processing redesign. * Fix multi-threading safety bug. * Fix link-and-domain printing alignment (to handle utf8 char widths). * Windows: fixes for MSVC12 support. * Fix memory consumption bug (EMPTY_WORD) introduced in version 4.7.10. * Get rid of xrealloc, which clashes with libbfd symbol xrealloc. * Add multi-threaded parsing unit test. ================================================================= Link Grammar version 5.1.2 is now available. 
Download from: http://www.abisource.com/downloads/link-grammar/5.1.2/link-grammar-5.1.2.tar.gz The most serious fix in this release is a build-break fix for Apple OSX Mavericks. Other fixes, from the ChangeLog: * Fix greeting: "How do you do?" * Fix indirect object in 'what' questions: 'To what do you owe your success?' * Fix assorted questions with verb "to be". * Compile fixes for Apple OSX version "Mavericks" ================================================================= [ANNOUNCE] link-grammar version 5.1.0 This version includes a number of important changes. One of these is that the connectors can now be given a direction (head and tail indicators), so that link-grammar dependencies can now be true, hierarchical dependency arrows. This is of marginal importance for English, where dependency directions are implicit, but is vital for free-word-order languages, where bi-directional links are not enough. Another important change is that costs can now be arbitrary floating point numbers. This is particularly useful for providing fine-grained parse ranking. The LG cost system assigns a "cost" to every connector, and the sum-total of costs for a sentence determines the parse ranking. Since costs are additive, they behave as entropies (log P -- the logarithm of a probability: probabilities are multiplicative, logarithms are additive). Under the covers, there's been some major work on the tokenization (splitting sentences into words) and morphology (splitting words into morphemes) code. This work is ongoing, and should eventually result in much better support for non-English languages. Other notable changes include an updated Russian dictionary, and an assortment of changes to the English dictionary. An intriguing step towards phonology: LG can now distinguish between the use of the determiners "a" and "an" preceding nouns that start with consonants or vowels. Whether fancier phonology support is possible is a curious question. 
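The statement that additive costs behave like logarithms of probabilities can be checked numerically. This Python sketch assumes the sign convention cost = -log P, which the announcement does not spell out, and uses made-up connector probabilities; it is not taken from the Link Grammar sources.

```python
import math


def cost(probability):
    """Cost of one connector under the assumed convention cost = -log(P)."""
    return -math.log(probability)


def parse_cost(connector_probabilities):
    """Total cost of a parse: the sum of its connectors' costs."""
    return sum(cost(p) for p in connector_probabilities)


# Hypothetical probabilities for the connectors used in one parse.
probabilities = [0.5, 0.25]
total = parse_cost(probabilities)

# Adding costs is equivalent to multiplying probabilities.
print(total, -math.log(math.prod(probabilities)))
```

Under this convention a lower total cost corresponds to a more probable parse, which is why fractional costs allow finer-grained ranking.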
The full changelog is below: * Updated Russian dictionaries from Sergei Protasov. * Added morphology-based unknown-word handling for Russian, from Sergei. * Fix up fat-linkage code, which was recently broken... * API cleanup: many command-line options never belonged in the API. * New emoticon support was clobbering certain dictionary words. * Fix: "Go to spot X", "It happens at time T." * Add a dozen missing verbs. * Minor work on greetings. * Add mechanism for denoting fractional costs in the file-backed dict. * Fix: broken handling of gerunds (due to bad verb-wall connectors) * Major redesign of morpheme splitting mechanism (from AmirP) * Minor extensions to support numeric formulas, e.g. 1 + 1 = 2. * Remove fat linkage support from the SAT solver. * Enable build of SAT solver by default. * Fix multiple bugs with unit stripping. * Add bounds-checking to the C API. * Fix the old disjunct-printing implementation. * Add support for easy-to-use link direction indicator. * Add random morphology generator tool. * Partial support for phonetic use of "a" vs. "an" for English. * Rework how coordination between conjunctions works: "either... or ...", etc. * Major redesign of tokenization mechanism (from AmirP) ================================================================= Version 5.0.0 of the Link Grammar Parser is now available. (Yes, it's April 1st. No, this is not a joke. Maybe I'll think of something snarky next year.) We are proud to announce a major new release of the Link Grammar Parser! It contains many important changes and new additions. One of the most significant changes is that the license has been changed from the BSD license to the LGPL. This was done to enable considerably more flexibility in accepting contributions to the project: it seems that few are particularly interested in contributing to a BSD-licensed project. This change has enabled folding in some new work: o Arabic and Persian dictionaries! These were previously maintained as separate add-ons. 
Including them as part of the distribution should make it easier for interested users. o A new 'bindings' directory, containing code for Java, Python, Common Lisp, OCaML and AutoIt programming languages. The Python bindings are an updated version of the older pylinkgrammar-0.2.13 bindings. A SWIG interface file should make it easy to create other language bindings as well. o Improved morphology support. This will be invisible to most users, but it lays the groundwork for adding Hebrew support to the parser. o Expanded Lithuanian support. This remains a simplistic prototype, but it now performs a more sophisticated morphological analysis. o Experimental Turkish and Hebrew dictionaries. o A demo of the JSON parser server: it shows how to run the server, which will accept raw sentences on a socket, and returns the parsed forms. o Some slightly incompatible changes to the API: it was time for some housekeeping. o Misc minor updates to the English Language dictionaries. o Preliminary work for SQL-backed dynamic dictionaries. This should enable certain types of automated language learning. The full changelog is shown below. CHANGELOG: Version 5.0.0 (1 April 2014) * License upgrade to LGPLv2.1 * Arabic dictionaries, from Jon Dehdari * Persian dictionaries, from Jon Dehdari * Support for Hebrew tokenization, from Amir P. * Fix wild-card matching for user-supplied word lookup. * Prototype Turkish dictionary from Can Bruce. * Re-arrange programming language bindings directory. * Adopt the orphaned/unsupported pylinkgrammar Python bindings. * Deprecate the obsolete CNode interface. * Provide low-level perl bindings. * Adopt the orphaned/unsupported OCaML bindings. * Support affirmative replies: "Who did it?" "John's evil twin." * Expanded Lithuanian dictionary. * Minor disjunct printing fixes. * Fix: "Mary is too XXX to talk to." * Prototype Hebrew dictionary from Amir P. * Change !suffixes flag to !morphology. 
* Introduce a bi-directional connector, for free-word-order languages. * Introduce a symmetric-AND operator, for free-word-order languages. * Add demo shell script for running the JSON parse server. * Bugfix: Java server failing when input sentence has commas in it! * New !test and !debug commands for selective debugging support. * Print post-processing rejection message, when !bad is enabled. * Remove some deprecated functions for C API. * Remove all deprecated functions from Java API. * Initial support for an SQL-backed dynamic dictionary. ================================================================= Version 4.8.5 of the Link Grammar Parser is now available. This is the third release in about a week; each prompted by a build-break in the previous version. Sorry! There's been assorted (minor) new work, and this has been enough to cause trouble for various people. Some notable changes in the last 6 weeks: * Improved Russian (UTF-8) support for MSWindows users. * Build files for MSVC12 * Several Java binding fixes * English dictionary: add a verb-wall connector for present participles. A full list of changes is given below. If none of these seem to affect you, there is no particular need to upgrade. CHANGELOG: Version 4.8.5 (5 January 2014) * Update memory usage accounting; fix accounting bugs. * Fix Java garbage collection bug. * Fix numerous compiler warnings in the SAT-solver code. * Fix build-break involving multiple declaration of 'Boolean'. Version 4.8.4 (30 December 2013) * Fix build break for Mac OSX. Version 4.8.3 (30 December 2013) * Create new msvc12 build files, restore old msvc9 files. * Revert location of the Windows mbrtowc declaration. * Add verb-wall connector for present participles. * Fix build-time include file directory paths. * Provide the 'any' language to enumerate all possible linkages. * Fix recognition of U+00A0, c2 a0, NO-BREAK SPACE as whitespace. * Improve parse-time performance of exceptionally long sentences. 
* Fix crash on certain sentences containing equals sign. Version 4.8.2 (25 November 2013) * More MSWindows UTF-8/multi-byte fixes (for Russian). * Add missing JSONUtils file. Version 4.8.1 (21 November 2013) * Ongoing work on viterbi. * Updated MSVC9 project files from Jand Hashemi (Lucky--) * Fix important bug in Java services: return top parses, not random ones. * Java: for the link-diagram string, do not limit to 80 char term width. * Windows: UTF-8 fixes so that Russian works in most MSWindows locales. ================================================================= Version 4.8.0 of the Link Grammar Parser is now available. This is the start of a new version series, containing an important change to the English language dictionary. Three new link types are introduced WV, CV and IV. These are used to connect the left-wall to the primary verb of the sentence (WV), to connect the ruling clause to the primary verb of a dependent clause (CV), and a similar link for certain infinitive verbs (IV). The goal of these links is to make it easier to locate verbs, and thus to provide a more direct mapping from the link-grammar formalism to a dependency parse (as dependency parses always put the verb at the root of a sentence). These are not the first links that explicitly indicate root verbs: several other link types already play this role: The AF, CP, Eq, COq and B links already play this role. The new WV, CV and IV links round out this capability and do so in a very general form. See http://www.abisource.com/projects/link-grammar/dict/section-WV.html for details. With this release, we expect that all (non-auxiliary) verbs in a sentence will be linked either to the wall, or to a controlling parent. We also expect there to be some additional fixes and tightening-up to occur in future releases, especially in regards to comparative sentences. This release also includes a variety of fixes to the Java API/server. In addition, some ancient, deprecated C code was removed. 
CHANGELOG: Version 4.8.0 (24 October 2013) * Fix "he answered yes" * Support bulleted, numbered lists. * New link types from Lian Ruiting, for identifying the head-verb. * Java: fix bug when totaling WordNet word-sense score. * Java: add info to README about using the JSON parse server. * Java: remove many deprecated functions. * C API: remove some deprecated functions. * Java: fix silent failure when library is not found. * Java: Add support for fetching the ASCII-art diagram string. * Java: Fix insane language selection initialization. * Fix: "The pig runs SLOWER than the cat." * Fix: conjoined superlatives: "... the longest and the farthest." * Fix: "inside" can be used with conjunction: "near or inside..." * Fix: conjoined question modifiers: "exactly when and precisely where..." * Fix: issue 59: crash/corruption when dictionary opened twice. * Fix: assorted exclamations! ================================================================= Version 4.7.12 of the Link Grammar Parser is now available. The biggest change in this version is a sharply updated Russian dictionary, which fixes a large number of bugs generated during the initial release. Thanks to Sergey Protasov who did almost all this work! The other notable change is that the fat-link code is no longer built by default. It will be permanently removed in some future version, "real soon now". A miscellany of other minor changes are listed below. The link-grammar homepage: http://www.abiword.org/projects/link-grammar/ Download: http://www.abiword.org/downloads/link-grammar/4.7.12/link-grammar-4.7.12.tar.gz WHAT IS LINK GRAMMAR? The Link Grammar Parser is a syntactic parser of English (and other languages as well), based on link grammar, an original theory of English syntax. 
Given a sentence, the system assigns to it a syntactic structure, which consists of a set of labelled links connecting pairs of words. The parser also produces a "constituent" (Penn tree-bank style phrase tree) representation of a sentence (showing noun phrases, verb phrases, etc.). The RelEx extension provides dependency-parse output. CHANGELOG: Version 4.7.12 (25 May 2013) * Large fixes to the Russian dictionaries. * Windows: Explicitly fail if cygwin version is too old. * Tweak the lt dict to work again with the modern parser. * Make the fat linkages code be compile-time configurable. * Disable fat linkages by default; mark as deprecated. * Fix SAT-solver build; recent changes had broken it. * Export read-dict.h as a public API. * Ongoing development of the Viterbi prototype. * Windows: some UTF8/widechar refactoring. * Java bindings: add method to set the language. * CMake: add version checking to the CMakefile * Fix: failed handling of capitalized first word for Russian. * Fix: stemming failures in many cases (for Russian dictionaries) * Add flag to suppress stem-suffix printing. * Windows: Fixes to MSVC6 build files. * Fix: hash-table bug affecting Russian dictionaries
2015-08-23Remove two bl3.mk files that shouldn't be there.wiz2-28/+0
No headers or libraries to link to.
2015-08-23Bump PKGREVISION for nettle shlib major bump.wiz1-2/+2
2015-08-23Update to 1.43:wiz2-6/+6
1.43 2015-08-21 NEILB - Got rid of the "Redundant argument in sprintf" warnings from Text::Diff::Table on Perl 5.021+. RT#100505 and RT#106602. - Metadata and doc now refer to NEILB's repo rather than OVID's.
2015-08-22Add DWB.leot1-1/+2
2015-08-22Import textproc/DWB as DWB-20150517. From Carsten Kunze via pkgsrc-wip.leot28-0/+1017
The Documenter's Workbench (DWB) Release 3.3 is AT&T's original software distribution of nroff and troff (ditroff), the preprocessors tbl, eqn, pic, and grap, and the macro packages man, ms, and mm.