summaryrefslogtreecommitdiff
path: root/textproc
AgeCommit message (Collapse)AuthorFilesLines
2010-06-06Fix building with different db libsadam1-3/+4
2010-06-05Update to Jinja2 2.5markd3-12/+67
drop maintainership. Version 2.5 * improved the sort filter (should have worked like this for a long time) by adding support for case insensitive searches. * fixed a bug for getattribute constant folding. * support for newstyle gettext translations which result in a nicer in-template user interface and more consistent catalogs. (Newstyle Gettext) * it's now possible to register extensions after an environment was created. Version 2.4.1 * fixed an error reporting bug for undefineds. Version 2.4 * the environment template loading functions now transparently pass through a template object if it was passed to it. This makes it possible to import or extend from a template object that was passed to the template. * added a ModuleLoader that can load templates from precompiled sources. The environment now features a method to compile the templates from a configured loader into a zip file or folder. * the _speedups C extension now supports Python 3. * added support for autoescaping toggling sections and support for evaluation contexts (Evaluation Context). * extensions have a priority now. Version 2.3.1 * fixed an error reporting bug on all python versions * fixed an error reporting bug on Python 2.4 Version 2.3 * fixes issue with code generator that causes unbound variables to be generated if set was used in if-blocks and other small identifier problems. * include tags are now able to select between multiple templates and take the first that exists, if a list of templates is given. * fixed a problem with having call blocks in outer scopes that have an argument that is also used as local variable in an inner frame * greatly improved error message reporting * implicit tuple expressions can no longer be totally empty. This change makes {% if %}...{% endif %} a syntax error now. * added support for translator comments if extracted via babel. * added with-statement extension. * experimental Python 3 support. Version 2.2.1 * fixes some smaller problems for Jinja2 on Jython. Version 2.2 * Include statements can now be marked with ignore missing to skip non existing templates. * Priority of not raised. It's now possible to write not foo in bar as an alias to foo not in bar like in python. Previously the grammar required parentheses (not (foo in bar)) which was odd. * Fixed a bug that caused syntax errors when defining macros or using the {% call %} tag inside loops. * Fixed a bug in the parser that made {{ foo[1, 2] }} impossible. * Made it possible to refer to names from outer scopes in included templates that were unused in the callers frame * Fixed a bug that caused internal errors if names where used as iteration variable and regular variable after the loop if that variable was unused before the loop. * Added support for optional scoped modifier to blocks. * Added support for line-comments. * Added the meta module. * Renamed (undocumented) attribute overlay to overlayed on the environment because it was clashing with a method of the same name. * speedup extension is now disabled by default.
2010-06-04Fix dependencyjoerg1-2/+2
2010-06-01Update to Sphinx-0.6.6markd3-7/+10
many bug fixes.
2010-06-01Update py-enchant to 1.6.2.obache5-12/+85
Based on PR#43384 by Wen Heping. * Migrated from SWIG to ctypes since 1.5.0: * Add dependency on devel/py-ctypes for python2.4 * Fix paths for libenchant to exactly load suitable one. * No need c compiler and buildlink with textproc/enchant. Changes: 1.6.2: * Upgraded bundled enchant to v1.6.0. * Fixed bug in printf() utility function; all input args are now converted to strings before printing. 1.6.1: * Fixed loading of enchant DLL on win32 without pkg_resources installed. * Fixed HTMLChunker to handle unescaped < and > characters that are clearly not part of a tag. 1.6.0: * Upgraded to enchant v1.5.0: * new Broker methods get_param() and set_param() allow runtime customisation of provider data * Added the concept of 'chunkers' to enchant.tokenize.get_tokenizer(). These serve split split the text into large chunks of checkable tokens. * implemented a simple HTMLChunker class * Moved error classes into 'enchant.errors' for easier importing * Moved testcases into separate files so they're not loaded by default * Allowed SpellChecker to use default language if none is specified * Improved compatibility with Python 3 1.5.3: * Fixed termination conditions in English tokenization loop. * Improved unicode detection in English tokenizer. * Made enchant spellcheck all of its docstrings as part of the unittest suite. 1.5.2: * Modify utils.get_resource_filename and utils.win32_data_files for compatibility with py2exe (which was broken in the move to ctypes). Thanks to Stephen George for the fix. 1.5.1: * SpellChecker.add_to_personal renamed to SpellChecker.add and fixed to use the corresponding Dict method. 1.5.0: * Migrated from SWIG to ctypes: * now runs under PyPy! * also opens possibilities for Jython, IronPython, ... * Compatibility updates for Python 3.0, mostly around unicode strings * Dropped compatibility with Python 2.2
2010-05-28Fix buildjoerg3-6/+16
2010-05-26Prefer same section order as mdocml. Bump version.wiz2-4/+6
2010-05-24Update p5-Pod-Simple from version 3.13 to version 3.14.seb2-6/+6
Upstream changes: 2009-04-27 * Release 3.14 Removed explicit loading of UNIVERSAL. RJBS. Reversed the change applied in release 3.09 to fix RT #12239. POD tag found inside a complex POD tag (e.g., "C<<< I<foo> >>>") is again parsed as a tag embedded in a tag instead of text and entities. The previous interpretation of `perldoc perlpod` was mistaken. (RT #55602 from Christopher J. Madsen).
2010-05-20It seems that something wrong with distinfo, DIST_SUBR directory name.taca1-4/+4
2010-05-19+ cabocha, p5-cabocha, py-cabocha, ruby-cabochaobache1-1/+5
2010-05-19Import ruby-cabocha-0.53 as textproc/ruby-cabocha.obache3-0/+39
CaboCha is a Japanese dependency analysis machine based on Support Vector Machines. It is (89.29%) system that accuracy is the highest as a statistical Japanese dependency analysis machine as of June, 2001. Moreover, definite analytical algorithm (Cascaded Chunking Model) that doesn't do back-track is adopted, and an efficient analysis can be done comparatively. This package is ruby module for CaboCha.
2010-05-19Import py-cabocha-0.53 as textproc/py-cabocha.obache3-0/+33
CaboCha is a Japanese dependency analysis machine based on Support Vector Machines. It is (89.29%) system that accuracy is the highest as a statistical Japanese dependency analysis machine as of June, 2001. Moreover, definite analytical algorithm (Cascaded Chunking Model) that doesn't do back-track is adopted, and an efficient analysis can be done comparatively. This package is python module for CaboCha.
2010-05-19Import p5-cabocha-0.53 as textproc/p5-cabocha.obache2-0/+28
CaboCha is a Japanese dependency analysis machine based on Support Vector Machines. It is (89.29%) system that accuracy is the highest as a statistical Japanese dependency analysis machine as of June, 2001. Moreover, definite analytical algorithm (Cascaded Chunking Model) that doesn't do back-track is adopted, and an efficient analysis can be done comparatively. This package is perl module for CaboCha.
2010-05-19Import cabocha-0.53 as textproc/cabocha.obache8-0/+140
CaboCha is a Japanese dependency analysis machine based on Support Vector Machines. It is (89.29%) system that accuracy is the highest as a statistical Japanese dependency analysis machine as of June, 2001. Moreover, definite analytical algorithm (Cascaded Chunking Model) that doesn't do back-track is adopted, and an efficient analysis can be done comparatively.
2010-05-19+ yamcha, p5-yamcha, py-yamcha, ruby-yamchaobache1-1/+5
2010-05-19Import ruby18-yamcha-0.33 as textproc/ruby-yamcha.obache3-0/+36
YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. YamCha is using a state-of-the-art machine learning algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in 1995. This package is ruby module for YamCha.
2010-05-19Import py26-yamcha-0.33 as textproc/py-yamcha.obache3-0/+33
YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. YamCha is using a state-of-the-art machine learning algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in 1995. This package is python module for YamCha.
2010-05-19Import p5-yamcha-0.33 as textproc/p5-yamcha.obache2-0/+28
YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. YamCha is using a state-of-the-art machine learning algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in 1995. This package is perl module for YamCha.
2010-05-19Import yamcha-0.33 as textproc/yamcha.obache6-0/+85
YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. YamCha is using a state-of-the-art machine learning algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in 1995. YamCha is exactly the same system which performed the best in the CoNLL2000 Shared Task, Chunking and BaseNP Chunking task.
2010-05-17Add uni2asciimorr1-1/+2
2010-05-17uni2ascii and ascii2uni convert between UTF-8 Unicode and any of amorr4-0/+44
variety of 7-bit ASCII equivalents including: hexadecimal and decimal HTML numeric characters references, \u-escapes, standard hexadecimal and raw hexadecimal. Packaged by bartosz.kuzma, import from pkgsrc-wip.
2010-05-17Bump PKGREVISION for config file path fix in a2x.wiz1-1/+2
2010-05-16Fix a2x CONF_DIR.gls1-2/+2
Thanks wiz!
2010-05-16LICENSE=ruby-licenseobache2-4/+4
2010-05-16Update ruby-json-pure to 1.4.3.obache2-6/+6
2010-05-05 (1.4.3) * Fixed some test assertions, from Ruby r27587 and r27590, patch by nobu. * Fixed issue http://github.com/flori/json/issues/#issue/20 reported by electronicwhisper@github. Thx!
2010-05-16Update ruby-json to 1.4.3.obache3-11/+11
2010-05-05 (1.4.3) * Fixed some test assertions, from Ruby r27587 and r27590, patch by nobu. * Fixed issue http://github.com/flori/json/issues/#issue/20 reported by electronicwhisper@github. Thx!
2010-05-16Note commented out LICENSE.obache1-1/+2
mainly ICOT Free Software License.
2010-05-16LICENSE=gnu-lgpl-v2.1obache1-1/+2
2010-05-16LICENSE=modified-bsdobache1-1/+2
2010-05-15LICENSE=gnu-gpl-v2obache1-1/+2
2010-05-15Add comment about "used by suikyo-docs".obache1-1/+2
2010-05-15Updated to 8.5.3.gls1-28/+0
Patches no longer required.
2010-05-15Updated to 8.5.3.gls3-15/+27
Changelog way too long to mention here. Refer to http://www.methods.co.nz/asciidoc/CHANGELOG.html for complete list of changes.
2010-05-15Note commented out LICENSE (Ruby's).obache1-1/+2
2010-05-15Note commented out LICENSE (Ruby's).obache1-1/+2
2010-05-08+ gutcheck.wiz1-1/+2
2010-05-08Initial import of gutcheck-0.991:wiz4-0/+42
Gutcheck is a plain-text checking program that specializes in reporting the problems that spellcheckers don't--errors like mismatched quotes, misplaced punctuation, unintended blank lines. It is specifically tuned for checking texts for submission to Project Gutenberg, though it can be useful elsewhere as well.
2010-05-08add a patch originating from eric4 (the IDE) which supposedly fixesdrochner3-3/+16
a problem with foreign characters bump PKGREVISION
2010-05-07Drop maintainership on these packages, I am no longer interested.ahoka6-12/+12
2010-05-06Work around the inconsistent behavior "cp -r" when coping symbolic link.sbd2-5/+9
Closes PR 41142.
2010-05-05Update p5-YAML-LibYAML from version 0.32 to version 0.33.seb2-8/+7
Pkgsrc changes: - this package minimum required Perl version (PERL5_REQD) can be dropped as pkgsrc requires a newer version anyway - placate pkglint: whitespace fix Upstream changes: version: 0.33 date: Wed Apr 14 17:57:10 PDT 2010 changes: - Applied Gisle Aas' patch from https://rt.cpan.org/Ticket/Display.html?id=43115
2010-05-04Update tex-xmltex{,-doc} to Revision 17012.minskim5-32/+18
- Fixed the options to build a format.
2010-05-04Update ruby-json-pure from 1.2.0 to 1.4.2.taca4-11/+25
2010-04-26 (1.4.2) * Applied patch from naruse Yui NARUSE <naruse@airemix.com> to make building with Microsoft Visual C possible again. * Applied patch from devrandom <c1.github@niftybox.net> in order to allow building of json_pure if extensiontask is not present. * Thanks to Dustin Schneider <dustin@stocktwits.com>, who reported a memory leak, which is fixed in this release. * Applied 993f261ccb8f911d2ae57e9db48ec7acd0187283 patch from josh@github. 2010-04-25 (1.4.1) * Fix for a bug reported by Dan DeLeo <dan@kallistec.com>, caused by T_FIXNUM being different on 32bit/64bit architectures. 2010-04-23 (1.4.0) * Major speed improvements and building with simplified directory/file-structure. * Extension should at least be comapatible with MRI, YARV and Rubinius. 2010-04-07 (1.2.4) * Triger const_missing callback to make Rails' dynamic class loading work. 2010-03-11 (1.2.3) * Added a State#[] method which returns an attribute's value in order to increase duck type compatibility to Hash. 2010-02-27 (1.2.2) * Made some changes to make the building of the parser/generator compatible to Rubinius. 2009-11-25 (1.2.1) * Added :symbolize_names option to Parser, which returns symbols instead of strings in object names/keys.
2010-05-04Update ruby-json package from 1.2.0 to 1.4.2.taca5-16/+29
2010-04-26 (1.4.2) * Applied patch from naruse Yui NARUSE <naruse@airemix.com> to make building with Microsoft Visual C possible again. * Applied patch from devrandom <c1.github@niftybox.net> in order to allow building of json_pure if extensiontask is not present. * Thanks to Dustin Schneider <dustin@stocktwits.com>, who reported a memory leak, which is fixed in this release. * Applied 993f261ccb8f911d2ae57e9db48ec7acd0187283 patch from josh@github. 2010-04-25 (1.4.1) * Fix for a bug reported by Dan DeLeo <dan@kallistec.com>, caused by T_FIXNUM being different on 32bit/64bit architectures. 2010-04-23 (1.4.0) * Major speed improvements and building with simplified directory/file-structure. * Extension should at least be comapatible with MRI, YARV and Rubinius. 2010-04-07 (1.2.4) * Triger const_missing callback to make Rails' dynamic class loading work. 2010-03-11 (1.2.3) * Added a State#[] method which returns an attribute's value in order to increase duck type compatibility to Hash. 2010-02-27 (1.2.2) * Made some changes to make the building of the parser/generator compatible to Rubinius. 2009-11-25 (1.2.1) * Added :symbolize_names option to Parser, which returns symbols instead of strings in object names/keys.
2010-05-02Update to 2.2.4:wiz2-6/+6
2.2.6 (2010-03-02) ================== Bugs fixed ---------- * Fixed several Python 3 regressions by building with Cython 0.11.3. 2.2.5 (2010-02-28) ================== Features added -------------- * Support for running XSLT extension elements on the input root node (e.g. in a template matching on "/"). Bugs fixed ---------- * Crash in XPath evaluation when reading smart strings from a document other than the original context document. * Support recent versions of html5lib by not requiring its ``XHTMLParser`` in ``htmlparser.py`` anymore. * Manually instantiating the custom element classes in ``lxml.objectify`` could crash. * Invalid XML text characters were not rejected by the API when they appeared in unicode strings directly after non-ASCII characters. * lxml.html.open_http_urllib() did not work in Python 3. * The functions ``strip_tags()`` and ``strip_elements()`` in ``lxml.etree`` did not remove all occurrences of a tag in all cases. * Crash in XSLT extension elements when the XSLT context node is not an element.
2010-05-02Update to 4.2.1:wiz2-6/+6
Sed 4.2.1 * fix parsing of s/[[[[[[[[[]// * security contexts are preserved by -i too under SELinux * temporary files for sed -i are not made group/world-readable until they are complete
2010-05-02Update to 1.0.5: 1.5 years of bug fixes.wiz3-10/+10
2010-05-02Update to 1.6.0:wiz2-8/+7
* Fix bug 12567: the ispell sources aren't licensed under the LGPL * Add a function to get enchant's version (enchant_get_version) * Disable zemberek plugin by default, as it's known to cause issues/crashes with WebKit * Fix bug 12472: Win32 DLL dependency not found popup occurs when module has unmet dependencies * Possibly fix Ubuntu bug 474062 * Fix bug 12409: Registry handle not closed in enchant_get_registry_value_ex * Fix bug 12406: Leak in _enchant_get_user_home_dirs() on Windows * Fix bug 12007: Update FSF address * Fix bug 12305: Zemberek module lists a Turkish dictionary even without Zemberek installed * Don't assert if passed a null string list * Fix bug 12350: enchant_pwl_init_with_file truncates pwl file * Fix a double-free memory corruption bug * Fix bug 12173: fix some small memory leaks * Fix bug 12174: mis-acceptence of dictionaries which start with a partial match of the lang id * Fix bug 12160: enchant 1.5.0 always looks in "lib" dir for plugins * Fix the build with the MSVC compiler * Add a --with-system-myspell option * Package missing compile-resource file * Compare paths ignoring case sensitivity on windows
2010-05-01+ py-Whoosh.wiz1-1/+2
2010-05-01Initial import of py-Whoosh-0.1.13:wiz4-0/+132
Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. Some of Whoosh's features include: * Pythonic API. * Pure-Python. No compilation or binary packages needed, no mysterious crashes. * Fielded indexing and search. * Fast indexing and retrieval -- much faster than any other pure-Python solution. * Pluggable scoring algorithm (including BM25F), text analysis, storage, posting format, etc. * Powerful query language parsed by pyparsing. * Pure Python spell-checker (as far as I know, the only one). Whoosh takes much inspiration (and sometimes translates code) from other open-source search engines. The fundamental design is similar to (but does not entirely work like and is not compatible with) Lucene, but uses KinoSearch's indexing algorithm, some scoring algorithms from Terrier, and the English morphological variation generator from Minion.