summaryrefslogtreecommitdiff
path: root/textproc
AgeCommit message (Collapse)AuthorFilesLines
2010-05-02Update to 2.2.4:wiz2-6/+6
2.2.6 (2010-03-02) ================== Bugs fixed ---------- * Fixed several Python 3 regressions by building with Cython 0.11.3. 2.2.5 (2010-02-28) ================== Features added -------------- * Support for running XSLT extension elements on the input root node (e.g. in a template matching on "/"). Bugs fixed ---------- * Crash in XPath evaluation when reading smart strings from a document other than the original context document. * Support recent versions of html5lib by not requiring its ``XHTMLParser`` in ``htmlparser.py`` anymore. * Manually instantiating the custom element classes in ``lxml.objectify`` could crash. * Invalid XML text characters were not rejected by the API when they appeared in unicode strings directly after non-ASCII characters. * lxml.html.open_http_urllib() did not work in Python 3. * The functions ``strip_tags()`` and ``strip_elements()`` in ``lxml.etree`` did not remove all occurrences of a tag in all cases. * Crash in XSLT extension elements when the XSLT context node is not an element.
2010-05-02Update to 4.2.1:wiz2-6/+6
Sed 4.2.1 * fix parsing of s/[[[[[[[[[]// * security contexts are preserved by -i too under SELinux * temporary files for sed -i are not made group/world-readable until they are complete
2010-05-02Update to 1.0.5: 1.5 years of bug fixes.wiz3-10/+10
2010-05-02Update to 1.6.0:wiz2-8/+7
* Fix bug 12567: the ispell sources aren't licensed under the LGPL * Add a function to get enchant's version (enchant_get_version) * Disable zemberek plugin by default, as it's known to cause issues/crashes with WebKit * Fix bug 12472: Win32 DLL dependency not found popup occurs when module has unmet dependencies * Possibly fix Ubuntu bug 474062 * Fix bug 12409: Registry handle not closed in enchant_get_registry_value_ex * Fix bug 12406: Leak in _enchant_get_user_home_dirs() on Windows * Fix bug 12007: Update FSF address * Fix bug 12305: Zemberek module lists a Turkish dictionary even without Zemberek installed * Don't assert if passed a null string list * Fix bug 12350: enchant_pwl_init_with_file truncates pwl file * Fix a double-free memory corruption bug * Fix bug 12173: fix some small memory leaks * Fix bug 12174: mis-acceptence of dictionaries which start with a partial match of the lang id * Fix bug 12160: enchant 1.5.0 always looks in "lib" dir for plugins * Fix the build with the MSVC compiler * Add a --with-system-myspell option * Package missing compile-resource file * Compare paths ignoring case sensitivity on windows
2010-05-01+ py-Whoosh.wiz1-1/+2
2010-05-01Initial import of py-Whoosh-0.1.13:wiz4-0/+132
Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. Some of Whoosh's features include: * Pythonic API. * Pure-Python. No compilation or binary packages needed, no mysterious crashes. * Fielded indexing and search. * Fast indexing and retrieval -- much faster than any other pure-Python solution. * Pluggable scoring algorithm (including BM25F), text analysis, storage, posting format, etc. * Powerful query language parsed by pyparsing. * Pure Python spell-checker (as far as I know, the only one). Whoosh takes much inspiration (and sometimes translates code) from other open-source search engines. The fundamental design is similar to (but does not entirely work like and is not compatible with) Lucene, but uses KinoSearch's indexing algorithm, some scoring algorithms from Terrier, and the English morphological variation generator from Minion.
2010-04-30+ dadadodo.wiz1-1/+2
2010-04-30Initial import of dadadodo-1.04:wiz4-0/+32
dadadodo analyses text files and generates Markov chains of word frequencies; it can then generate random sentences based on that data.
2010-04-28Add patch-aj to include resolv.h conditionally.obache2-1/+20
https://bugzilla.gnome.org/show_bug.cgi?id=617053 Fixes build on Interix-6.0.
2010-04-24Add & enable p5-Text-MicroTemplateseb1-1/+2
2010-04-24Initial import of p5-Text-MicroTemplate version 0.11 in the NetBSDseb3-0/+27
Packages Collection. The Perl 5 module Text::MicroTemplate is a standalone, fast, intelligent, extensible template engine.
2010-04-24Add & enable p5-Text-Microformatseb1-1/+2
2010-04-24Initial import of p5-Text-Microformat version 0.04 in the NetBSDseb3-0/+35
Packages Collection. The Perl 5 module Text::Microformat is a Microformat parser. It sports a very pluggable API, which allows not only new kinds of Microformats to be added, but also extension of the parser itself, to allow new parsing metaphors and source document encodings
2010-04-24Add & enable p5-XML-TreeBuilderseb1-1/+2
2010-04-24Initial import of p5-XML-TreeBuilder version 3.09 in the NetBSDseb3-0/+36
Packages Collection. The Perl 5 module XML::TreeBuilder uses XML::Parser to make XML document trees constructed of XML::Element objects (and XML::Element is a subclass of HTML::Element adapted for XML). XML::TreeBuilder is meant particularly for people who are used to the HTML::TreeBuilder / HTML::Element interface to document trees, and who don't want to learn some other document interface.
2010-04-17remove outdated mirror.zafer1-3/+2
2010-04-15Update ruby-haml from version 2.2.13 to version 2.2.23.seb3-7/+10
Pkgsrc changes: - adjust PLIST Upstream changes: * 2.2.23 [haml/sass] Don’t crash when rake gems is run in Rails with Haml installed. Thanks to Florian Frank. [haml ]Don’t remove \n in filters with interpolation. [haml] Silence those annoying "regexp match /.../n against to UTF-8 string" warnings. [sass] When raising a file-not-found error, add a list of load paths that were checked. [sass] If an import isn’t found for a cached Sass file and the :full_exception option is enabled, print the full exception rather than raising it. [sass] Fix a bug with a weird interaction with Haml, DataMapper, and Rails 3 that caused some tag helpers to go into infinite recursion. * 2.2.22 [haml/sass] Add a railtie so Haml and Sass will be automatically loaded in Rails 3. Thanks to Daniel Neighman. [haml] Add a deprecation message for using - with methods like form_for that return strings in Rails 3. This is the same deprecation that exists in Rails 3. [haml] Make sure line numbers are reported correctly when filters are being used. [haml/sass] Make loading the gemspec not crash on read-only filesystems like Heroku’s. [haml] Don’t crash when methods like form_for return nil in, for example, Rails 3 beta. [haml] Compatibility with Rails 3 beta’s RJS facilities. * 2.2.21 [haml/sass] Fix a few bugs in the git-revision-reporting in Haml::Version#version. In particular, it will still work if git gc has been called recently, or if various files are missing. [hamll/sass] Always use __FILE__ when reading files within the Haml repo in the Rakefile. According to this bug report, this should make Haml work better with Bundler. [haml] Make the error message for - end a little more intuitive based on user feedback. [haml] Compatibility with methods like form_for that return strings rather than concatenate to the template in Rails 3. [haml] Add a with_tabs helper, which sets the indentation level for the duration of a block. * 2.2.20 [haml] The form_tag Rails helper is now properly marked as HTML-safe when using Rails’ XSS protection with Rails 2.3.5. [haml/sass] Calls to defined? shouldn’t interfere with Rails’ autoloading in very old versions (1.2.x). [haml] Fix a bug where calls to ActionView’s render method with blocks and layouts wouldn’t work under the Rails 3.0 beta. [haml] Fix a bug where the closing tags of nested calls to Haml::Helpers#haml_concat were improperly escaped under the Rails 3.0 beta. [sass] If the cache file for a given Sass file is corrupt because it doesn’t have enough content, produce a warning and read the Sass file rather than letting the exception bubble up. This is consistent with other sorts of sassc corruption handling. * 2.2.19 [haml] Fix a bug with the integration with Rails’ XSS support. In particular, correctly override safe_concat. [sass] There were no changes made to Sass between versions 2.2.18 and 2.2.19. * 2.2.18 [haml] Support the new XSS-protection API used in Rails 3. [haml/sass] Use Rails.env rather than RAILS_ENV when running under Rails 3.0. Thanks to Duncan Grazier. [haml] Add a --unix-newlines flag to all executables for outputting Unix-style newlines on Windows. [haml] Fix a couple bugs with the :erb filter: make sure error reporting uses the correct line numbers, and allow multi-line expressions. [haml] Fix a parsing bug for HTML-style attributes including #. [sass] Support :line_numbers as an alias for :line_comments, since that’s what the docs have said forever. Similarly, support --line-numbers as a command-line option. [sass] Add a --unix-newlines flag to all executables for outputting Unix-style newlines on Windows. [sass] Add a :unix_newlines option for Sass::Plugin for outputting Unix-style newlines on Windows. [sass] Fix the --cache-location flag, which was previously throwing errors. Thanks to tav. [sass] Allow comments at the beginning of the document to have arbitrary indentation, just like comments elsewhere. Similarly, comment parsing is a little nicer than before. * 2.2.17 [haml] Fix compilation of HTML5 doctypes when using html2haml. [sass] nil values for Sass options are now ignored, rather than raising errors. [sass] When the :full_exception option is false, raise the error in Ruby code rather than swallowing it and printing something uninformative. [sass] Fixed error-reporting when something goes wrong when loading Sass using the sass executable. This used to raise a NameError because Sass::SyntaxError wasn’t defined. Now it’ll raise the correct exception instead. [sass] Report the filename in warnings about selectors without properties. [sass] Fix a bug that appears when Plugin template locations have multiple trailing slashes. Thanks to Jared Grippe. [sass] Must Read! When @import is given a filename without an extension, the behavior of rendering a CSS @import if no Sass file is found is deprecated. In future versions, @import foo will either import the template or raise an error. * 2.2.16 [haml] Abstract out references to ActionView::TemplateError, ActionView::TemplateHandler, etc. These have all been renamed to ActionView::Template::* in Rails 3.0. [sass] Fixed a bug where modules containing user-defined Sass functions weren’t made available when simply included in Sass::Script::Functions (Functions needed to be re-included in Functions::EvaluationContext). Now the module simply needs to be included in Sass::Script::Functions. * 2.2.15 [haml] Allow if statements with no content followed by else clauses. For example: - if foo - else bar [sass] Added Sass::Script::Color#with for a way of setting color channels that’s easier than manually constructing a new color and is forwards-compatible with alpha-channel colors (to be introduced in Sass 2.4). [sass] Added a missing require in Sass that caused crashes when it was being run standalone. * 2.2.14 [haml] Don’t print warnings when escaping attributes containing non-ASCII characters in Ruby 1.9. [haml] Don’t crash when parsing an XHTML Strict doctype in html2haml. [haml] Support the HTML5 doctype in an XHTML document by using !!! 5 as the doctype declaration. [sass] All Sass functions now raise explicit errors if their inputs are of the incorrect type. [sass] Allow the SassScript rgb() function to take percentages in addition to numerical values. [sass] Fixed a bug where SassScript strings with # followed by #{} interpolation didn’t evaluate the interpolation. [sass] SassScript Ruby API These changes only affect people defining their own Sass functions using Sass::Script::Functions. Sass::Script::Color#value attribute is deprecated. Use Sass::Script::Color#rgb instead. The returned array is now frozen as well. Add an assert_type function that’s available to Sass::Script::Functions. This is useful for typechecking the inputs to functions. [sass] Rack Support Sass 2.2.14 includes Rack middleware for running Sass, meaning that all Rack-enabled frameworks can now use Sass. To activate this, just add require 'sass/plugin/rack' use Sass::Plugin::Rack to your config.ru. See the Sass::Plugin::Rack documentation for more details.
2010-04-15add a build dependency on py-expat so the build doesn't fail with this message:jmcneill1-1/+3
failed to find and generate stubs for libxml2, aborting ... <class 'xml.sax._exceptions.SAXReaderNotAvailable'> No parsers found *** Error code 1 Stop. make: stopped in /usr/pkgsrc/textproc/py-libxml2
2010-04-14+ sgrep, py-cElementTree.wiz1-1/+3
2010-04-14Initial import of sgrep:wiz4-0/+50
sgrep (structured grep) is a tool for searching and indexing text, SGML,XML and HTML files and filtering text streams using structural criteria. The data model of sgrep is based on regions, which are nonempty substrings of text. Regions are typically occurrences of constant strings, SGML-tags, or meaningful text elements, which are recognizable through some delimiting strings or the builtin SGML, XML and HTML parser. Regions can be arbitrarily long, arbitrarily overlapping, and arbitrarily nested. Sgrep is a convenient tool for making queries to almost any kind of text files with some well kown structure. These include programs, mail folders, news folders, HTML, SGML, etc... With relatively simple queries you can display mail messages by their subject or sender, extract titles or links or any regions from HTML files, function prototypes from C or make complex queries to SGML files based on the DTD of the file.
2010-04-13sync w/ base pkgdrochner2-6/+6
2010-04-13update to 1.1.26drochner6-215/+51
changes: -fix for CVE-2008-2935 (was patched in pkgsrc) -misc fixes and improvements
2010-04-13sync w/ base pkgdrochner2-6/+6
2010-04-13update to 2.7.7drochner5-59/+11
changes: -Adding a --xpath option to xmllint -Make HTML parser non-recursive -misc portability improvements and bugfixes
2010-04-11Import py-cElementTree-0 as textproc/py-cElementTree.obache4-0/+69
The cElementTree module is a C implementation of the ElementTree API, optimized for fast parsing and low memory use. On typical documents, cElementTree is 15-20 times faster than the Python version of ElementTree, and uses 2-5 times less memory. On modern hardware, that means that documents in the 50-100 megabyte range can be manipulated in memory, and that documents in the 0-1 megabyte range load in zero time (0.0 seconds). This allows you to drastically simplify many kinds of XML applications.
2010-04-10Update to soprano 2.4.1markd5-43/+19
2.4.0 * New methods Client::SparqlModel::listStatementsAsync and Client::SparqlModel::listContextsAsync * Automatic query prefix expansion in NRLModel * Automatic NRL metadata graph removal in NRLModel * sopranocmd: - New paramter --nrl for sopranocmd which will trigger automatic query prefix expansion based on prefixes stored in the underlying model, automatic context creation for imported statements that do not have one set, and automatic metad ata graph removal on removeContext calls. - New parameter --graphselect which allows to convert SPARQL select query results into a graph result. This is very usable since SPARQL construct queries do not support contexts/named graphs. - New command 'rmgraph' which allows to delete a list of graphs/contexts. - New parameter --foo which enables scriptable binding results without variable names. * ServerCore is now fully multithreaded allowing two clients perform read-only commands simultaneously. * New Virtuoso storage backend which can either connect to a running Virtuoso instance or spawn one locally. * New convenience methods to handle BackendSettings. * New class AsyncQuery which will spawn a new thread for real asnycroneous query execution (in contrast to AsyncModel which only performs the actual query call in another thread while the iteration is syncroneous.) * Removed redland/librdf dependancy from raptor parser. 2.3.2 * Do not fail when trying to index empty strings, but ignore them. * Fixed parsing of plain literals with language code in sopranocmd. * Make onto2vocabularyclass work with the redland backend (not using graph SPARQL queries anymore) 2.3.1 * Changed installation path of cmake modules (SopranoAppOntology.cmake) to <prefix/share/soprano/cmake * Prevent the redland backend to add one statement multiple times. * Properly handle class names containing a dash '-' in onto2vocabularyclass * sopranocmd: Fail if trying to export a non-graph query
2010-04-09add shared-desktop-ontologiesmarkd1-1/+2
2010-04-09Initial import shared-desktop-ontologies-0.3markd5-0/+83
The shared-desktop-ontologies package brings the semantic web to the desktop in terms of vocabulary. It contains the well known core ontologies such as RDF and RDFS as well as the Nepomuk ontologies which are used by projects like KDE or Strigi.
2010-04-09Update json-glib to 0.10.4.obache2-6/+6
Overview of changes for 0.10.4 ============================== • Improve the strictness of the JsonParser code by refactoring a large chunk of it • Fix JsonGenerator when dumping nodes with a floating point value • Add negative conformance test cases
2010-04-07Needs py-expat for buildjoerg1-1/+3
2010-03-22Fix previous (hi joerg!).wiz2-4/+4
Bump version.
2010-03-21include gettext-lib buildlink; necessary to compile dependant packagesjdolecek1-1/+3
(such as php-pspell) on Mac OS X 10.6.2 - without that the compile fails as libintl is not found
2010-03-21add php-intljdolecek1-1/+2
2010-03-21Add php-intl package, using PECL version 1.1.1, modified to compile withjdolecek12-0/+386
both PHP 5.3.2 and PHP 5.3.12 compile tested on Mac OS X 10.6.2 and NetBSD 5.0.1 This module is an internationalization extension, enabling PHP programmers to perform UCA-conformant collation and date/time/number/currency formatting in their scripts. It tends to closely follow ICU APIs, so that people having experience working with ICU in either C/C++ or Java could easily use the PHP API. Also, this way ICU documentation would be useful to understand various ICU functions. Intl consists of several modules, each of them exposes the corresponding ICU API: * Collator: provides string comparison capability with support for appropriate locale-sensitive sort orderings. * Number Formatter: allows to display number according to the localized format or given pattern or set of rules, and to parse strings into numbers. * Message Formatter: allows to create messages incorporating data (such as numbers or dates) formatted according to given pattern and locale rules, and parse messages extracting data from them. * Normalizer: provides a function to transform text into one of the Unicode normalization forms, and provides a routine to test if a given string is already normalized. * Locale: provides interaction with locale identifiers in the form of functions to get subtags from locale identifier; parse, compose, match(lookup and filter) locale identifiers.
2010-03-21Reset maintainer, developer lost his commit bit.wiz1-2/+2
2010-03-21switch php-enchant to use PECL module (version 1.1.0), so that it's also ↵jdolecek2-3/+9
usable with PHP 5.2.* compilation tested with PHP 5.2.13 and 5.3.2 on Mac OS X 10.6.2, NetBSD 5.0.1 slightly adjust COMMENT
2010-03-17Doesn't support Python 2.4.joerg1-1/+3
2010-03-16Updating textproc/p5-Text-DHCPLeases from 0.8 to 0.9sno2-7/+8
pkgsrc changes: - Add license definition Upstream changes: 0.9 2009/08/10 - Added License text in all files - Removed requirement for version.pm
2010-03-16+ py-4Suitepettai1-1/+2
2010-03-164Suite allows users to take advantage of standard XML technologies rapidlypettai4-0/+1374
and to develop and integrate Web-based applications. It also puts practical technologies for knowledge management projects in the hands of developers. It is implemented in Python with C extensions.
2010-03-16Updating textproc/p5-Text-BibTeX from 0.41 to 0.42sno2-6/+6
Upstream changes: Release 0.42 - 16 March, 2010 ----------------------------- * names with more than two commas are left untouched if protected by braces. * can now deal with names with hyphens and generate the correct BibTeX compatible abbreviations. * can generate initials for protected name parts, like BibTeX can. * can deal with generating initials for names which contains Unicode characters. * fixed compilation on machines needing alloca.h * added new README file with more up-to-date information.
2010-03-16Updating textproc/p5-XML-NamespaceSupport from 1.10 to 1.11sno2-7/+7
pkgsrc changes: - Changing module type to use bundled Module::Install Upstream changes: 1.11 tues20100316 15:17 - re-generate the META.yml and remove the stale one from git
2010-03-16Updating textproc/p5-Text-CSV_XS from 0.71 to 0.72sno2-6/+6
Upstream changes: 2010-03-16 0.72 - H.Merijn Brand <h.m.brand@xs4all.nl> * Introduce quote_null attribute (RT#55200) * examples/csv-check can be used for Text::CSV_PP * examples/csv-check more options for sep_, escape_ and quote_char * examples/csv2xls more options for sep_, escape_ and quote_char * examples/csv2xls added auto_diag and verbosity * Dropped YAML spec to 1.0
2010-03-16Added textproc/p5-Text-Trac version 0.15sno1-1/+2
2010-03-16Updating textproc/p5-Text-Trac version 0.15 as dependency for scheduledsno3-0/+37
update of devel/p5-SVN-Notify Text::Trac parses text with Trac WikiFormatting and convert it to html format.
2010-03-16php5-xsl has moved to php-xsl.taca3-34/+0
2010-03-16php5-dom has moved to php-dom.taca2-22/+0
2010-03-16Add and enable php-enchant.taca1-1/+2
2010-03-16Importing textproc/php-enchant 5.3.2 which part of PHP 5.3.2 and successortaca2-0/+29
of php-pspell. PHP is a programming language designed to be embedded into web pages. This module provides support for enchant support. Enchant is a binder for libenchant. Libenchant provides a common API for many spell libraries: - aspell/pspell (intended to replace ispell) - hspell (hebrew) - ispell - myspell/hunspell (OpenOffice project, mozilla) - uspell (primarily Yiddish, Hebrew, and Eastern European languages) A plugin system allows to add custom spell support, see http://www.abisource.com/enchant/.
2010-03-16Remove entry for php5-dom and php5-xsl.taca1-3/+1