path: root/textproc/py-acora
Age | Commit message | Author | Files | Lines
2017-01-12 | DEPENDS on devel/py-cython. | rodent | 1 | -1/+3
2017-01-12 | 2.0 [2016-03-17] | rodent | 3 | -15/+13
Rewrite of the construction algorithm to speed it up and save memory.
1.9 [2015-10-10]: recompiled with Cython 0.23.4 for better compatibility with recent Python versions.
2016-06-08 | Switch to MASTER_SITES_PYPI. | wiz | 1 | -2/+2
2015-11-04 | Add SHA512 digests for distfiles for textproc category | agc | 1 | -1/+2
Problems found locating distfiles:
    Package cabocha: missing distfile cabocha-0.68.tar.bz2
    Package convertlit: missing distfile clit18src.zip
    Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz
Otherwise, existing SHA1 digests verified and found to be the same on the machine holding the existing distfiles (morden).
All existing SHA1 digests retained for now as an audit trail.
2014-02-13 | Import py27-acora-1.8 as textproc/py-acora. | rodent | 4 | -0/+50
Acora is 'fgrep' for Python, a fast multi-keyword text search engine. Based on a set of keywords, it generates a search automaton (DFA) and runs it over string input, either unicode or bytes. It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset construction. Acora comes with both a pure Python implementation and a fast binary module written in Cython. However, note that the current construction algorithm is not suitable for really large sets of keywords (i.e. more than a couple of thousand).
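To illustrate the multi-keyword search idea described above, here is a minimal pure-Python sketch of an Aho-Corasick matcher. It is not Acora's code: Acora compiles the keywords into a DFA through a powerset construction, whereas this sketch keeps a plain trie and follows failure links during the search. The function names `build_automaton` and `findall` are hypothetical and chosen only for this example.

```python
from collections import deque


def build_automaton(keywords):
    """Build a keyword trie with Aho-Corasick failure links (illustrative only)."""
    goto = [{}]      # goto[state][char] -> next state
    fail = [0]       # fail[state] -> state for the longest proper suffix
    output = [[]]    # output[state] -> keywords that end at this state

    # Phase 1: insert every non-empty keyword into the trie.
    for word in keywords:
        if not word:
            continue
        state = 0
        for ch in word:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                output.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        if word not in output[state]:
            output[state].append(word)

    # Phase 2: breadth-first pass to compute failure links.
    # States directly below the root keep fail == 0.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit keywords that end at the suffix state.
            output[nxt] = output[nxt] + output[fail[nxt]]
    return goto, fail, output


def findall(automaton, text):
    """Return (keyword, start_index) pairs for every match in text."""
    goto, fail, output = automaton
    state = 0
    matches = []
    for pos, ch in enumerate(text):
        # Fall back along failure links until a transition exists.
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for word in output[state]:
            matches.append((word, pos - len(word) + 1))
    return matches


automaton = build_automaton(["he", "she", "his", "hers"])
print(findall(automaton, "ushers"))
```

The text is scanned once regardless of how many keywords there are, which is the property that makes this family of algorithms attractive for fgrep-style searching. A precomputed DFA, as the package description mentions, removes the failure-link loop from the search at the cost of a more expensive construction step.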