diff options
author | rodent <rodent@pkgsrc.org> | 2014-02-13 00:50:13 +0000 |
---|---|---|
committer | rodent <rodent@pkgsrc.org> | 2014-02-13 00:50:13 +0000 |
commit | e10ebfb23e1a7a8bd6598037d3f0d7ff2d9a831f (patch) | |
tree | 8bb61268c2597df5c4c5d00f1deb23a2f75e6d40 /textproc/py-acora/DESCR | |
parent | 47b79856c5784debd74c19c65af83436a3ebc4aa (diff) | |
download | pkgsrc-e10ebfb23e1a7a8bd6598037d3f0d7ff2d9a831f.tar.gz |
Import py27-acora-1.8 as textproc/py-acora.
Acora is 'fgrep' for Python, a fast multi-keyword text search engine.
Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.
It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.
Acora comes with both a pure Python implementation and a fast binary module
written in Cython. However, note that the current construction algorithm is not
suitable for really large sets of keywords (i.e. more than a couple of
thousand).
Diffstat (limited to 'textproc/py-acora/DESCR')
-rw-r--r-- | textproc/py-acora/DESCR | 12 |
1 files changed, 12 insertions, 0 deletions
diff --git a/textproc/py-acora/DESCR b/textproc/py-acora/DESCR new file mode 100644 index 00000000000..7472c7f6941 --- /dev/null +++ b/textproc/py-acora/DESCR @@ -0,0 +1,12 @@ +Acora is 'fgrep' for Python, a fast multi-keyword text search engine. + +Based on a set of keywords, it generates a search automaton (DFA) and runs it +over string input, either unicode or bytes. + +It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset +construction. + +Acora comes with both a pure Python implementation and a fast binary module +written in Cython. However, note that the current construction algorithm is not +suitable for really large sets of keywords (i.e. more than a couple of +thousand). |