path: root/textproc/ruby-classifier-reborn
Age  Commit message  Author  Files  Lines
2015-12-13  Update ruby-classifier-reborn to 2.0.4.  taca  3  -8/+25
v2.0.4

Major Enhancements

* Classification thresholds can be enabled or disabled. The default is disabled. The threshold value can be set at initialization time or dynamically during processing (#47)
* Made auto-categorization optional, defaulting to false (#45)
* Added the ability to handle an array of classifications to the constructor (#44)
* Classification with a threshold has been added to the api (#39)

Minor Enhancements

* Documentation around threshold usage (#54)
* Fixed UTF-8 encoding for hasher.rb (#50)
* Removed some unnecessary methods (#43)
* Add optional CachedContentNode (GSL only) (#43)
* Caches the transposed search_vector (#43)
* Added custom marshal_ methods to not save the cache when dumping/loading (#43)
* Optimized some numeric comparisons and iterators (#43)
* Added cached calculation table when computing raw_vectors (#43)
* If a category name is already a symbol, just return it (#45)
* Various Hash improvements (#45)
* Eliminated several Ruby warnings when run with RUBYOPT="-w" (#38)
* Simple performance improvements for the Hasher process (#41)
* Fixes for broken regex splitting for non-ascii characters and removal of the unused punctuation filter (#41)
* Add multiple language stopwords with customizable stop word paths (#40)

Bug Fixes

* Fixed the bug where adding the same category a second time would clobber the category that was already there (#45)
* Fixed deprecation warning for <=> in ls.rb (#33)
* Remove references to Madeline in the README and replace it with Marshal or Redis (#32)

Development Fixes

* Added development dependency on mini_test and added 2.2 to travis.yml (#36)
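The threshold behaviour described in the 2.0.4 notes can be pictured with a small plain-Ruby sketch. This is not classifier-reborn's code or API: the class name TinyBayes, its method names, the keyword arguments, and the add-one smoothing are all assumptions made for illustration. It only shows the general idea of an optional, adjustable score threshold on a Bayesian classifier, together with a constructor that accepts an array of categories and does not clobber a category that is added twice.

```ruby
# Illustrative sketch only; NOT the classifier-reborn implementation or API.
# TinyBayes and all of its methods are hypothetical names.
class TinyBayes
  # The threshold can be changed after construction.
  attr_accessor :threshold

  def initialize(*categories, enable_threshold: false, threshold: 0.0)
    @enable_threshold = enable_threshold
    @threshold = threshold
    @counts = {}
    # flatten lets callers pass either a list or a single array of names.
    categories.flatten.each { |c| add_category(c) }
  end

  # ||= keeps existing word counts if the category was already added.
  def add_category(name)
    @counts[name.to_sym] ||= Hash.new(0)
  end

  # Raises KeyError for a category that was never added.
  def train(category, text)
    words(text).each { |w| @counts.fetch(category.to_sym)[w] += 1 }
  end

  # Log-likelihood per category with add-one (Laplace) smoothing.
  def scores(text)
    vocab = @counts.values.flat_map(&:keys).uniq.size
    @counts.each_with_object({}) do |(cat, freq), out|
      total = freq.values.sum
      out[cat] = words(text).sum { |w| Math.log((freq[w] + 1.0) / (total + vocab)) }
    end
  end

  # Returns the best category, or nil when the threshold is enabled
  # and the best score falls below it.
  def classify(text)
    best, score = scores(text).max_by { |_, s| s }
    return nil if best.nil?
    return nil if @enable_threshold && score < @threshold
    best
  end

  private

  def words(text)
    text.downcase.scan(/[[:alnum:]]+/)
  end
end

strict = TinyBayes.new(%w[spam ham], enable_threshold: true, threshold: -3.0)
strict.train(:spam, "cheap pills buy now")
strict.train(:ham, "meeting notes for monday")
strict.classify("buy cheap pills")   # => nil (best score is below -3.0)
strict.threshold = -10.0
strict.classify("buy cheap pills")   # => :spam
```

Returning nil below the threshold is one possible design; it lets the caller distinguish "no confident match" from a real category without raising an exception.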
2015-11-04  Add SHA512 digests for distfiles for textproc category  agc  1  -1/+2
Problems found locating distfiles:
	Package cabocha: missing distfile cabocha-0.68.tar.bz2
	Package convertlit: missing distfile clit18src.zip
	Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz

Otherwise, existing SHA1 digests verified and found to be the same on the machine holding the existing distfiles (morden). All existing SHA1 digests retained for now as an audit trail.
2015-02-07  Add ruby-classifier-reborn 2.0.3, successor of ruby-classifier.  taca  5  -0/+49
Classifier is a general classifier module for Ruby to allow Bayesian and other types of classifications. Classifier Reborn is a fork of cardmagic/classifier under more active development.