summaryrefslogtreecommitdiff
path: root/mail/spamprobe
AgeCommit message (Collapse)AuthorFilesLines
2014-10-09Remove pkgviews: don't set PKG_INSTALLATION_TYPES in Makefiles.wiz1-3/+1
2014-09-25Make bdb a suggested option. The package doesn't actually build withoutjperkin1-2/+2
a bdb present. Not bumping PKGREVISION as the only way the package would have built previously is by using a builtin version anyway.
2014-05-31libungif is not relevant any longer since the GIF relevant patentswiz2-4/+4
expired (about 10 years ago). Unconditionally switch to libgif. As discussed on tech-pkg without dissent. Bump PKGREVISION.
2014-05-25Revbump after updating graphics/giflibobache1-2/+2
2014-05-25Fixes build with giflib-5.1.obache2-3/+14
2014-05-25As LICENSE.txt, LICENSE=qpl-v1.0obache1-1/+2
2014-01-05Fix build for changed DGifOpen in giflib.htodd2-1/+15
2013-05-28Compator function must be const.joerg2-1/+15
2013-04-13Hand in maintainership.ghen1-2/+2
2013-02-16Recursive bump for png-1.6.wiz1-2/+2
2012-12-20Fix template look up.joerg2-1/+24
2012-10-08Drop PKG_DESTDIR_SUPPORT setting, "user-destdir" is default these days.asau1-2/+1
2011-12-06Fix build with newer GCCjoerg3-1/+28
2011-03-27Fix build by using giflib.buildlink3.mkhubertf1-2/+2
2011-01-13png shlib name changed for png>=1.5.0, so bump PKGREVISIONs.wiz1-2/+2
2010-12-23Mechanically replace references to graphics/jpeg with the suitabledsainty1-2/+2
alternative from mk/jpeg.buildlink3.mk This allows selection of an alternative jpeg library (namely the x86 MMX, SSE, SSE2 accelerated libjpeg-turbo) via JPEG_DEFAULT=libjpeg-turbo, and follows the current standard model for alternatives (fam, motif, fuse etc). The mechanical edits were applied via the following script: #!/bin/sh for d in */*; do [ -d "$d" ] || continue for i in "$d/"Makefile* "$d/"*.mk; do case "$i" in *.orig|*"*"*) continue;; esac out="$d/x" sed -e 's;graphics/jpeg/buildlink3\.mk;mk/jpeg.buildlink3.mk;g' \ -e 's;BUILDLINK_PREFIX\.jpeg;JPEGBASE;g' \ < "$i" > "$out" if cmp -s "$i" "$out"; then rm -f "$out" else echo "Edited $i" mv -f "$i" "$i.orig" && mv "$out" "$i" fi done done
2010-09-16Revbump after updating databases/db5adam1-2/+2
2010-06-15Fix build with png-1.4.x. Addresses PR 43477.wiz2-1/+27
2010-06-14PKGREVISION bump for png-1.4.x shlib change.wiz1-2/+2
(missed those and *emacs* the first time round because they pull in their png dependencies via default-on options; they were included in the test bulk build though)
2010-06-02Fix building with db5; revision bump for db4 updateadam4-10/+24
2010-01-24More PKGREVISION bumps of jpeg dependencies.wiz1-2/+2
Identified by parsing the NetBSD-i386-5.0.1/2009Q3 pkg_summary files and Robert Elz.
2009-08-26bump revision because of graphics/jpeg updatesno1-1/+2
2007-10-25Fix typo, spotted by Chris Ross.ghen1-2/+2
2007-10-25The package supports installation to DESTDIR.heinz1-1/+2
2007-10-25Update to 1.4d (previous 1.4.2 was 1.4b). Take MAINTAINERship.ghen4-24/+18
2006-11-16 Brian Burton <brian@burton-computer.com> * Released as 1.4d * configure.ac: Added ability to selectively disable image processing using --without-gif, --without-jpeg, and/or --without-png. * src/spamprobe/spamprobe.cc (set_headers): Added ability to selectively ignore individual headers using -H-headername. * src/includes/Ptr,Ref,Array.h: Restored missing <cassert> include. * src/parser/PngParser.cc (tokenizeImage): added basic tokens from PNG images. 2006-11-16 Brian Burton <brian@localhost.localdomain> * src/parser/PngParser.cc (PngParser): Stub for PNG parsing using libpng. * src/parser/JpegParser.cc (tokenizeMarker): Preliminary implementation of jpeg parsing using jpeglib. * configure.ac: Auto detect of either libungig or libgif depending on which one is available. 2007-01-04 Brian Burton <brian@burton-computer.com> * Released as 1.4c * spamprobe.1: Modified man page to remove unnecessary informaton and make it more conformant with man page conventions. * src/spamprobe/spamprobe.cc (process_extended_options): added ignore-body option. * src/parser/HeaderPrefixList.cc (HeaderPrefixList::addHeaderPrefix): Forced header prefixes and names to lower case instead of relying on an assert to enforce the restriction. * src/database/FrequencyDBImpl_hash.cc (hash::FrequencyDBImpl_hash): Disabled experimental hash database auto-cleaning. * src/includes/Ref.h: Removed cassert include. * src/spamprobe/spamprobe.cc (process_extended_options): Added whitelist option to allow use of SP as a bayesian white list in conjunction with other filters.
2007-02-22Whitespace cleanup, courtesy of pkglint.wiz1-3/+3
Patch provided by Sergey Svishchev in private mail.
2006-08-13Fix build with gcc4.wiz2-1/+14
2006-08-04Update spamprobe to 1.4b (PKGNAME changed to 1.4.2).ghen2-6/+7
Changes in 1.4a: This release fixes a bug in decoding malformed base64 encoded data that manifested itself in some architectures. Also corrects a typo in the online help. Some code cleanup and minor performance tweaks were added as well. Changes in 1.4b: This release fixes pair of bugs related to messages with no lines in their bodies. No new features were added.
2006-01-07Update spamprobe to version 1.4.hubertf4-14/+26
Submitted by Ossi 'ohub' Herrala <oherrala@ee.oulu.fi> in private mail as discussed on IRCnet #NetBSD. Changes to software: * Spamprobe has a config file! * Added comprehensive command line option help. * Added support for maildir directories to all file based commands. * Added support for auto-purge (-P command line option). * Commands create-db and create-config to greate database and config file. * Added gif parser support using libungif to analyzes image attachments and derive useful information from them. Changes to package: * bdb is not in SUGGESTED_OPTIONS anymore. * added depency to graphics/libungif and gif is in SUGGESTED_OPTIONS.
2006-01-06DEPENDS and PKGREVISION bumps because db4's library name changed.wiz1-1/+2
2005-12-05Ran "pkglint --autofix", which corrected some of the quoting issues inrillig1-2/+2
CONFIGURE_ARGS.
2005-06-29Update spamprobe to 1.2a.hubertf3-9/+21
SpamProbe changes: - New email parsing implementation - New hash database, in addition to Berkeley DB (bdb/db4) (=> Hopefully no more "caught runtime exception: get: DB_PAGE_NOTFOUND: Requested page not found" on Linux every other day!) - And much more, check the ChangeLog for accurate information pkgsrc changes: - Added db4 database backend to be optional (not requirement anymore) Patches sent by the maintainer Ossi Herrala <oherrala@ee.oulu.fi> in private mail.
2005-06-12Pass the correct path to the installed db4 files. Also, transformjlam1-2/+4
"-ldb" to "-ldb4" so that the configure script doesn't use a system libdb.so instead of the pkgsrc libdb4.so. This fixes PR pkg/23615.
2005-04-11Remove USE_BUILDLINK3 and NO_BUILDLINK; these are no longer used.tv1-2/+1
2005-02-24Add RMD160 digests.agc1-1/+2
2004-11-18Update spamprobe to 1.0a, patch sent via IRC by the maintainer.hubertf2-6/+5
Changes: * MimeLineReader.cc: 1.0 branch - fixed MBX record header regex * spamprobe.cc (main): Added exec and exec-shared commands. (import_words): modified import command to allow negative values to be specified in the import file. * Applied patches for configure.in and aclocal.m4 contributed by Siggy Brentrup for debian compatibility. * FrequencyDBImpl_pbl.cc: Invokes new WordData methods to allow storing data in big endian format. * WordData.h: Added optional support for storing counts/flags in big endian order for data portability. * MimeLineReader.cc (readMBXFileHeader): UW IMAP MBX file format is now auto detected from the first line of the mailbox file. * spamprobe.cc (process_extended_options): Removed -o imap-mbx option. * spamprobe.cc (process_extended_options): Added -o imap-mbx option to process files as WU-IMAP MBX files rather than mbox files. * MimeLineReader.cc (readLine): Added support for WU-IMAP MBX file format. * spamprobe.cc (process_stream): Added -o tokenized option to allow people to use an external tokenizer with spamprobe. * SpamFilter.cc (scoreToken): Reduced sorting overhead by pre-computing and integer sort value with sorting priorities reflected in the value. This eliminates several calculations inside of the sort routine. * SpamFilter.cc (computeRatio): Capped ratios in calculations to within MIN_PROB and MAX_PROB. Widened that range. This avoids problems with div/0 and makes it easier to sort terms. * spamprobe.cc (dump_words): dump command can now optionally accept a regular expression as an argument and will only dump terms matching the regular expression. (purge_terms): Added purge-terms command to purge from the database all terms matching a regular expression. * spamprobe.cc (main): Fixed bug in command line processing. Thanks to Jem for bug report. * spamprobe.cc (train_on_message): Code simplified. Eliminated redundant recalculation of scores. (train_on_message): Timestamps are now longer updated by train-spam and train-good commands. They are still updated by train command. (main): Fixed assertion if -P option is specified in a read only operation. * spamprobe.cc (main): Added -C command line option to allow users to specify their own min word count. * SpamFilter.cc (SpamFilter): Set default minimum word count back to 5 (was 3). * spamprobe.cc (process_extended_options): Removed "alt-score" from -o options list because it distributes scores poorly. New formula achieves the same end with better accuracy. Added "orig-score" option to allow people to continue using the old formula. Added "honor-xstatus-header" option for people whose mail server uses X-Status: rather than Status: for the deleted flag. (main): Added -l command line option to allow people to set their own spam threshold if they don't like the default value. * SpamFilter.cc (scoreMessage): Added a new scoring formula based on Paul's but taking the nth root of spam and good probabilities to produce more evenly distributed scores. Lowered the spam threshold to 0.6 to keep accuracy about the same as the original formula. Highest score seen for a ham so far in tests is 0.44 so 0.6 seems safe. Made the new formula the default instead of Paul's.
2004-10-03Libtool fix for PR pkg/26633, and other issues. Update libtool to 1.5.10tv1-1/+2
in the process. (More information on tech-pkg.) Bump PKGREVISION and BUILDLINK_DEPENDS of all packages using libtool and installing .la files. Bump PKGREVISION (only) of all packages depending directly on the above via a buildlink3 include.
2004-06-17this needs a c and c++ compiler.grant1-1/+2
2004-03-11add warning about older procmail versionslukem1-0/+9
2004-02-03Update spamprobe to 0.9h. ChangeLog:hubertf3-31/+5
* New manual page * spamprobe.cc (process_stream): Added -o tokenized option to allow people to use an external tokenizer with spamprobe. * SpamFilter.cc (scoreToken): Reduced sorting overhead by pre-computing and integer sort value with sorting priorities reflected in the value. This eliminates several calculations inside of the sort routine. * SpamFilter.cc (computeRatio): Capped ratios in calculations to within MIN_PROB and MAX_PROB. Widened that range. This avoids problems with div/0 and makes it easier to sort terms. * spamprobe.cc (dump_words): dump command can now optionally accept a regular expression as an argument and will only dump terms matching the regular expression. (purge_terms): Added purge-terms command to purge from the database all terms matching a regular expression. Patch submitted by Ossi Herrala <PGP: 0x78CD0337> in private email.
2004-01-20Update spamprobe to 0.9g2.hubertf2-5/+6
Patch provided by Ossi Herrala <$MAINTAINER> in private mail. CHANGED: * This release moves from Paul's original formula to a slightly modified one that yields more evenly distributed scores. To continue using old formula, use -o orig-score command line option. ADDED: * -C <number> command line option. This tells SpamProbe to assign a default, somewhat neutral, probability to any term that does not have a weighted (good count doubled) count of at least specified number in the database. This prevents terms which have been seen only a few times from having an unreasonable influence on the score of an email containing them. Default count have changed. It is now 5. Old was 3. * Added -o <option_name> command line option to specify alternate way of scoring. Consult README.txt for more info. * Added -l <number> command line option. Changes the spam probability threshold for emails from the default (0.7) to <number>. * Added tokenize command. Prints the tokens found in the file one word per line in human readable format with spam probability, good count, spam count, message count, and word in columns separated by whitespace. IMPROVED: * -H command line option to add more headers to scan. * Improved performance by removing some redundant calculations and reducing the amount of I/O in train-* mode.
2004-01-20Fix typos in the manual page.lukem2-1/+27
2004-01-05Update to spamprobe-0.9e.hubertf2-9/+10
Too many changes to list, the old pkg was from 2002(!) Update probided by ohub @ #NetBSD/IRCnet, buildlink3 code OK'd by jlam.
2003-12-01reorder lines to appease pkglint (hi, zuntum!)grant1-3/+4
2003-07-17s/netbsd.org/NetBSD.org/grant1-2/+2
2003-06-10Bump PKGREVISION because of db4 shlib major bump.wiz1-1/+2
2003-06-02Use tech-pkg@ in favor of packages@ as MAINTAINER for orphaned packages.jschauma1-2/+2
Should anybody feel like they could be the maintainer for any of thewe packages, please adjust.
2003-02-17Initial import of mail/spamprobe (version 0.8b)zuntum4-0/+27
Spamprobe is a fast, intelligent, automatic spam detector using Bayesian analysis of terms contained in emails. Works with procmail, maildrop or a similar tool to produce a complete server or client side spam filtering system. Provided by Daniel Farrugia in PR#20286, buildlinkified by me.