Re: [OT] Tom's/Marc's spam filters? - Mailing list pgsql-general

From Bruce Momjian
Subject Re: [OT] Tom's/Marc's spam filters?
Date
Msg-id 200404211845.i3LIjma06527@candle.pha.pa.us
Whole thread Raw
In response to Re: [OT] Tom's/Marc's spam filters?  (Joe Conway <mail@joeconway.com>)
Responses Re: [OT] Tom's/Marc's spam filters?
List pgsql-general
Joe Conway wrote:
> I get a comparible amount of spam (~600 to 1000 per day) and my setup
> *was* about 98% effective until a month or so ago. These days it is more
> like 80%. I've noticed many of the spam getting through appears
> specifically targeted at getting by SA -- no HTML, a paragraph of
> nonsense (or sometimes out of some public domain book), and a one liner
> trying to sell me a mortgage or something.
>
> The one thing I had *not* been doing, but started to do as of last
> night, is to use the false-negatives to explicitly train the Bayesian
> filter.  It was easy enough to set up. I created an hourly cron job as
> follows:
>
>    /usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox
>
> Now I just drop all false negatives into that mailbox, and clean them
> out periodically. Hopefully that will make a significant improvement.

I can tell you it certainly will.

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

pgsql-general by date:

Previous
From: Jan Wieck
Date:
Subject: Re: Redundancy software for PostgreSQL
Next
From: Tom Lane
Date:
Subject: Re: 7.3.4 on Linux: UPDATE .. foo=foo+1 degrades massivly over time