Re: List moderation - need a break! - Mailing list pgsql-www

From Greg Smith
Subject Re: List moderation - need a break!
Date
Msg-id alpine.GSO.2.01.0906151830450.21975@westnet.com
Whole thread Raw
In response to Re: List moderation - need a break!  ("Marc G. Fournier" <scrappy@hub.org>)
Responses Re: List moderation - need a break!  ("Joshua D. Drake" <jd@commandprompt.com>)
Re: List moderation - need a break!  (Dave Page <dpage@pgadmin.org>)
List pgsql-www
On Mon, 15 Jun 2009, Marc G. Fournier wrote:

> I've tried unsucessfully in the past to do language based regex's and failed 
> miserably though ... anyone out there good at this? ;

I thought Josh was suggesting leaning on the SpamAssassin toolset here, 
you're certainly not going to write this yourself in any reasonable amount 
of time.  The two rules you can use are:

CHARSET_FARAWAY Character set indicates a foreign language
UNWANTED_LANGUAGE_BODY Message written in an undesired language

which both default to a relatively high score (around +3 points on the rev 
I just checked).  The languages you're willing to accept goes into 
ok_languages, http://email.about.com/cs/spamassassintips/qt/et032504.htm 
has a reasonable primer here.  That defaults to "all".

Since tripping that rule alone isn't enough to pass a typical threshold, 
legit messages from people that just happen to have foreign stuff in their 
signature and such should typically survive.  You might start by setting 
ok_languages and reducing the point value for the rules to something small 
in order to judge its impact, before using the higher default score.

--
* Greg Smith gsmith@gregsmith.com http://www.gregsmith.com Baltimore, MD


pgsql-www by date:

Previous
From: Alvaro Herrera
Date:
Subject: Re: PUG for Ecuador
Next
From: "Joshua D. Drake"
Date:
Subject: Re: List moderation - need a break!