Home > mailing lists

Re: Performance problems testing with Spamassassin 3.1.0 - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: Performance problems testing with Spamassassin 3.1.0
Date	August 4, 2005 14:37:12
Msg-id	15610.1123166222@sss.pgh.pa.us Whole thread Raw
In response to	Re: Performance problems testing with Spamassassin 3.1.0 (Matthew Schumacher <matt.s@aptalaska.net>)
Responses	Re: Performance problems testing with Spamassassin 3.1.0 Re: Performance problems testing with Spamassassin 3.1.0
List	pgsql-performance

Tree view

Matthew Schumacher <matt.s@aptalaska.net> writes:
>   for i in array_lower(intokenary, 1) .. array_upper(intokenary, 1)
>   LOOP
>     _token := intokenary[i];
>     INSERT INTO bayes_token_tmp VALUES (_token);
>   END LOOP;

>   UPDATE
>     bayes_token
>   SET
>     spam_count = greatest_int(spam_count + inspam_count, 0),
>     ham_count = greatest_int(ham_count + inham_count , 0),
>     atime = greatest_int(atime, 1000)
>   WHERE
>     id = inuserid
>   AND
>     (token) IN (SELECT intoken FROM bayes_token_tmp);

I don't really see why you think that this path is going to lead to
better performance than where you were before.  Manipulation of the
temp table is never going to be free, and IN (sub-select) is always
inherently not fast, and NOT IN (sub-select) is always inherently
awful.  Throwing a pile of simple queries at the problem is not
necessarily the wrong way ... especially when you are doing it in
plpgsql, because you've already eliminated the overhead of network
round trips and repeated planning of the queries.

            regards, tom lane

pgsql-performance by date:

From: John A Meinel
Date: 04 August 2005, 14:10:33
Subject: Re: Performance problems testing with Spamassassin 3.1.0

From: John A Meinel
Date: 04 August 2005, 15:08:38
Subject: Re: Performance problems testing with Spamassassin 3.1.0

Re: Performance problems testing with Spamassassin 3.1.0 - Mailing list pgsql-performance

Previous

Next