Home > mailing lists

Re: Why does adding SUM and GROUP BY destroy performance? - Mailing list pgsql-general

From	Christopher Browne
Subject	Re: Why does adding SUM and GROUP BY destroy performance?
Date	September 18, 2003 13:16:18
Msg-id	m3smmu8fk3.fsf@wolfe.cbbrowne.com Whole thread Raw
In response to	Why does adding SUM and GROUP BY destroy performance? (David Link <dvlink@yahoo.com>)
Responses	Re: Why does adding SUM and GROUP BY destroy performance?
List	pgsql-general

Tree view

In the last exciting episode, dvlink@yahoo.com (David Link) wrote:
> Why does adding SUM and GROUP BY destroy performance?

When you use SUM (or other aggregates), there are no short cuts to
walking through each and every tuple specified by the WHERE clause.

On some systems there are statistics mechanisms that can short-circuit
that.  On PostgreSQL, the use of MVCC to let new data "almost
magically appear" :-) has the demerit, in the case of aggregates, of
not leaving much opening for short cuts.

There are some cases where you CAN do much better than the aggregates
do.

  SELECT MAX(FIELD) FROM TABLE WHERE A='THIS' and B='THAT';

may be replaced with the likely-to-be-faster:

  select field from table where a = 'THIS' and b='THAT' order by field
    desc limit 1;

MIN() admits a similar rewriting.  If there is an index on FIELD, this
will likely be _way_ faster than using MIN()/MAX().

In a sense, it's not that aggregates "destroy" performance; just that
there are no magical shortcuts to make them incredibly fast.
--
wm(X,Y):-write(X),write('@'),write(Y). wm('cbbrowne','acm.org').
http://www.ntlug.org/~cbbrowne/multiplexor.html
"And  1.1.81 is  officially BugFree(tm),  so  if you  receive any  bug
reports on it, you know they are just evil lies." -- Linus Torvalds

pgsql-general by date:

From: david_shadovitz@xontech.com (David Shadovitz)
Date: 18 September 2003, 13:16:09
Subject: Re: Where are PL/pgSQL functions stored?

From: "Joshua D. Drake"
Date: 18 September 2003, 13:16:22
Subject: Re: State of Beta (2)

Re: Why does adding SUM and GROUP BY destroy performance? - Mailing list pgsql-general

Previous

Next