Home > mailing lists

Re: Performance of count(*) - Mailing list pgsql-performance

From	Tino Wildenhain
Subject	Re: Performance of count(*)
Date	March 22, 2007 15:31:59
Msg-id	4602A15B.7000908@wildenhain.de Whole thread Raw
In response to	Re: Performance of count(*) ("Craig A. James" <cjames@modgraph-usa.com>)
Responses	Re: Performance of count(*)
List	pgsql-performance

Tree view

Craig A. James schrieb:
...
> In our case (for a variety of reasons, but this one is critical), we
> actually can't use Postgres indexing at all -- we wrote an entirely
> separate indexing system for our data, one that has the following
> properties:
>
>  1. It can give out "pages" of information (i.e. "rows 50-60") without
>     rescanning the skipped pages the way "limit/offset" would.
>  2. It can give accurate estimates of the total rows that will be returned.
>  3. It can accurately estimate the time it will take.
>

Thats certainly not entirely correct. There is no need to store or
maintain this information along with postgres when you can store
and maintain it directly in postgres as well. When you have some
outside application I think I can savely assume you are doing
less updates compared to many reads to have it actually pay out.

So why not store this information in separate "index" and "statistic"
tables? You would have just to join with your real data for retrival.

On top of that, postgres has a very flexible and extensible index
system. This would mean you save on database roundtrips and
double information storage (and the sync problems you certainly
get from it)

Regards
Tino

pgsql-performance by date:

From: "Craig A. James"
Date: 22 March 2007, 15:17:12
Subject: Re: Performance of count(*)

From: Tom Lane
Date: 22 March 2007, 15:37:22
Subject: Re: Potential memory usage issue

Re: Performance of count(*) - Mailing list pgsql-performance

Previous

Next