Re: count(*) slow on large tables - Mailing list pgsql-performance

From Tom Lane
Subject Re: count(*) slow on large tables
Date
Msg-id 6743.1065286173@sss.pgh.pa.us
Whole thread Raw
In response to Re: count(*) slow on large tables  (Bruce Momjian <pgman@candle.pha.pa.us>)
Responses Re: count(*) slow on large tables  (Bruce Momjian <pgman@candle.pha.pa.us>)
List pgsql-performance
Bruce Momjian <pgman@candle.pha.pa.us> writes:
> We do have a TODO item:
>     * Consider using MVCC to cache count(*) queries with no WHERE clause

> The idea is to cache a recent count of the table, then have
> insert/delete add +/- records to the count.  A COUNT(*) would get the
> main cached record plus any visible +/- records.  This would allow the
> count to return the proper value depending on the visibility of the
> requesting transaction, and it would require _no_ heap or index scan.

... and it would give the wrong answers.  Unless the cache is somehow
snapshot-aware, so that it can know which other transactions should be
included in your count.

            regards, tom lane

pgsql-performance by date:

Previous
From: Tom Lane
Date:
Subject: COUNT(*) again (was Re: [HACKERS] Index/Function organized table layout)
Next
From: Bruce Momjian
Date:
Subject: Re: count(*) slow on large tables