Home > mailing lists

Re: Use of sequence rather than index scan for one text column on one instance of a database - Mailing list pgsql-performance

From	Bill Kirtley
Subject	Re: Use of sequence rather than index scan for one text column on one instance of a database
Date	October 3, 2009 23:04:09
Msg-id	AFAA5C92-FFFA-4A9C-823B-5A5F6D8B9B78@actbluetech.com Whole thread Raw
In response to	Re: Use of sequence rather than index scan for one text column on one instance of a database (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-performance

Tree view

Hi Tom-

Thanks for the response, but I'm not sure what to do with it.

Are you suggesting we might have some transaction (or residue) that's
hanging around and causing this problem?

We do have transactions that run on the order of  a couple minutes at
times.  In the past, under heavy db load they have piled up on top of
each other, but as far as I can tell they've finished.

Is there something more I can look at to try and diagnose this?  As I
mentioned we do have a workaround of copying the column and building
an index on the new column ... is it time to take that step?  And if
so, should we be monitoring for this sort of thing on an ongoing basis?

Are there different options we can pass to CREATE INDEX to build and
index that would be usable?  Should we be stopping our application
while applying indices (we don't always) to ensure the db is quiescent
at the time?

Regards,
-Bill Kirtley

On Sep 24, 2009, at 12:26 PM, Tom Lane wrote:

> Bill Kirtley <bill@actbluetech.com> writes:
>> select xmin,* from pg_index where indexrelid =
>> 'index_users_on_email'::regclass;
>>    xmin   | indexrelid | indrelid | indnatts | indisunique |
>> indisprimary | indisclustered | indisvalid | indcheckxmin |
>> indisready
>> | indkey | indclass | indoption | indexprs | indpred
>> ----------+------------+----------+----------+-------------
>> +--------------+----------------+------------+--------------
>> +------------+--------+----------+-----------+----------+---------
>>  12651453 |   24483560 |    17516 |        1 | t           |
>> f            | f              | t          | t            | t
>> | 6      |    10042 | 0         |          |
>> (1 row)
>
> Okay, the basic cause of the issue is now clear: the index has
> indcheckxmin true, which means it's not usable until local
> TransactionXmin exceeds the tuple's xmin (12651453 here).  This
> is all a pretty unsurprising consequence of the HOT optimizations
> added in 8.3.  The question is why that state persisted long
> enough to be a problem.  Perhaps you have long-running background
> transactions?  TransactionXmin is normally the oldest XID that was
> running when your own transaction started, so basically the index
> isn't usable until all transactions that were running while it
> was built complete.  I had been thinking that this only happened
> for concurrent index builds, but actually regular builds can be
> subject to it as well.
>
> We've seen some complaints about this behavior before.  I wonder if
> there's a way to work a bit harder to avoid the indcheckxmin labeling
> --- right now the code is pretty conservative about setting that bit
> if there's any chance at all of an invalid HOT chain.
>
>             regards, tom lane

pgsql-performance by date:

From: std pik
Date: 03 October 2009, 23:02:55
Subject: Postgres performance

From: "Sgarbossa Domenico"
Date: 03 October 2009, 23:04:39
Subject: Performance problems with DISTINCT ON

Re: Use of sequence rather than index scan for one text column on one instance of a database - Mailing list pgsql-performance

Previous

Next