Home > mailing lists

Re: Use of inefficient index in the presence of dead tuples - Mailing list pgsql-general

From	Tom Lane
Subject	Re: Use of inefficient index in the presence of dead tuples
Date	May 29, 2024 00:53:21
Msg-id	2771.1716944001@sss.pgh.pa.us Whole thread Raw
In response to	Use of inefficient index in the presence of dead tuples (Alexander Staubo <alex@purefiction.net>)
Responses	Re: Use of inefficient index in the presence of dead tuples Re: Use of inefficient index in the presence of dead tuples
List	pgsql-general

Tree view

Alexander Staubo <alex@purefiction.net> writes:
> (2) Set up schema. It's important to create the index before insertion, in order to provoke a
> situation where the indexes have dead tuples:
> ...
> (4) Then ensure all tuples are dead except one:

>     DELETE FROM outbox_batches;
>     INSERT INTO outbox_batches (receiver, id) VALUES ('dummy', 'test');

> (5) Analyze:

>     ANALYZE outbox_batches;

So the problem here is that the ANALYZE didn't see any of the dead rows
and thus there is no way to know that they all match 'dummy'.  The cost
estimation is based on the conclusion that there is exactly one row
that will pass the index condition in each case, and thus the "right"
index doesn't look any cheaper than the "wrong" one --- in fact, it
looks a little worse because of the extra access to the visibility
map that will be incurred by an index-only scan.

I'm unpersuaded by the idea that ANALYZE should count dead tuples.
Since those are going to go away pretty soon, we would risk
estimating on the basis of no-longer-relevant stats and thus
creating problems worse than the one we solve.

What is interesting here is that had you done ANALYZE *before*
the delete-and-insert, you'd have been fine.  So it seems like
somewhat out-of-date stats would have benefited you.

It would be interesting to see a non-artificial example that took
into account when the last auto-vacuum and auto-analyze really
happened, so we could see if there's any less-fragile way of
dealing with this situation.

            regards, tom lane

pgsql-general by date:

From: Ian Lawrence Barwick
Date: 29 May 2024, 00:27:30
Subject: Re: tds_fdw >> Install Foreign data Wrapper on EDB Postgres to connect to SQL server database

From: Muhammad Imtiaz
Date: 29 May 2024, 03:43:11
Subject: Re: tds_fdw >> Install Foreign data Wrapper on EDB Postgres to connect to SQL server database

Re: Use of inefficient index in the presence of dead tuples - Mailing list pgsql-general

Previous

Next