Home > mailing lists

Re: Adding skip scan (including MDAM style range skip scan) to nbtree - Mailing list pgsql-hackers

From	Peter Geoghegan
Subject	Re: Adding skip scan (including MDAM style range skip scan) to nbtree
Date	September 20, 2024 17:21:29
Msg-id	CAH2-WznjrqL3yFBwWWUER513AMdO3PSLWPfun_uK7m+xH4Cr3g@mail.gmail.com Whole thread
In response to	Re: Adding skip scan (including MDAM style range skip scan) to nbtree (Tomas Vondra <tomas@vondra.me>)
Responses	Re: Adding skip scan (including MDAM style range skip scan) to nbtree
List	pgsql-hackers

Tree view

On Fri, Sep 20, 2024 at 9:45 AM Tomas Vondra <tomas@vondra.me> wrote:
> 3) restart cluster, drop caches
>
> 4) run the query from the SQL script
>
> I suspect you don't do (3). I didn't mention this explicitly, my message
> only said "with uncached data", so maybe that's the problem?

You're right that I didn't do step 3 here. I'm generally in the habit
of using fully cached data when testing this kind of work.

The only explanation I can think of is that (at least on your
hardware) OS readahead helps the master branch more than skipping
helps the patch. That's surprising, but I guess it's possible here
because skip scan only needs to access about every third page. And
because this particular index was generated by CREATE INDEX, and so
happens to have a strong correlation between key space order and
physical block order. And probably because this is an index-only scan.

> I wasn't suggesting it's a sympathetic case for skipscan. My point is
> that it perfectly matches the costing assumptions, i.e. columns are
> independent etc. But if it's not sympathetic, maybe the cost shouldn't
> be 1/5 of cost from master?

The costing is pretty accurate if we assume cached data, though --
which is what the planner will actually assume. In any case, is that
really the only problem you see here? That the costing might be
inaccurate because it fails to account for some underlying effect,
such as the influence of OS readhead?

Let's assume for a moment that the regression is indeed due to
readahead effects, and that we deem it to be unacceptable. What can be
done about it? I have a really hard time thinking of a fix, since by
most conventional measures skip scan is indeed much faster here.

--
Peter Geoghegan

pgsql-hackers by date:

From: Tomas Vondra
Date: 20 September 2024, 17:07:41
Subject: Re: Adding skip scan (including MDAM style range skip scan) to nbtree

From: Tom Lane
Date: 20 September 2024, 17:21:34
Subject: Re: Clock-skew management in logical replication

Re: Adding skip scan (including MDAM style range skip scan) to nbtree - Mailing list pgsql-hackers

Previous

Next