Re: Reverse collations (initially for making keyset pagination covermore cases) - Mailing list pgsql-hackers

From David Fetter
Subject Re: Reverse collations (initially for making keyset pagination covermore cases)
Date
Msg-id 20191117215426.GA7444@fetter.org
Whole thread Raw
In response to Re: Reverse collations (initially for making keyset pagination cover more cases)  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Reverse collations (initially for making keyset pagination cover more cases)
List pgsql-hackers
On Sun, Nov 17, 2019 at 02:30:35PM -0500, Tom Lane wrote:
> David Fetter <david@fetter.org> writes:
> > Please find attached a patch for $Subject.
> 
> I think there's a reason why this hasn't been proposed before.
> 
> Back before we had full support of ASC/DESC index sort order, there was
> interest in having reverse-sort operator classes, and there are bits and
> pieces still in the code that tried to cater to that.  But we never got
> it to the point where such things would really be pleasant to use.
> Now that we have ASC/DESC indexes, there's no value in a reverse-sort
> operator class, so the idea's pretty much been consigned to the dustbin.
> 
> This looks to me like it's trying to go down that same path at the
> collation level, and it seems like just as bad of an idea here.
> 
> The fundamental problem with what you propose is that it'd require
> a bunch of infrastructure (which you haven't even attempted) to teach
> the planner about the relationships between forward- and reverse-sort
> collation pairs, so that it could figure out that scanning some index
> backwards would satisfy a request for the reverse-sort collation,
> or vice versa.  Without such infrastructure, the feature is really
> just a gotcha, because queries won't get optimized the way users
> would expect them to.
> 
> And no, I don't think we should accept the feature and then go write
> that infrastructure.  If we couldn't make it work well at the opclass
> level, I don't think things will go better at the collation level.
> 
> Lastly, your proposed use-case has some attraction, but this proposal
> only supports it if the column you need to be differently sorted is
> textual.  What if the sort columns are all numerics and timestamps?

Those are pretty straightforward to generate: -column, and
-extract('epoch' FROM column), respectively.

> Thinking about that, it seems like what we'd want is some sort of
> more-general notion of row comparison, to express "bounded below in
> an arbitrary ORDER BY ordering".  Not quite sure what it ought to
> look like.

I'm not, either, but the one I'm proposing seems like a lot less
redundant code (and hence a lot less room for errors) than what people
generally see proposed for this use case, to wit:

(a, b, c) < ($1, $2 COLLATE "C_backwards", $3)
...
ORDER BY a, b DESC, c

as opposed to the "standard" way to do it

(a > $1) OR
(a = $1 AND b < $2) OR
(a = $1 AND b = $2 AND c > $3)
...
ORDER BY a, b DESC, c

which may not even get optimized correctly.

Best,
David.
-- 
David Fetter <david(at)fetter(dot)org> http://fetter.org/
Phone: +1 415 235 3778

Remember to vote!
Consider donating to Postgres: http://www.postgresql.org/about/donate



pgsql-hackers by date:

Previous
From: Thomas Munro
Date:
Subject: Re: Invisible PROMPT2
Next
From: Andrew Gierth
Date:
Subject: Re: Reverse collations (initially for making keyset pagination cover more cases)