Re: Add standard collation UNICODE - Mailing list pgsql-hackers

From Thomas Munro
Subject Re: Add standard collation UNICODE
Date
Msg-id CA+hUKGKZEDiXeGFcPw2GAGxdPCr2L+9yPdWeHvS3v+C9ipgpKA@mail.gmail.com
Whole thread Raw
In response to Re: Add standard collation UNICODE  (Jeff Davis <pgsql@j-davis.com>)
Responses Re: Add standard collation UNICODE
List pgsql-hackers
On Sun, Mar 5, 2023 at 7:30 AM Jeff Davis <pgsql@j-davis.com> wrote:
> Sorting by codepoint should be encoding-independent (i.e. decode to
> codepoint first); but the C collation is just strcmp, which is
> encoding-dependent. So is UCS_BASIC wrong today?

It's created for UTF-8 only, and UTF-8 sorts the same way as the
encoded code points, when interpreted as a sequence of unsigned char
by memcmp(), strcmp() etc.  Seems right?



pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: [Proposal] Allow pg_dump to include all child tables with the root table
Next
From: Joseph Koshakow
Date:
Subject: Re: Date-time extraneous fields with reserved keywords