Home > mailing lists

Re: collate not support Unicode Variation Selector - Mailing list pgsql-hackers

From	Thomas Munro
Subject	Re: collate not support Unicode Variation Selector
Date	August 3, 2022 05:02:08
Msg-id	CA+hUKGK31mD5JUfTwnq5OuBx4i5eO2TaXSV1yvScQppG_Sx+Dg@mail.gmail.com Whole thread Raw
In response to	Re: collate not support Unicode Variation Selector (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: collate not support Unicode Variation Selector
List	pgsql-hackers

Tree view

On Wed, Aug 3, 2022 at 12:56 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Maybe it would help if you run the strings through normalize() first?
> I'm not sure if that can combine combining characters.

I think the similarity between Latin combining characters and these
ideographic variations might end there.  I don't think there is a
single codepoint version of U&'\+003436' || U&'\+0E0101', unlike é.
This system is for controlling small differences in rendering for the
"same" character[1].  My computer doesn't even show the OP's example
glyphs as different (to my eyes, at least; I can see on a random
picture I found[2] that the one with the e0101 selector is supposed to
have a ... what do you call that ... a tiny gap :-)).

[1] http://www.unicode.org/reports/tr37/tr37-14.html
[2] https://glyphwiki.org/wiki/u3436

pgsql-hackers by date:

From: Dmitry Koterov
Date: 03 August 2022, 04:57:41
Subject: Does having pg_last_wal_replay_lsn[replica] >= pg_current_wal_insert_lsn[master] guarantee that the replica is caught up?

From: Dong Wook Lee
Date: 03 August 2022, 05:19:59
Subject: Re: pgstattuple: add test for coverage

Re: collate not support Unicode Variation Selector - Mailing list pgsql-hackers

Previous

Next