Re: Pre-proposal: unicode normalized text - Mailing list pgsql-hackers

From Robert Haas
Subject Re: Pre-proposal: unicode normalized text
Date
Msg-id CA+TgmobgMg8x7xjq42_tiBT_Yxr+SX5qkp6MGZ3OtvdJ92b9jw@mail.gmail.com
Whole thread Raw
In response to Re: Pre-proposal: unicode normalized text  (Isaac Morland <isaac.morland@gmail.com>)
List pgsql-hackers
On Tue, Oct 17, 2023 at 11:38 AM Isaac Morland <isaac.morland@gmail.com> wrote:
> On Tue, 17 Oct 2023 at 11:15, Robert Haas <robertmhaas@gmail.com> wrote:
>> Are code points assigned from a gapless sequence? That is, is the
>> implementation of codepoint_is_assigned(char) just 'codepoint <
>> SOME_VALUE' and SOME_VALUE increases over time?
>
> Not even close. Code points are organized in blocks, e.g. for mathematical symbols or Ethiopic script. Sometimes new
blocksare added, sometimes new characters are added to existing blocks. Where they go is a combination of convenience,
history,and planning. 

Ah. Good to know.

--
Robert Haas
EDB: http://www.enterprisedb.com



pgsql-hackers by date:

Previous
From: Robert Haas
Date:
Subject: Re: run pgindent on a regular basis / scripted manner
Next
From: Nathan Bossart
Date:
Subject: Re: stopgap fix for signal handling during restore_command