Re: Pre-proposal: unicode normalized text - Mailing list pgsql-hackers

From Peter Eisentraut
Subject Re: Pre-proposal: unicode normalized text
Date
Msg-id 3af7e977-660d-4161-85fe-d5f4a205aa3e@eisentraut.org
Whole thread Raw
In response to Re: Pre-proposal: unicode normalized text  (Robert Haas <robertmhaas@gmail.com>)
Responses Re: Pre-proposal: unicode normalized text
List pgsql-hackers
On 10.10.23 16:02, Robert Haas wrote:
> On Tue, Oct 10, 2023 at 2:44 AM Peter Eisentraut <peter@eisentraut.org> wrote:
>> Can you restate what this is supposed to be for?  This thread appears to
>> have morphed from "let's normalize everything" to "let's check for
>> unassigned code points", but I'm not sure what we are aiming for now.
> 
> Jeff can say what he wants it for, but one obvious application would
> be to have the ability to add a CHECK constraint that forbids
> inserting unassigned code points into your database, which would be
> useful if you're worried about forward-compatibility with collation
> definitions that might be extended to cover those code points in the
> future.

I don't see how this would really work in practice.  Whether your data 
has unassigned code points or not, when the collations are updated to 
the next Unicode version, the collations will have a new version number, 
and so you need to run the refresh procedure in any case.




pgsql-hackers by date:

Previous
From: "Zhijie Hou (Fujitsu)"
Date:
Subject: Add null termination to string received in parallel apply worker
Next
From: Peter Eisentraut
Date:
Subject: Re: Pre-proposal: unicode normalized text