Re: Pre-proposal: unicode normalized text - Mailing list pgsql-hackers

From	Peter Eisentraut
Subject	Re: Pre-proposal: unicode normalized text
Date	October 11, 2023 09:51:27
Msg-id	3af7e977-660d-4161-85fe-d5f4a205aa3e@eisentraut.org
In response to	Re: Pre-proposal: unicode normalized text (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: Pre-proposal: unicode normalized text
List	pgsql-hackers

On 10.10.23 16:02, Robert Haas wrote:
> On Tue, Oct 10, 2023 at 2:44 AM Peter Eisentraut <peter@eisentraut.org> wrote:
>> Can you restate what this is supposed to be for?  This thread appears to
>> have morphed from "let's normalize everything" to "let's check for
>> unassigned code points", but I'm not sure what we are aiming for now.
> 
> Jeff can say what he wants it for, but one obvious application would
> be to have the ability to add a CHECK constraint that forbids
> inserting unassigned code points into your database, which would be
> useful if you're worried about forward-compatibility with collation
> definitions that might be extended to cover those code points in the
> future.

I don't see how this would really work in practice.  Whether your data 
has unassigned code points or not, when the collations are updated to 
the next Unicode version, the collations will have a new version number, 
and so you need to run the refresh procedure in any case.
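
As an illustration of the kind of check under discussion, here is a minimal sketch in Python (not PostgreSQL code, and not taken from the thread). The helper name unassigned_code_points is hypothetical. It relies on the standard unicodedata module, which reports general category "Cn" for code points that are unassigned (or noncharacters) in the Unicode version bundled with the running interpreter. The answer is therefore tied to a particular Unicode version, which is the same version dependence the collation refresh has to deal with.

```python
import unicodedata


def unassigned_code_points(text):
    """Return the code points in `text` whose general category is Cn.

    Cn ("Other, not assigned") covers unassigned code points and
    noncharacters. The result depends on the Unicode version of the
    interpreter's database, see unicodedata.unidata_version.
    """
    return [ord(ch) for ch in text if unicodedata.category(ch) == "Cn"]


if __name__ == "__main__":
    print("Unicode data version:", unicodedata.unidata_version)
    # U+0378 is assumed here to be unassigned in the bundled Unicode data.
    sample = "abc\u0378"
    print([f"U+{cp:04X}" for cp in unassigned_code_points(sample)])
```

A string that passes this check under one Unicode version says nothing about whether collation versions will change later, so the check does not by itself remove the need to run the refresh procedure.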
