Home > mailing lists

Re: Pre-proposal: unicode normalized text - Mailing list pgsql-hackers

From	Jeff Davis
Subject	Re: Pre-proposal: unicode normalized text
Date	October 17, 2023 16:32:18
Msg-id	dfeff43884f7c3da50e32fc93cb2383255aa2e18.camel@j-davis.com Whole thread Raw
In response to	Re: Pre-proposal: unicode normalized text ("Daniel Verite" <daniel@manitou-mail.org>)
List	pgsql-hackers

Tree view

On Tue, 2023-10-17 at 17:07 +0200, Daniel Verite wrote:
> There's a problem in the fact that the set of assigned code points is
> expanding with every Unicode release, which happens about every year.
>
> If we had this option in Postgres 11 released in 2018 it would use
> Unicode 11, and in 2023 this feature would reject thousands of code
> points that have been assigned since then.

That wouldn't be good for everyone, but might it be good for some
users?

We already expose normalization functions. If users are depending on
normalization, and they have unassigned code points in their system,
that will break when we update Unicode. By restricting themselves to
assigned code points, normalization is guaranteed to be forward-
compatible.

Regards,
    Jeff Davis

pgsql-hackers by date:

From: Robert Haas
Date: 17 October 2023, 16:21:00
Subject: Re: run pgindent on a regular basis / scripted manner

From: Tom Lane
Date: 17 October 2023, 16:33:09
Subject: Re: Fix output of zero privileges in psql

Re: Pre-proposal: unicode normalized text - Mailing list pgsql-hackers

Previous

Next