Re: LOCALE C.UTF-8 on EDB Windows v17 server - Mailing list pgsql-general

From Daniel Verite
Subject Re: LOCALE C.UTF-8 on EDB Windows v17 server
Date
Msg-id aded3b3d-4849-4bf4-a072-eecf84f5cd4e@manitou-mail.org
Whole thread Raw
In response to Re: LOCALE C.UTF-8 on EDB Windows v17 server  (Dominique Devienne <ddevienne@gmail.com>)
List pgsql-general
    Dominique Devienne wrote:

> So you're saying datcollate and datctype from pg_database are
> irrelevant to PostgreSQL itself, and only extensions might be affects?

Almost. An exception that still exists in v18, as far as I can see [1],
is the default full text search parser still using libc functions like
iswdigit(), iswpunct(), iswspace()... that depend on LC_CTYPE.

So you could see differences between OSes in tsvector contents
in a database with the builtin provider.
Unless using LC_CTYPE=C. But then the parsing is suboptimal, since the
parser does not recognize Unicode fancy punctuation signs or spaces as
such.
Personally I would still care to set LC_CTYPE to a reasonable UTF-8 locale
with v17 or v18.

[1]
https://doxygen.postgresql.org/wparser__def_8c.html#a420ea398a8a11db92412a2af7bf45e40

Best regards,
--
Daniel Vérité
https://postgresql.verite.pro/



pgsql-general by date:

Previous
From: PetSerAl
Date:
Subject: Re: Combining scalar and row types in RETURNING
Next
From: Jeff Davis
Date:
Subject: Re: LOCALE C.UTF-8 on EDB Windows v17 server