Home > mailing lists

C11: should we use char32_t for unicode code points? - Mailing list pgsql-hackers

From	Jeff Davis
Subject	C11: should we use char32_t for unicode code points?
Date	October 23, 2025 21:15:54
Msg-id	bedcc93d06203dfd89815b10f815ca2de8626e85.camel@j-davis.com Whole thread Raw
Responses	Re: C11: should we use char32_t for unicode code points?
List	pgsql-hackers

Tree view

Now that we're using C11, should we use char32_t for unicode code
points?

Right now, we use pg_wchar for two purposes: 

  1. to abstract away some problems with wchar_t on platforms where
it's 16 bits; and
  2. hold unicode code point values

In UTF8, they are are equivalent and can be freely cast back and forth,
but not necessarily in other encodings. That can be confusing in some
contexts. Attached is a patch to use char32_t for the second purpose.

Both are equivalent to uint32, so there's no functional change and no
actual typechecking, it's just for readability.

Is this helpful, or needless code churn?

Regards,
    Jeff Davis

Attachment

v1-0001-Use-C11-char32_t-for-Unicode-code-points.patch

pgsql-hackers by date:

From: "Matheus Alcantara"
Date: 23 October 2025, 21:14:12
Subject: Re: Include extension path on pg_available_extensions

From: Sami Imseih
Date: 23 October 2025, 21:22:24
Subject: Re: another autovacuum scheduling thread

C11: should we use char32_t for unicode code points? - Mailing list pgsql-hackers

Attachment

Previous

Next