C11: should we use char32_t for unicode code points? - Mailing list pgsql-hackers

From Jeff Davis
Subject C11: should we use char32_t for unicode code points?
Date
Msg-id bedcc93d06203dfd89815b10f815ca2de8626e85.camel@j-davis.com
List pgsql-hackers
Now that we're using C11, should we use char32_t for unicode code
points?

Right now, we use pg_wchar for two purposes: 

  1. to abstract away some problems with wchar_t on platforms where
     it's 16 bits; and
  2. to hold unicode code point values

In UTF8, they are equivalent and can be freely cast back and forth,
but not necessarily in other encodings. That can be confusing in some
contexts. Attached is a patch to use char32_t for the second purpose.

Both are equivalent to uint32, so there's no functional change and no
actual typechecking; it's just for readability.
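
To illustrate the "no actual typechecking" point, here is a minimal
sketch. It is not taken from the patch: demo_wchar is a stand-in for
pg_wchar (assumed here to be a plain unsigned int typedef), and the
demo_* functions are hypothetical names made up for this example. The
signatures document which values are code points and which are
encoding-dependent wide characters, but the compiler accepts either
type in either place.

```c
#include <assert.h>   /* static_assert (C11) */
#include <uchar.h>    /* char32_t (C11) */

/* Stand-in for pg_wchar; assumed to be an unsigned int typedef. */
typedef unsigned int demo_wchar;

/* char32_t is a typedef for uint_least32_t, so it is at least 32 bits. */
static_assert(sizeof(char32_t) * 8 >= 32,
              "char32_t must be able to hold any unicode code point");

/* Hypothetical helper: the parameter type says "this is a code point". */
int
demo_is_surrogate(char32_t cp)
{
    return cp >= 0xD800 && cp <= 0xDFFF;
}

/* Hypothetical helper: the parameter type says "encoding-dependent value". */
demo_wchar
demo_next_wchar(demo_wchar wc)
{
    return wc + 1;
}

/*
 * Mixing the two types needs no cast: both are unsigned integer typedefs,
 * so the conversion is implicit. The distinction is only a hint to the
 * reader, not something the compiler enforces.
 */
int
demo_mixed_use(void)
{
    demo_wchar wc = 0xD800;     /* declared as the "wrong" type on purpose */

    return demo_is_surrogate(wc);
}
```

Getting real typechecking would need something like a wrapper struct,
which is a much more invasive change than the rename proposed here.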

Is this helpful, or needless code churn?

Regards,
    Jeff Davis


Attachment
