Re: Latest on CITEXT 2.0 - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Latest on CITEXT 2.0
Date
Msg-id 11143.1214926987@sss.pgh.pa.us
Whole thread Raw
In response to Re: Latest on CITEXT 2.0  ("Marko Kreen" <markokr@gmail.com>)
List pgsql-hackers
"Marko Kreen" <markokr@gmail.com> writes:
> There's another thing I'm probably missing: does current code handle
> multi-wchar codepoints?  Or is it guaranteed they don't happen?

AFAIK we disallow multi-wchar situations (by rejecting the UTF8
combining codes).

> (Wasn't wchar_t usually 16bit value?)

Hmm.  It's unsigned int on my ancient HPUX box.  I think we could have a
problem on any machines whose mbstowcs doesn't support 4-byte UTF8
codes, though ... in particular, what about Windows?
        regards, tom lane


pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Latest on CITEXT 2.0
Next
From: "Marko Kreen"
Date:
Subject: Re: Latest on CITEXT 2.0