Re: Unicode problems on IRC - Mailing list pgsql-hackers

From Bruce Momjian
Subject Re: Unicode problems on IRC
Whole thread Raw
In response to Unicode problems on IRC  (Christopher Kings-Lynne)
List pgsql-hackers
Christopher Kings-Lynne wrote:
> Hey guys,
> The 'Unicode characters above 0x10000' issue keeps rearing its ugly head 
> in the IRC channel.  I propose that it be fixed, even backported...
> This is John Hansen's most recent patch to fix it:
> And from what I can tell it was committed, then reverted because it 
> wasn't a "bug".  It was going to go in for 8.1.
> We on the channel are starting to think that it is in fact a bug.  There 
> are are people with legitimately utf-8 encoded XML documents that they 
> cannot store in PostgreSQL.  Apparently in the distant past, Unicode was 
> limited to 0x10000, but then was extended.
> Perhaps we can reopen this case...

Uh, I thought we fixed this another way, buy not using Unicode-aware
functions for upper/lower/initcap when the locale is "C" or "POSIX". 
That is backpatched to 8.0.X.  Does that not fix the problem reported?

--  Bruce Momjian                        |                |  (610)
359-1001+  If your life is a hard drive,     |  13 Roberts Road +  Christ can be your backup.        |  Newtown Square,

pgsql-hackers by date:

From: Simon Riggs
Subject: Re: prepared statements don't log arguments?
From: Andrew - Supernews
Subject: Re: Unicode problems on IRC