Re: Win32 unicode vs ICU - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Win32 unicode vs ICU
Date
Msg-id 3044.1124813024@sss.pgh.pa.us
Whole thread Raw
In response to Re: Win32 unicode vs ICU  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
I wrote:
> (Just looking at it again, the code in convert_string_to_scalar is
> pretty bogus for multibyte encodings in any case.  Possibly we need to
> rethink the whole approach.)

After studying this some more, I think the code is really so bogus for
any non-ASCII situation that it's probably not worth worrying about
too much.  It's effectively assuming that the output of strxfrm() is
still in an ASCII-superset encoding ... but I don't see anything in
strxfrm's API that guarantees any such thing.

As long as strxfrm() doesn't fail completely for Windows Unicode,
I'd recommend just leaving this alone.  As previously noted, the
worst that can happen is an estimation error that's bounded by the
histogram bin size anyhow.
        regards, tom lane


pgsql-hackers by date:

Previous
From: Josh Berkus
Date:
Subject: Re: Let's drop some GUCs (bgwriter)
Next
From: "Joshua D. Drake"
Date:
Subject: Re: Inconsistencies with create role