Home > mailing lists

Re: tolower() identifier downcasing versus multibyte encodings - Mailing list pgsql-hackers

From	Francisco Figueiredo Jr.
Subject	Re: tolower() identifier downcasing versus multibyte encodings
Date	March 21, 2011 16:57:59
Msg-id	AANLkTimhti52XEyQB8zx+jBxuZ2KzYoc2HxKa2vtx-xL@mail.gmail.com Whole thread Raw
In response to	Re: tolower() identifier downcasing versus multibyte encodings (Marko Kreen <markokr@gmail.com>)
List	pgsql-hackers

Tree view

I just received a feedback from our bug report about this problem and
it seems the problem also occurred on a windows machine.

http://pgfoundry.org/tracker/index.php?func=detail&aid=1010988&group_id=1000140&atid=590



On Sat, Mar 19, 2011 at 14:13, Marko Kreen <markokr@gmail.com> wrote:
> On Sat, Mar 19, 2011 at 5:05 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Marko Kreen <markokr@gmail.com> writes:
>>> On Sat, Mar 19, 2011 at 6:10 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>>>> Or we could bite the bullet and start using str_tolower(), but the
>>>> performance implications of that are unpleasant; not to mention that
>>>> we really don't want to re-introduce the "Turkish problem" with
>>>> unexpected handling of i/I in identifiers.
>>
>>> How about first pass with 'a' - 'A' and if highbit is found
>>> then str_tolower()?
>>
>> Hm, maybe.
>>
>> There's still the problem of what to do in src/port/pgstrcasecmp.c,
>> which won't have the infrastructure needed to do that.
>
> You mean client-side?  Could we have a str_tolower without xxx_l
> branch that always does wide-char conversion if high-bit is set?
>
> Custom locale there won't make sense there anyway?
>
> --
> marko
>



--
Regards,

Francisco Figueiredo Jr.
Npgsql Lead Developer
http://www.npgsql.org
http://fxjr.blogspot.com
http://twitter.com/franciscojunior

pgsql-hackers by date:

From: Greg Stark
Date: 21 March 2011, 16:56:24
Subject: Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL

From: Heikki Linnakangas
Date: 21 March 2011, 16:58:51
Subject: Chinese initdb on Windows

Re: tolower() identifier downcasing versus multibyte encodings - Mailing list pgsql-hackers

Previous

Next