Home > mailing lists

Re: Unicode upper() bug still present - Mailing list pgsql-hackers

From	Tatsuo Ishii
Subject	Re: Unicode upper() bug still present
Date	October 20, 2003 09:37:59
Msg-id	20031020.213715.74754803.t-ishii@sra.co.jp Whole thread Raw
In response to	Re: Unicode upper() bug still present (Hannu Krosing <hannu@tm.ee>)
Responses	Re: Unicode upper() bug still present
List	pgsql-hackers

Tree view

> Tom Lane kirjutas E, 20.10.2003 kell 03:35:
> > Oliver Elphick <olly@lfix.co.uk> writes:
> > > There is a bug in Unicode upper() which has been present since 7.2:
> > 
> > We don't support upper/lower in multibyte character sets, and can't as
> > long as the functionality is dependent on <ctype.h>'s toupper()/tolower().
> > It's been suggested that we could use <wctype.h> where available.
> > However there are a bunch of issues that would have to be solved to make
> > that happen.  (How do we convert between the database character encoding 
> > and the wctype representation?  
> 
> How do we do it for sorting ?
> 
> > How do we even find out what
> > representation the current locale setting expects to use?)
> 
> Why not use the same locale settings as for sorting (i.e. databse
> encoding) until we have a proper multi-locale support in the backend ?

There's absolutely no relationship between database encoding and
locale. IMO depending on the system locale is a completely wrong
design decision and we should go toward for having our own collate
data.  (I think Oracle does this way)
--
Tatsuo Ishii

pgsql-hackers by date:

From: Michael Meskes
Date: 20 October 2003, 05:42:50
Subject: Re: Debian bug report about multibyte in 7.3.3

From: Hannu Krosing
Date: 20 October 2003, 09:44:10
Subject: Re: Unicode upper() bug still present

Re: Unicode upper() bug still present - Mailing list pgsql-hackers

Previous

Next