Home > mailing lists

Re: Unicode combining characters - Mailing list pgsql-hackers

From	Tatsuo Ishii
Subject	Re: Unicode combining characters
Date	September 25, 2001 21:03:23
Msg-id	20010926100313X.t-ishii@sra.co.jp Whole thread Raw
In response to	Re: Unicode combining characters (Patrice Hédé <phede-ml@islande.org>)
List	pgsql-hackers

Tree view

> > > - length() on the server side doesn't handle correctly Unicode [I
> > >   have the same result with char_length()], and returns the number
> > >   of chars (as it is however advertised to do), rather the length
> > >   of the string.
> > 
> > This is a known limitation.
> 
> To solve this, we could use wcwidth() (there is a custom
> implementation for the systems which don't have it in the glibc). I'll
> have a look at it later.

And wcwidth() depends on the locale. That is the another reason we
could not use it.

> As Oleg suggested, I will try to aim for 7.3, first with a version in
> contrib, and later, if the implementation is fine, it could be moved
> to the core (or not ? Though it would be nice to make sure every
> PostgreSQL installation which supports unicode has it, so that users
> won't need to have administrative rights to use the functionality).

I would like to see SQL99's charset, collate functionality for 7.3 (or
later). If this happens, current multibyte implementation would be
dramatically changed. That would be a good timing to merge your
Unicode stuffs into the main source tree.
--
Tatsuo Ishii

pgsql-hackers by date:

From: Doug McNaught
Date: 25 September 2001, 15:27:29
Subject: O_DIRECT and performance

From: Bruce Momjian
Date: 25 September 2001, 21:40:39
Subject: Re: Beta time

Re: Unicode combining characters - Mailing list pgsql-hackers

Previous

Next