Home > mailing lists

Re: Bug in UTF8-Validation Code? - Mailing list pgsql-hackers

From	Martijn van Oosterhout
Subject	Re: Bug in UTF8-Validation Code?
Date	April 3, 2007 11:37:05
Msg-id	20070403143618.GA5405@svana.org Whole thread Raw
In response to	Re: Bug in UTF8-Validation Code? ("Albe Laurenz" <all@adv.magwien.gv.at>)
Responses	Re: Bug in UTF8-Validation Code?
List	pgsql-hackers

Tree view

On Tue, Apr 03, 2007 at 11:43:21AM +0200, Albe Laurenz wrote:
> IMHO this is the only good and intuitive way for CHR() and ASCII().

Hardly. The comment earlier about mbtowc was much closer to the mark.
And wide characters are defined as Unicode points.

Basically, CHR() takes a unicode point and returns that character
in a string appropriately encoded. ASCII() does the reverse.

Just about every multibyte encoding other than Unicode has the problem
of not distinguishing between the code point and the encoding of it.
Unicode is a collection of encodings based on the same set.

Have a nice day,
--
Martijn van Oosterhout   <kleptog@svana.org>   http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to litigate.

pgsql-hackers by date:

From: "Luke Lonergan"
Date: 03 April 2007, 11:32:25
Subject: Re: Modifying TOAST thresholds

From: Peter Eisentraut
Date: 03 April 2007, 12:10:23
Subject: Re: Implicit casts to text

Re: Bug in UTF8-Validation Code? - Mailing list pgsql-hackers

Previous

Next