Re: Bug in UTF8-Validation Code? - Mailing list pgsql-hackers

From Martijn van Oosterhout
Subject Re: Bug in UTF8-Validation Code?
Msg-id 20070403143618.GA5405@svana.org
In response to Re: Bug in UTF8-Validation Code?  ("Albe Laurenz" <all@adv.magwien.gv.at>)
Responses Re: Bug in UTF8-Validation Code?  (Mark Dilger <pgsql@markdilger.com>)
List pgsql-hackers
On Tue, Apr 03, 2007 at 11:43:21AM +0200, Albe Laurenz wrote:
> IMHO this is the only good and intuitive way for CHR() and ASCII().

Hardly. The earlier comment about mbtowc was much closer to the mark.
And wide characters are defined as Unicode code points.

Basically, CHR() takes a Unicode code point and returns that character
as a string in the appropriate encoding. ASCII() does the reverse.
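
To make the proposed semantics concrete, here is a minimal sketch in Python, assuming the database encoding is UTF-8. The names chr_utf8 and ascii_utf8 are hypothetical stand-ins for CHR() and ASCII(); this is not PostgreSQL's implementation.

```python
def chr_utf8(code_point: int) -> bytes:
    """Hypothetical CHR(): Unicode code point -> that character, UTF-8 encoded."""
    return chr(code_point).encode("utf-8")


def ascii_utf8(encoded: bytes) -> int:
    """Hypothetical ASCII(): code point of the first character of a UTF-8 string."""
    return ord(encoded.decode("utf-8")[0])


euro = chr_utf8(0x20AC)        # U+20AC EURO SIGN
print(euro)                    # b'\xe2\x82\xac'
print(hex(ascii_utf8(euro)))   # 0x20ac
```

The caller only ever deals with the code point (0x20AC); the three-byte sequence is a detail of the encoding.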

Just about every multibyte encoding other than Unicode has the problem
of not distinguishing between the code point and its encoding.
Unicode, by contrast, is a collection of encodings (UTF-8, UTF-16,
UTF-32) that all share the same set of code points.
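
As a rough illustration of that last point, the sketch below encodes one code point with three Unicode encodings and decodes each result back. The byte sequences differ, but the code point is the same every time. The choice of U+20AC and the variable names are only for illustration.

```python
code_point = 0x20AC  # U+20AC EURO SIGN
ch = chr(code_point)

# One code point, three different byte-level encodings.
encoded = {name: ch.encode(name) for name in ("utf-8", "utf-16-be", "utf-32-be")}

for name, data in encoded.items():
    # Decoding recovers the same code point regardless of encoding.
    recovered = ord(data.decode(name))
    print(f"{name:10} bytes={data.hex()} code_point={recovered:#x}")
```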

Have a nice day,
--
Martijn van Oosterhout   <kleptog@svana.org>   http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to litigate.
