Re: invalidly encoded strings - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: invalidly encoded strings
Date
Msg-id 46E54BCD.30301@dunslane.net
Whole thread Raw
In response to Re: invalidly encoded strings  ("Albe Laurenz" <laurenz.albe@wien.gv.at>)
Responses Re: invalidly encoded strings
List pgsql-hackers

Albe Laurenz wrote:
> I'd like to repeat my suggestion for chr() and ascii().
>
> Instead of the code point, I'd prefer the actual encoding of
> the character as argument to chr() and return value of ascii().
>
>
>   
[snip]
> Of course, if it is generally perceived that the code point
> is more useful than the encoding, then Oracle compliance
> is probably secondary.
>
>
>   

Last time this was discussed, you were the only person arguing for that 
behaviour, IIRC.

And frankly, I don't know how to do it sanely anyway. A character 
encoding has a fixed byte pattern, but a given byte pattern doesn't have 
a single universal number value. I really don't think we want to have 
the value of chr(n) depend on the endianness of the machine, do we?

The reason we are prepared to make an exception for Unicode is precisely 
because the code point maps to an encoding pattern independently of 
architecture, ISTM.

cheers

andrew


pgsql-hackers by date:

Previous
From: Simon Riggs
Date:
Subject: Re: Include Lists for Text Search
Next
From: Tom Lane
Date:
Subject: Re: invalidly encoded strings