Home > mailing lists

Re: invalidly encoded strings - Mailing list pgsql-hackers

From	Andrew Dunstan
Subject	Re: invalidly encoded strings
Date	September 9, 2007 12:56:41
Msg-id	46E417AB.10202@dunslane.net Whole thread Raw
In response to	Re: invalidly encoded strings (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: invalidly encoded strings
List	pgsql-hackers

Tree view


Tom Lane wrote:
>
> A possible answer is to add a verifymbstr to the string literal
> converter anytime it has processed a numeric backslash-escape in the
> string.  Open questions for that are (1) does it have negative effects
> for bytea, and if so is there any hope of working around it?  (2) how
> can we postpone the conversion/test to the parse analysis phase?
>   

Finding out how to do (2) seems to me the only possible answer to (1). 
I'll have a look.

Is that going to cover data coming in via COPY? and parameters for 
prepared statements?


>> . for chr() under UTF8, it seems to be generally agreed that the 
>> argument should represent the codepoint and the function should return 
>> the correspondingly encoded character. If so, possible the argument 
>> should be a bigint to accommodate the full range of possible code 
>> points. It is not clear what the argument should represent for other 
>> multi-byte encodings for any argument higher than 127. Similarly, it is 
>> not clear what ascii() should return in such cases. I would be inclined 
>> just to error out.
>>     
>
> In SQL_ASCII I'd argue for allowing 0..255.  In actual MB encodings,
> OK with throwing error.
>
>     
>   

I was planning on allowing up to 255 for all single byte encodings too.

cheers

andrew

pgsql-hackers by date:

From: Tom Lane
Date: 09 September 2007, 12:54:44
Subject: Re: [BUGS] 'SET LOCAL ROLE blah;' doesn't work?

From: Tom Lane
Date: 09 September 2007, 13:14:48
Subject: Re: invalidly encoded strings

Re: invalidly encoded strings - Mailing list pgsql-hackers

Previous

Next