Thread: BUG #2400: 'Æ' considered invalid UTF-8 character

BUG #2400: 'Æ' considered invalid UTF-8 character

From
"Yusuf Siddiqui"
Date:
The following bug has been logged online:

Bug reference:      2400
Logged by:          Yusuf Siddiqui
Email address:      ysiddiqui@i3.com
PostgreSQL version: 8.1
Operating system:   Linux
Description:        'Æ' considered invalid UTF-8 character
Details:

The character 'Æ' is rejected as an invalid UTF-8 character.

Here are the steps used to recreate it:

create table test (text_field text);
insert into test (text_field) values ('Æ');

Returned error:
invalid UTF-8 byte sequence detected near byte 0x92

This statement also does not work:
insert into test (text_field) values ('\Æ');

Re: BUG #2400: 'Æ' considered invalid UTF-8 character

From
tomas@tuxteam.de (Tomas Zerolo)
Date:
On Tue, Apr 18, 2006 at 11:34:53PM +0000, Yusuf Siddiqui wrote:
>
> The following bug has been logged online:
>
> Bug reference:      2400
> Logged by:          Yusuf Siddiqui
> Email address:      ysiddiqui@i3.com
> PostgreSQL version: 8.1
> Operating system:   Linux
> Description:        'Æ' considered invalid UTF-8 character
> Details:
>
> The character 'Æ' is rejected as an invalid UTF-8 character.

Well, maybe it is :-)

> Here are the steps used to recreate it:
>
> create table test (text_field text);
> insert into test (text_field) values ('Æ');
>
> Returned error:
> invalid UTF-8 byte sequence detected near byte 0x92

[...]

I'd need to know more. I gather from your mail that you are entering the
character into psql from a console. Several factors are relevant here:

  - which character encoding does your console use?
    (if it is, e.g., iso-8859-x, then this is probably the culprit)
  - which client encoding is set? (in psql type SHOW CLIENT_ENCODING;)
  - which encoding is the server using? (I'd guess utf-8; it doesn't need
    to be the same as the client's, since it will try to convert.)

HTH
-- tomás

Re: BUG #2400: 'Æ' considered invalid UTF-8 character

From
Peter Eisentraut
Date:
Am Mittwoch, 19. April 2006 01:34 schrieb Yusuf Siddiqui:
> The character 'Æ' is rejected as an invalid UTF-8 character.

Please show the output of

$ psql -c 'show client_encoding'
$ locale
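Once both outputs are in hand, the point of comparing them is to see whether the client encoding agrees with the character set of the terminal's locale. A rough Python sketch of that comparison follows; the function names are hypothetical, and it assumes both names are ones Python's codec registry recognizes. PostgreSQL-specific names such as SQL_ASCII are not recognized and are simply treated as non-matching here.

```python
import codecs


def canonical_codec_name(name):
    """Map an encoding name (e.g. 'UTF8', 'LATIN1') to Python's canonical
    codec name, or None if Python does not know the name."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return None


def encodings_match(client_encoding, locale_charset):
    """Return True if both names refer to the same Python codec."""
    a = canonical_codec_name(client_encoding)
    b = canonical_codec_name(locale_charset)
    return a is not None and a == b


if __name__ == "__main__":
    # Example values only; substitute the real output of the two commands.
    print(encodings_match("UTF8", "UTF-8"))
    print(encodings_match("UTF8", "ISO-8859-1"))
```

A mismatch does not prove a bug by itself, but it is the situation in which the server would receive bytes it cannot interpret as the declared encoding.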

--
Peter Eisentraut
http://developer.postgresql.org/~petere/


Re: BUG #2400: 'Æ' considered invalid UTF-8 character

From
"SunWuKung"
Date:
Are you using EMS PostgreSQL Manager, by any chance?
I kept getting this message for any non-ASCII character that I
tried to insert or import.
I wrote to them, and they said EMS PgManager does not yet support
Unicode.

Balázs