Home > mailing lists

Re: Confused with db client encoding - Mailing list pgsql-general

From	Ian Barwick
Subject	Re: Confused with db client encoding
Date	September 6, 2004 16:51:54
Msg-id	1d581afe04090608511a3d9a2f@mail.gmail.com Whole thread Raw
In response to	Confused with db client encoding (Carlos Correia <carlos@m16e.com>)
List	pgsql-general

Tree view

On Mon, 06 Sep 2004 00:02:24 +0100, Carlos Correia <carlos@m16e.com> wrote:
> Hi,
>
> Here is the output a psql session. Please notice that the identation
> inconsistences in the records containg non ASCII chars is as outputed by
> psql.
>
> The db was created with LANIN9 and the console was ran (in the same
> machine) using UTF-8 (my system's default).
>
> I was surprised to notice that setting the client to unicode (which is
> what that console is using) messed the localized chars as I was
> expecting to see the opposite way.
>
> On the other way, when invoking from a Java app, running on the same
> machine, the accentuaded chars also appeared messed.

(...)
>            3 | Tx. Dinheiro         | TransacÃ§Ãµes a Dinheiro
>           11 | Nota de CrÃ©dito     | Notas de CrÃ©dito
>           12 | Nota de DÃ©bito      | Notas de DÃ©bito
>           21 | G. Remessa           | Guia de Remessa

It looks like this data was entered as UTF-8 but the client encoding
was LATIN9 (or whatever), meaning the two incoming bytes from each
accentuated character in UTF-8 was interpreted by the backend as two
individual bytes in LATINx.

Test case (session in a UTF-8 environment):

test=# CREATE DATABASE ctest encoding 'LATIN1';
CREATE DATABASE
test=# \c ctest;
You are now connected to database "ctest".
ctest=# CREATE TABLE coding (data TEXT);
CREATE TABLE
ctest=# SET client_encoding TO LATIN1;
SET
ctest=# INSERT INTO coding VALUES('müller');
INSERT 349960 1
ctest=# SELECT * FROM coding;
  data
---------
 müller
(1 row)

ctest=# SET client_encoding TO UNICODE;
SET
ctest=# SELECT * FROM coding;
  data
---------
 mÃ¼ller
(1 row)

Ian Barwick

pgsql-general by date:

From: Richard Huxton
Date: 06 September 2004, 15:17:24
Subject: Re: Overloading || ( text,text ) operator

From: Tom Lane
Date: 06 September 2004, 17:25:47
Subject: Re: tg_relation doesn't seem to have the attribute names!

Re: Confused with db client encoding - Mailing list pgsql-general

Previous

Next