Re: Postgres Encoding conversion problem - Mailing list pgsql-general

From Clemens Schwaighofer
Subject Re: Postgres Encoding conversion problem
Date
Msg-id 480DA52B.20702@tequila.co.jp
Whole thread Raw
In response to Re: Postgres Encoding conversion problem  ("Albe Laurenz" <laurenz.albe@wien.gv.at>)
Responses Re: Postgres Encoding conversion problem  ("Albe Laurenz" <laurenz.albe@wien.gv.at>)
List pgsql-general
On 04/22/2008 05:37 PM, Albe Laurenz wrote:
> Clemens Schwaighofer wrote:
>> I sometimes have a problem with conversion of encodings eg from UTF-8
>> tio ShiftJIS:
>>
>> ERROR:  character 0xf0a0aeb7 of encoding "UTF8" has no
>> equivalent in "SJIS"
>>
>> I have no idea what character this is, I cannot view it in my
>> browser, etc.
>
> It translates to Unicode 10BB7, which is not defined.
> I guess that is not intended; can you guess what the character(s) should be?

to be honest no idea. its some chinese character, I have no idea how the
user input this, because this is a japanese page.

I actually found the carachter, but only my Mac OS X can show it. It
looks similar to a japanese character used for a name, but how the
chinese one got selected is a mystery to me ...

>> If I run the conversion through PHP with mb_convert_encoding it works,
>> perhaps he is ignoring the character.
>>
>> Is there a way to do a similar thing, like ignoring this character in
>> postgres too?
>
> As far as I know, no.
> You'll have to fix the data before you import them.

well, the web page & data is in utf8 so I never see this issue, except I
would write a method that detects illegal shift_jis characters, and
thats difficult.

The reporting is only done in CSV ... so I am not sure if it is worth to
waste too much time here.

thanks for the tip.

--
[ Clemens Schwaighofer                      -----=====:::::~ ]
[ IT Engineer/Manager, TEQUILA\ Japan IT Group               ]
[                6-17-2 Ginza Chuo-ku, Tokyo 104-8167, JAPAN ]
[ Tel: +81-(0)3-3545-7703            Fax: +81-(0)3-3545-7343 ]
[ http://www.tequila.co.jp                                   ]

pgsql-general by date:

Previous
From: "Albe Laurenz"
Date:
Subject: Re: Postgres Encoding conversion problem
Next
From: "D. Dante Lorenso"
Date:
Subject: How to modify ENUM datatypes?