Re: MSSQL to PostgreSQL : Encoding problem - Mailing list pgsql-general

From Arnaud Lesauvage
Subject Re: MSSQL to PostgreSQL : Encoding problem
Date
Msg-id 45648B11.80207@freesurf.fr
Whole thread Raw
In response to Re: MSSQL to PostgreSQL : Encoding problem  (Alvaro Herrera <alvherre@alvh.no-ip.org>)
Responses Re: MSSQL to PostgreSQL : Encoding problem
Re: MSSQL to PostgreSQL : Encoding problem
List pgsql-general
Alvaro Herrera a écrit :
> Arnaud Lesauvage wrote:
>> Alvaro Herrera a écrit :
>> >Arnaud Lesauvage wrote:
>> >
>> >>mydb=# SET client_encoding TO LATIN9;
>> >>SET
>> >>mydb=# COPY statistiques.detailrecherche (log_gid,
>> >>champrecherche, valeurrecherche) FROM
>> >>'E:\\Production\\Temp\\detailrecherche_ansi.csv' CSV;
>> >>ERROR:  invalid byte sequence for encoding "LATIN9": 0x00
>> >>HINT:  This error can also happen if the byte sequence does
>> >>not match the encoding expected by the server, which is
>> >>controlled by "client_encoding".
>> >
>> >Huh, why do you have a "0x00" byte in there?  That's certainly not
>> >Latin9 (nor UTF8 as far as I know).
>> >
>> >Is the file actually Latin-something or did you convert it to something
>> >else at some point?
>>
>> This is the file generated by DTS with "ANSI" encoding. It
>> was not altered in any way after that !
>> The doc states that ANSI exports with the local codepage
>> (which is Win1252). That's all I know. :(
>
> I thought Win1252 was supposed to be almost the same as Latin1.  While
> I'd expect certain differences, I wouldn't expect it to use 0x00 as
> data!
>
> Maybe you could have DTS export Unicode, which would presumably be
> UTF-16, then recode that to something else (possibly UTF-8) with GNU
> iconv.

UTF-16 ! That's something I haven't tried !
I'll try an iconv conversion tomorrow from UTF16 to UTF8 !

--
Arnaud

pgsql-general by date:

Previous
From: Tony Caduto
Date:
Subject: Re: Data transfer between databases over the Internet
Next
From: Bruce Momjian
Date:
Subject: Re: MSSQL to PostgreSQL : Encoding problem