encoding question - Mailing list pgsql-admin

From Ben K.
Subject encoding question
Date
Msg-id Pine.GSO.4.64.0603201643250.21796@coe.tamu.edu
Whole thread Raw
Responses Re: encoding question  (Ivo Rossacher <rossacher@bluewin.ch>)
Re: encoding question  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-admin
Hi,

My main server is on Solaris (LANG=C), postgresql 8.1.0, with encoding
SQL_ASCII.

I'm trying to set up a backup server on linux (LANG=en_US.UTF-8), with
encoding UTF8.

I thought UTF8 was safe if all I have is ascii, since there was a relevant
discussion in this list a short while ago, but I'm getting this error, for
a few records (out of tens of thousands in that database).

===============
ERROR:  invalid UTF-8 byte sequence detected near byte 0x85
CONTEXT:  COPY tblcoursesinfo, line 30, column title: "Seminar in<85>
..."
===============

The dump was created by 8.1.0's pg_dump with no options.

That line in the dump has the following contents.

Text:

n<85>       Knowledg

Binary:

02e2100: 6e85 2020 2020 2020 2020 2020 2020 2020  n.
02e2110: 2020 2020 2020 2020 2020 2020 2020 2020
02e2120: 2020 2020 2020 2020 2020 2020 2020 2020
02e2130: 2020 2020 2020 2020 2020 2020 2020 2020
02e2140: 2020 2020 2020 2020 2020 2020 2020 2020
02e2150: 2020 2020 2020 2020 2020 2020 2020 2020
02e2160: 2020 2020 2020 2020 2020 2020 2020 2020
02e2170: 2020 2020 2020 2009 4b6e 6f77 6c65 6467         .Knowledg

Where could this be coming from? Would appreciate help.


Regards,

Ben K.
Developer
http://benix.tamu.edu

pgsql-admin by date:

Previous
From: Ivo Rossacher
Date:
Subject: Re: Character encoding problems and dump import
Next
From: Ivo Rossacher
Date:
Subject: Re: encoding question