Re: error while trying to change the database encoding on a database - Mailing list pgsql-general

From Martijn van Oosterhout
Subject Re: error while trying to change the database encoding on a database
Date
Msg-id 20110124175353.GA1909@svana.org
Whole thread Raw
In response to Re: error while trying to change the database encoding on a database  (Geoffrey Myers <lists@serioustechnology.com>)
List pgsql-general
On Mon, Jan 24, 2011 at 12:16:46PM -0500, Geoffrey Myers wrote:
> We hope to identify the characters and fix them in the existing
> database, then convert.  It appears to be very limited, but it would
> help if there was some way to identify these characters outside of
> simply doing the reload of the data and finding the errors.
>
> Hence the reason I asked about a resource that might identify the
> characters.

Short answer, any byte with the high bit set.

You're going to need to assign them a meaning. Additionally you're
going to have to fix your code to only output correct encoded data.

The suggestion to simply reload the database as if all the current data
was WIN1251 or Latin-9 is a fairly easy way to getting the database
into a reasonable format. The data would have to be checked though.

Have a nice day,
--
Martijn van Oosterhout   <kleptog@svana.org>   http://svana.org/kleptog/
> Patriotism is when love of your own people comes first; nationalism,
> when hate for people other than your own comes first.
>                                       - Charles de Gaulle

Attachment

pgsql-general by date:

Previous
From: Geoffrey Myers
Date:
Subject: Re: error while trying to change the database encoding on a database
Next
From: Adrian Klaver
Date:
Subject: Re: error while trying to change the database encoding on a database