Thanks Steve,
Yes, we're using SQL_ASCII.
Could you please be more specific about the manual data cleanup work? I'm new to Postgres and have no prior
experience with character set conversion, so any specific experience you can share would be very much appreciated.
Thanks,
Suya
-----Original Message-----
From: pgsql-general-owner@postgresql.org [mailto:pgsql-general-owner@postgresql.org] On Behalf Of Steve Atkins
Sent: Friday, October 18, 2013 11:08 AM
To: pgsql-general@postgresql.org General
Subject: Re: [GENERAL] [ADMIN] what's the efficient/safest way to convert database character set ?
On Oct 17, 2013, at 3:13 PM, "Huang, Suya" <Suya.Huang@au.experian.com> wrote:
> Hi,
>
> I've got a question about converting a database from ascii to UTF-8. What's the best approach if the database
> size is very large? A detailed procedure or shared experience would be much appreciated!
>
The answer to that depends on what you mean by "ascii".
If your current database uses SQL_ASCII encoding - that's not ascii. It could have anything in there, including any mix
of encodings, and there's been no enforcement of any encoding, so there's no way of knowing what they are. If you've had,
for example, webapps that let people paste Word documents into them, you potentially have different encodings used in
different rows of the same table.
If your current data is like that then you're probably looking at doing some (manual) data cleanup to work out what
encoding your data is really in, and converting it to something consistent, rather than a simple migration from ascii to
utf8.
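As a rough illustration of what that cleanup can look like, here is a minimal Python sketch that sorts raw byte
strings into "already ascii", "already valid utf-8", or "decodes under a fallback encoding". The function name
classify and the fallback list are hypothetical, and cp1252 is only an assumption about what pasted Word text
might be in; use whatever encodings your applications could really have written. A successful decode is a guess,
not proof, so anything that isn't plain ascii or valid utf-8 still needs a human to look at it.

```python
from typing import Optional, Sequence, Tuple

# Assumption: cp1252 is a plausible legacy encoding for this data.
# Replace with the encodings your applications could really have produced.
FALLBACK_ENCODINGS = ("cp1252",)


def classify(raw: bytes,
             fallbacks: Sequence[str] = FALLBACK_ENCODINGS
             ) -> Tuple[str, Optional[str]]:
    """Guess how a raw value from a SQL_ASCII database is encoded.

    Returns (label, decoded_text). The label is "ascii", "utf-8", the
    name of the first fallback encoding that decodes cleanly, or
    "unknown" (with decoded_text set to None) if nothing works.
    """
    # Pure 7-bit data is identical in ascii and utf-8: nothing to fix.
    try:
        return "ascii", raw.decode("ascii")
    except UnicodeDecodeError:
        pass

    # Strict utf-8 decoding rejects most text written in legacy
    # single-byte encodings, so success here is a strong signal.
    try:
        return "utf-8", raw.decode("utf-8")
    except UnicodeDecodeError:
        pass

    # Try each candidate legacy encoding in order. This is a guess:
    # several encodings may accept the same bytes, so review the output.
    for enc in fallbacks:
        try:
            return enc, raw.decode(enc)
        except UnicodeDecodeError:
            continue

    return "unknown", None


if __name__ == "__main__":
    samples = [b"plain text", b"caf\xc3\xa9", b"caf\xe9", b"\x93quoted\x94", b"\x81"]
    for raw in samples:
        label, text = classify(raw)
        print(f"{raw!r:22} {label:8} {text!r}")
```

You would feed this the raw bytes of each text column (fetched without any client-side conversion), count how many
rows land in each bucket, and then decide per table, or per row, what to convert from before loading into a UTF8
database.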
Cheers,
Steve