Re: [ADMIN] what's the efficient/safest way to convert database character set ? - Mailing list pgsql-general

From Steve Atkins
Subject Re: [ADMIN] what's the efficient/safest way to convert database character set ?
Msg-id 6E5BC504-F62C-4988-938E-9F86CBCA626D@blighty.com
In response to [ADMIN] what's the efficient/safest way to convert database character set ?  ("Huang, Suya" <Suya.Huang@au.experian.com>)
Responses Re: [ADMIN] what's the efficient/safest way to convert database character set ?
List pgsql-general
On Oct 17, 2013, at 3:13 PM, "Huang, Suya" <Suya.Huang@au.experian.com> wrote:

> Hi,
>
> I’ve got a question of converting database from ascii to UTF-8, what’s the best approach to do so if the database
> size is very large? Detailed procedure or experience sharing are much appreciated!
>

The answer to that depends on what you mean by "ascii".

If your current database uses SQL_ASCII encoding - that's not ascii. It could have anything in there, including any mix
of encodings, and there's been no enforcement of any encoding, so there's no way of knowing what they are. If you've had,
for example, webapps that let people paste Word documents into them, you potentially have different encodings used in
different rows of the same table.

If your current data is like that then you're probably looking at doing some (manual) data cleanup to work out what
encoding your data is really in, and converting it to something consistent, rather than a simple migration from ascii to
utf8.
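
As a rough first pass at that cleanup, something like the sketch below can tell you how much of the data is pure ASCII, how much is already valid UTF-8, and how much is neither. It is only an illustration in Python: it assumes you have already pulled the raw bytes of each text value out of the database somehow, and the names classify, survey and samples are made up for this example. It cannot tell you which legacy encoding the "other" values are in; that part is still manual.

```python
from collections import Counter


def classify(raw: bytes) -> str:
    """Return a rough label for the bytes of one text value."""
    try:
        raw.decode("ascii")
        return "ascii"  # 7-bit clean, safe under any ASCII-compatible encoding
    except UnicodeDecodeError:
        pass
    try:
        raw.decode("utf-8")
        return "utf-8"  # non-ASCII bytes that form valid UTF-8 sequences
    except UnicodeDecodeError:
        return "other"  # some other encoding (or garbage); needs a human


def survey(values):
    """Count how many values fall into each category."""
    return Counter(classify(v) for v in values)


# Hypothetical sample values standing in for bytes read from a SQL_ASCII table.
samples = [
    b"plain text",
    "caf\u00e9".encode("utf-8"),
    "\u201cquoted\u201d".encode("cp1252"),  # "smart quotes" as pasted from Word
]

if __name__ == "__main__":
    for label, count in sorted(survey(samples).items()):
        print(label, count)
```

If everything comes back as "ascii" or "utf-8", the data is most likely already consistent. Any "other" rows are the ones to look at by hand before converting.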

Cheers,
  Steve


