Re: converting databases form SQL_ASCII to UTF8 - Mailing list pgsql-general

From Eric McKeeth
Subject Re: converting databases form SQL_ASCII to UTF8
Date
Msg-id BANLkTinYg586oQ2Hd8d3bpJ_rGimqugqtw@mail.gmail.com
Whole thread Raw
In response to Re: converting databases form SQL_ASCII to UTF8  (Geoffrey Myers <geof@serioustechnology.com>)
List pgsql-general
On Fri, Apr 22, 2011 at 9:16 AM, Geoffrey Myers <geof@serioustechnology.com> wrote:
Vick Khera wrote:

The database's enforcement of the encoding should be the last layer that does so.  Your applications should be enforcing strict utf-8 encoding from start to finish.  Once this is done, and the old data already in the DB is properly encoded as utf-8, then there should be no problems switching on the utf-8 encoding in postgres to get that final layer of verification.

Totally agree.  Still, the question remains, why not leave it as SQL_ASCII?


Well, if your data is supposed to be UTF-8 encoded, then any of those characters with invalid encoding in UTF-8 could reasonably be viewed as data errors. Leaving the database in SQL-ASCII allows those errors to continue accumulating, which will make a switch in the future even harder. If the lack of being able to check encoding errors at the database level doesn't bother you, and you're fine with risking bigger pain later in order to avoid pain now, then I see no compelling reason to move away from SQL_ASCII.

-Eric

pgsql-general by date:

Previous
From: Rob Sargent
Date:
Subject: Re: converting databases form SQL_ASCII to UTF8
Next
From: SUBHAM ROY
Date:
Subject: Re: Needs Suggestion