Unicode encoding - Mailing list pgsql-general

From William Sweet
Subject Unicode encoding
Date
Msg-id BCAAA5D64C837641A9EBB93E2A5089480847E81C@ex2k01.corp.register.com
Whole thread Raw
Responses Re: Unicode encoding
List pgsql-general
Hi Guys,
 
I've been searching the web lately for discussions on PostgreSQL multibyte support and UTF-8 support. The result was similar to the situation of asking 10 doctors the same question and getting 10 different answers. So, I solicit your help on solving this great mystery.
 
As I understand it, multibyte support has been enabled since v7.3. I am running a RH9 installation that had v7.3 as an included pkg, so I installed it as part of the RH9 installation. So I assume multibyte support is enabled. Now, I'd like to only store Unicode chars in my PostgreSQL dbs. I hear there are 3 ways to accomplish this:
 
1) during PostgreSQL configure/build (installation level)
2) during initdb (cluster level)
3) CREATE DATABASE (db level)
 
...but there are some "not-so-happy" stories on the net. For instance, "it's not 'true' Unicode support when implemented at the db level", or "sorting and regex do not work properly with a cluster level implementation", etc. I've read the v7.3 Admin Guide section 7.2 Multibyte support... sounds reasonable. So my question is, what is the official way to enable "true" Unicode storage and retrieval, so that LIKE, sorting, and regex in perl::DBI work properly? I am a tad concerned also that I don't see PostgreSQL mentioned on the Unicode products page; http://www.unicode.org/onlinedat/products.html
 
Any advice would be greatly appreciated.
 
Thanks, Will
 

pgsql-general by date:

Previous
From: Robin Munn
Date:
Subject: Illegal characters in database names, table names, user names...
Next
From: Deepakkumar.Vasudevan@cybernetsoft.com
Date:
Subject: Downloading Windows Version of PGSQL