Re: once again, sorting with Unicode - Mailing list pgsql-sql

From Antti Haapala
Subject Re: once again, sorting with Unicode
Date
Msg-id Pine.GSO.4.44.0302191413590.21258-100000@paju.oulu.fi
Whole thread Raw
In response to Re: once again, sorting with Unicode  ("Troy" <tjk@tksoft.com>)
Responses Re: once again, sorting with Unicode  ("Troy" <tjk@tksoft.com>)
List pgsql-sql
On Wed, 19 Feb 2003, Troy wrote:

> > I have a multi-lingual database (currently 11 languages) which sorts
> > fine in MySQL (8859-1 character set) I have now converted the data to
> > Unicode and compiled Postgre with unicode support.
> >
> > I can select and insert unicode and so was rather pleased about that.
> > Until I saw that it wasn't working properly when ordering!
>
> The cause for the different values is the fact that unicode characters
> have different numeric values from ISO8859-1 and other encodings. Only
> ascii values are in sync with unicode numeric values. This I am sure you
> knew.

No, ISO8859-1 maps directly to unicode up to U+00FF. So the actual
_numeric_ values are the same. But actual byte patterns are encoding
dependent.

Have you set database encoding to UTF-8? Are you using proper UTF-8
locales? POSIX compiled locales are often charset dependent.

-- 
Antti Haapala





pgsql-sql by date:

Previous
From: "Troy"
Date:
Subject: Re: once again, sorting with Unicode
Next
From: Richard Huxton
Date:
Subject: Re: select from update from select?