Re: Getting weird results with unicode table... - Mailing list pgsql-bugs

From Tom Lane
Subject Re: Getting weird results with unicode table...
Msg-id 26642.1101222881@sss.pgh.pa.us
In response to Getting weird results with unicode table...  ("David B" <postgresql@thegatelys.com>)
List pgsql-bugs
"David B" <postgresql@thegatelys.com> writes:
> Two possible bugs.

> #1 Loading unicode data.

I'm thinking that the dump file probably doesn't set client_encoding,
or if it does, sets it to the wrong thing.  Is the data in the dump file
actually UTF8, or is it in some other encoding?
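
One quick way to answer that question is to test whether the bytes in the dump decode cleanly as UTF-8. The following is only a minimal sketch: the function name `looks_like_utf8` and the sample rows are hypothetical and do not come from the original report.

```python
def looks_like_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# Hypothetical sample rows standing in for lines of a dump file.
utf8_row = "Café del Mar".encode("utf-8")
latin1_row = "Café del Mar".encode("latin-1")

print(looks_like_utf8(utf8_row))
print(looks_like_utf8(latin1_row))
```

A clean decode does not prove the data is UTF-8 (plain ASCII passes too), but a failure is a strong hint that the dump is in another encoding. In that case client_encoding has to name the encoding the file really uses, for example with SET client_encoding TO 'LATIN1' before the data is loaded.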

> select substring ( lower(title), 1, 1), count(*)
> from album
> where lower(title) > 'a%' and lower(title) <= 'e%'
> group by substring ( lower(title), 1, 1) ;

> Should get counts for 'a', 'b', 'c', 'd', 'e'.

... plus any other characters that the current locale chooses to sort
between 'a' and 'e'.  In particular I'd expect to see accented 'a' and/or
accented 'e' sort that way.
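
The effect can be sketched outside the database. The example below mimics the query's range filter and grouping on a hypothetical list of titles, once with plain code point comparison and once with `crude_key`, an accent-stripping key that is only a rough stand-in for a real locale collation (an assumption for illustration, not how PostgreSQL or any locale actually computes sort order).

```python
import unicodedata
from collections import Counter


def crude_key(s: str) -> str:
    """Rough stand-in for a locale collation: lowercase, accents stripped."""
    decomposed = unicodedata.normalize("NFD", s.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def first_letter_counts(titles, key=lambda s: s):
    """Mimic the query: keep titles in ('a%', 'e%'], group by first character."""
    lo, hi = key("a%"), key("e%")
    counts = Counter()
    for title in titles:
        low = title.lower()
        if lo < key(low) <= hi:
            counts[low[0]] += 1
    return dict(counts)


# Hypothetical album titles, not data from the original report.
titles = ["Abbey Road", "Álbum Blanco", "Back in Black", "Dookie", "Élan"]

print(first_letter_counts(titles))             # code point order
print(first_letter_counts(titles, crude_key))  # accent-insensitive order
```

With code point order the accented title is filtered out, while the accent-insensitive key lets it through and it shows up as its own 'á' group. Note also that 'a%' and 'e%' are ordinary strings here, as they are with the > and <= operators in SQL; the % acts as a wildcard only with LIKE.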

            regards, tom lane
