Re: Weird behaviour on a join with multiple keys - Mailing list pgsql-general

From Tom Lane
Subject Re: Weird behaviour on a join with multiple keys
Date
Msg-id 4269.1173481452@sss.pgh.pa.us
Whole thread Raw
In response to Re: Weird behaviour on a join with multiple keys  (Charlie Clark <charlie@begeistert.org>)
List pgsql-general
Charlie Clark <charlie@begeistert.org> writes:
> Am 09.03.2007 um 16:15 schrieb Tom Lane:
>> There's your problem right there.  The string comparison routines are
>> built on strcoll(), which is going to expect UTF8-encoded data because
>> of the LC_COLLATE setting.  If there are any high-bit-set LATIN1
>> characters in the database, they will most likely look like invalid
>> encoding to strcoll(), and on most platforms that causes it to behave
>> very oddly.  You need to keep lc_collate (and lc_ctype) in sync with
>> server_encoding.

> That does indeed seem to have been the problem even though the
> examples I was looking at were all using plain ASCII characters. Glad
> to know it wasn't a bug and to have learned something new.

Well, it *is* a bug: we really shouldn't let you select incompatible
locale and encoding settings.  This gotcha has been known for a long
time, but it's not clear that there's a bulletproof, portable way to
determine which encoding a particular locale setting implies ...

            regards, tom lane

pgsql-general by date:

Previous
From: Don Lavelle
Date:
Subject: Re: Beginner's Questions
Next
From: "Ted Byers"
Date:
Subject: Re: Setting week starting day