Re: UTF-8 and LIKE vs = - Mailing list pgsql-general

From Ian Barwick
Subject Re: UTF-8 and LIKE vs =
Date
Msg-id 1d581afe04082317071dfca8ab@mail.gmail.com
Whole thread Raw
In response to Re: UTF-8 and LIKE vs =  (David Wheeler <david@kineticode.com>)
Responses Re: UTF-8 and LIKE vs =  (David Wheeler <david@kineticode.com>)
List pgsql-general
On Mon, 23 Aug 2004 16:50:04 -0700, David Wheeler <david@kineticode.com> wrote:
> On Aug 23, 2004, at 4:34 PM, Ian Barwick wrote:
>
> > wild speculation in need of a Korean speaker, but:
> >
> > ian@linux:~/tmp> cat j.txt
> > テスト
> > 환경설
> > 전검색
> > 웹문서
> > 국방비
> > 북한의
> > てすと
> > ian@linux:~/tmp> uniq  j.txt
> > テスト
> > 환경설
> > てすと
> >
> > All but the first and last lines are random Korean (Hangul)
> > characters. Evidently our respective locales think all Hangul strings
> > of the same length are identical, which is very probably not the
> > case...
>
> Does this go away if you change your locale to C?

Yes.

Ian Barwick

pgsql-general by date:

Previous
From: David Wheeler
Date:
Subject: Re: UTF-8 and LIKE vs =
Next
From: "borajetta"
Date:
Subject: Re: ERROR: duplicate key violates unique constraint "pg_class_oid_index"