Re: Tsearch2 and Unicode? - Mailing list pgsql-general

From dom@happygiraffe.net (Dominic Mitchell)
Subject Re: Tsearch2 and Unicode?
Date
Msg-id 20041118133423.GA54846@ppe.happygiraffe.net
In response to Re: Tsearch2 and Unicode?  (Dawid Kuroczko <qnex42@gmail.com>)
List pgsql-general
On Thu, Nov 18, 2004 at 12:17:01PM +0100, Dawid Kuroczko wrote:
> Correct me if I am wrong, but I think that UTF-8 is almost identical
> to ISO-8859-1 in binary form.  I mean, UTF-8 is ISO-8859-1 plus
> multibyte characters from other charsets.

No, UTF-8 and ISO-8859-1 differ for every character from 128 up; they
only agree on the ASCII range (0-127).  Even though code point 160
represents "non breaking space" in both, it is encoded as different
bytes: 0xA0 in ISO-8859-1 and 0xC2 0xA0 in UTF-8.
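
To see the difference for yourself, here is a minimal sketch in Python
(not something from the original thread; the variable names are just
illustrative).  It encodes the non breaking space both ways and also
checks that the two encodings agree byte for byte below 128:

```python
# U+00A0 is "non breaking space", code point 160 in both character sets.
nbsp = "\u00a0"

# Encode the same character in each encoding and compare the bytes.
latin1_bytes = nbsp.encode("iso-8859-1")
utf8_bytes = nbsp.encode("utf-8")

print("ISO-8859-1:", latin1_bytes.hex())
print("UTF-8:     ", utf8_bytes.hex())

# Below 128 (plain ASCII) both encodings produce the same single byte.
ascii_same = all(
    chr(i).encode("iso-8859-1") == chr(i).encode("utf-8")
    for i in range(128)
)

# From 128 to 255 the encodings never match: ISO-8859-1 uses one byte,
# UTF-8 uses two.
high_differs = all(
    chr(i).encode("iso-8859-1") != chr(i).encode("utf-8")
    for i in range(128, 256)
)

print("identical below 128:", ascii_same)
print("different from 128 to 255:", high_differs)
```

So data containing only ASCII will look the same in either encoding,
which is probably where the "almost identical" impression comes from.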

-Dom
