
Re: Patch for collation using ICU - Mailing list pgsql-hackers

From	Tatsuo Ishii
Subject	Re: Patch for collation using ICU
Date	May 8, 2005 10:09:07
Msg-id	20050508.220827.104049106.t-ishii@sra.co.jp
In response to	Re: Patch for collation using ICU ("John Hansen" <john@geeknet.com.au>)
List	pgsql-hackers


> > I don't buy it. If the current conversion tables do the right
> > thing, why do we need to replace them? Or if the conversion
> > tables are not correct, why don't you fix them? I think the rules
> > of character conversion will not change frequently, especially
> > for LATIN languages. Thus the maintenance cost is not too high.
> 
> I never said we need to, but if we're going to implement ICU,
> then we might as well go all the way.

So you admit there's no benefit in using ICU to replace the existing
conversions?

Besides the fact that ICU does not support all existing conversions, I
think ICU has a serious flaw when used for conversion. If I understand
correctly, ICU uses UNICODE internally to do the conversion. For
example, to implement SJIS->EUC_JP conversion, ICU first converts SJIS
to UNICODE, then converts UNICODE to EUC_JP. The problem is that these
conversions are not round-trip safe (conversion between SJIS/EUC_JP and
UNICODE will lose some information). Thus an SJIS->EUC_JP->SJIS
conversion using ICU does not preserve the original text.
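
To illustrate the round-trip concern, here is a minimal sketch of a
conversion that goes through a pivot encoding. The tables are made up for
illustration (they are not real SJIS, EUC_JP, or UNICODE mappings), and the
names TOY_A_TO_PIVOT, TOY_PIVOT_TO_B, invert, convert, and round_trip are
hypothetical. It only assumes that two distinct source codes map to the same
pivot value, which is the kind of many-to-one mapping that would lose
information.

```python
# Toy tables, NOT real SJIS/EUC_JP/UNICODE data.
# Encoding A -> pivot: codes 0x02 and 0x03 collide on the same pivot value.
TOY_A_TO_PIVOT = {0x01: "X", 0x02: "Y", 0x03: "Y"}
# Pivot -> encoding B: one code per pivot value.
TOY_PIVOT_TO_B = {"X": 0x11, "Y": 0x12}


def invert(table):
    """Build a reverse table; for duplicate values the first code seen wins."""
    reverse = {}
    for code, value in table.items():
        reverse.setdefault(value, code)
    return reverse


TOY_PIVOT_TO_A = invert(TOY_A_TO_PIVOT)
TOY_B_TO_PIVOT = invert(TOY_PIVOT_TO_B)


def convert(data, to_pivot, from_pivot):
    """Convert a list of codes by going through the pivot for every code."""
    return [from_pivot[to_pivot[code]] for code in data]


def round_trip(data):
    """A -> pivot -> B, then B -> pivot -> A."""
    as_b = convert(data, TOY_A_TO_PIVOT, TOY_PIVOT_TO_B)
    return convert(as_b, TOY_B_TO_PIVOT, TOY_PIVOT_TO_A)


original = [0x01, 0x02, 0x03]
restored = round_trip(original)
print(original == restored)  # False: 0x03 came back as 0x02
```

In this toy model the distinction between 0x02 and 0x03 is already gone once
the text reaches the pivot, so no reverse table can recover it. Whether and
where real SJIS/EUC_JP data hits such collisions depends on the actual
mapping tables in use.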
--
Tatsuo Ishii
