Home > mailing lists

Re: [HACKERS] UNICODE characters above 0x10000 - Mailing list pgsql-patches

From	Christopher Kings-Lynne
Subject	Re: [HACKERS] UNICODE characters above 0x10000
Date	August 7, 2004 07:48:04
Msg-id	4114B32B.8080509@familyhealth.com.au Whole thread
In response to	Re: [HACKERS] UNICODE characters above 0x10000 (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-patches

Tree view

> Now it's entirely possible that the underlying support is a few bricks
> shy of a load --- for instance I see that pg_utf_mblen thinks there are
> no UTF8 codes longer than 3 bytes whereas your code goes to 4.  I'm not
> an expert on this stuff, so I don't know what the UTF8 spec actually
> says.  But I do think you are fixing the code at the wrong level.

Surely there are UTF-8 codes that are at least 3 bytes.  I have a
_vague_ recollection that you have to keep escaping and escaping to get
up to like 4 bytes for some asian code points?

Chris

pgsql-patches by date:

From: Tatsuo Ishii
Date: 07 August 2004, 07:44:25
Subject: Re: [HACKERS] UNICODE characters above 0x10000

From: "John Hansen"
Date: 07 August 2004, 07:56:20
Subject: Re: [HACKERS] UNICODE characters above 0x10000

Re: [HACKERS] UNICODE characters above 0x10000 - Mailing list pgsql-patches

Previous

Next