Re: UTF8 or Unicode - Mailing list pgsql-hackers

From Bruce Momjian
Subject Re: UTF8 or Unicode
Date
Msg-id 200502150305.j1F358K20470@candle.pha.pa.us
Whole thread Raw
In response to Re: UTF8 or Unicode  (Abhijit Menon-Sen <ams@oryx.com>)
Responses Re: UTF8 or Unicode  (Karel Zak <zakkr@zf.jcu.cz>)
List pgsql-hackers
Abhijit Menon-Sen wrote:
> At 2005-02-14 21:14:54 -0500, pgman@candle.pha.pa.us wrote:
> >
> > Should our multi-byte encoding be referred to as UTF8 or Unicode?
> 
> The *encoding* should certainly be referred to as UTF-8. Unicode is a
> character set, not an encoding; Unicode characters may be encoded with
> UTF-8, among other things.
> 
> (One might think of a charset as being a set of integers representing
> characters, and an encoding as specifying how those integers may be
> converted to bytes.)
> 
> > I know UTF8 is a type of unicode but do we need to rename anything
> > from Unicode to UTF8?
> 
> I don't know. I'll go through the documentation to see if I can find
> anything that needs changing.

I looked at encoding.sgml and that mentions Unicode, and then UTF8 as an
acronym. I am wondering if we need to make UTF8 first and Unicode
second.  Does initdb accept UTF8 as an encoding?

--  Bruce Momjian                        |  http://candle.pha.pa.us pgman@candle.pha.pa.us               |  (610)
359-1001+  If your life is a hard drive,     |  13 Roberts Road +  Christ can be your backup.        |  Newtown Square,
Pennsylvania19073
 


pgsql-hackers by date:

Previous
From: "Joshua D. Drake"
Date:
Subject: Re: 8.0.X and the ARC patent
Next
From: Bruce Momjian
Date:
Subject: Re: 8.0.X and the ARC patent