Re: 7.3.2 incorrectly counts characters for unicode varchar field - Mailing list pgsql-bugs

From Matthew Cooper
Subject Re: 7.3.2 incorrectly counts characters for unicode varchar field
Date
Msg-id 000701c37b98$c0a0b970$6600030a@gateway01
In response to 7.3.2 incorrectly counts characters for unicode varchar field  ("Matthew Cooper" <matty@cloverworxs.com>)
List pgsql-bugs
Attached is the UTF-8 encoded sql file in case it got messed up in the mail
transfer.

And here it is, pasted in directly from the window that was displaying
the Chinese characters.

insert into mgc values ('分钟练习分钟练习练习');


Looking at the UTF-8 documentation, 10 Chinese characters could be any
number of bytes, each character taking, say, 2 or 3 bytes.
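
As a rough check of the point above, here is a minimal sketch (assuming
Python 3 is to hand; it is not part of the original report, and the names
char_count and byte_count are only illustrative) that counts the same
string both ways:

```python
# The string from the insert statement above.
s = "分钟练习分钟练习练习"

# Number of Unicode characters (code points) in the string.
char_count = len(s)

# Number of bytes once the string is encoded as UTF-8.
byte_count = len(s.encode("utf-8"))

print(char_count, byte_count)  # prints: 10 30
```

Each of these characters takes 3 bytes in UTF-8, so the 10 characters come
to 30 bytes. If something along the way treats the data as a single-byte
encoding, each byte would be counted as one character, which might be
where a length of 30 comes from.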

Matty.
----- Original Message -----
From: "Tom Lane" <tgl@sss.pgh.pa.us>
To: "Matthew Cooper" <matty@cloverworxs.com>
Cc: <pgsql-bugs@postgresql.org>
Sent: Saturday, September 13, 2003 5:51 PM
Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar
field


> > insert into mgc values ('Ã¥Ë?â? éâ?TŸç»Æ'ä¹ Ã¥Ë?â?
éâ?TŸç»Æ'ä¹ ç»Æ'ä¹ ');
>
> I don't think this string is correctly unicode-encoded.  Anyway "length"
> claims it is 30 characters.
>
> regards, tom lane
>
