
Re: [GENERAL] psql weird behaviour with charset encodings - Mailing list pgsql-hackers

From	hgonzalez@gmail.com
Subject	Re: [GENERAL] psql weird behaviour with charset encodings
Date	May 7, 2010 22:49:04
Msg-id	0016362842accfef6204860b60f4@google.com
In response to	Re: [GENERAL] psql weird behaviour with charset encodings (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: [GENERAL] psql weird behaviour with charset encodings
List	pgsql-hackers


> However, it appears that glibc's printf
> code interprets the parameter as the number of *characters* to print,
> and to determine what's a character it assumes the string is in the
> environment LC_CTYPE's encoding.

Well, I have trouble believing that myself :-)
This would be nasty... Are you sure?

I couldn't reproduce that.
I made a quick test, passing a UTF-8 encoded string
(5 bytes corresponding to 4 Unicode characters: "niño"),
and my glibc (same Fedora 12) seems to count bytes,
as it should.

#include <stdio.h>

int main(void)
{
    /* "niño" in UTF-8: 5 bytes, 4 characters */
    char s[] = "ni\xc3\xb1o";
    printf("|%.*s|\n", 5, s);
    return 0;
}

This, compiled with gcc 4.4.3 and run with my root locale (UTF-8),
did not pad with a blank, i.e. it worked as expected.

Hernán
