Home > mailing lists

Re: Concerning about Unicode-aware string handling - Mailing list pgsql-general

From	Vincas Dargis
Subject	Re: Concerning about Unicode-aware string handling
Date	May 21, 2012 11:03:09
Msg-id	CAPNCXk0zzHBEgrMXfLCV6WpB3qVhun9xOTe4sRe4ogUaspYSBg@mail.gmail.com Whole thread Raw
In response to	Re: Concerning about Unicode-aware string handling ("Albe Laurenz" <laurenz.albe@wien.gv.at>)
List	pgsql-general

Tree view

I've forgot to mention I'm working on Windows XP SP3

Yes, we are using UTF8 encoding and regexp works wrong. It looks like
you replicated that.

2012/5/21 Albe Laurenz <laurenz.albe@wien.gv.at>:
>
> I tried it with 9.1.3 on Linux:
>
> upper() and lower() works fine, no matter what the
> database encoding is:
>
> test=> SELECT upper('acząčž');
>  upper
> --------
>  ACZĄČŽ
> (1 row)
>
> And this seems OK with LATIN7:
>
> lt2=> SHOW server_encoding;
>  server_encoding
> -----------------
>  LATIN7
> (1 row)
>
> lt2=> SHOW lc_ctype;
>  lc_ctype
> ----------
>  lt_LT
> (1 row)
>
> lt2=> SHOW lc_collate;
>  lc_collate
> ------------
>  lt_LT
> (1 row)
>
> lt2=> SELECT 'ą' ~* '\w';
>  ?column?
> ----------
>  t
> (1 row)
>
> But it looks wrong with UTF8:
>
> lt=> SHOW server_encoding;
>  server_encoding
> -----------------
>  UTF8
> (1 row)
>
> lt=> SHOW lc_ctype;
>  lc_ctype
> ------------
>  lt_LT.utf8
> (1 row)
>
> lt=> SHOW lc_collate;
>  lc_collate
> ------------
>  lt_LT.utf8
> (1 row)
>
> lt=> SELECT 'ą' ~* '\w';
>  ?column?
> ----------
>  f
> (1 row)
>
>
> Is that what you are complaining about?
>
> Yours,
> Laurenz Albe

pgsql-general by date:

From: Samba
Date: 21 May 2012, 10:56:28
Subject: Re: Global Named Prepared Statements

From: Tom Lane
Date: 21 May 2012, 11:05:36
Subject: Re: Concerning about Unicode-aware string handling

Re: Concerning about Unicode-aware string handling - Mailing list pgsql-general

Previous

Next