Re: Concerning about Unicode-aware string handling - Mailing list pgsql-general

From Vincas Dargis
Subject Re: Concerning about Unicode-aware string handling
Date
Msg-id CAPNCXk0zzHBEgrMXfLCV6WpB3qVhun9xOTe4sRe4ogUaspYSBg@mail.gmail.com
Whole thread Raw
In response to Re: Concerning about Unicode-aware string handling  ("Albe Laurenz" <laurenz.albe@wien.gv.at>)
List pgsql-general
I've forgot to mention I'm working on Windows XP SP3

Yes, we are using UTF8 encoding and regexp works wrong. It looks like
you replicated that.

2012/5/21 Albe Laurenz <laurenz.albe@wien.gv.at>:
>
> I tried it with 9.1.3 on Linux:
>
> upper() and lower() works fine, no matter what the
> database encoding is:
>
> test=> SELECT upper('acząčž');
>  upper
> --------
>  ACZĄČŽ
> (1 row)
>
> And this seems OK with LATIN7:
>
> lt2=> SHOW server_encoding;
>  server_encoding
> -----------------
>  LATIN7
> (1 row)
>
> lt2=> SHOW lc_ctype;
>  lc_ctype
> ----------
>  lt_LT
> (1 row)
>
> lt2=> SHOW lc_collate;
>  lc_collate
> ------------
>  lt_LT
> (1 row)
>
> lt2=> SELECT 'ą' ~* '\w';
>  ?column?
> ----------
>  t
> (1 row)
>
> But it looks wrong with UTF8:
>
> lt=> SHOW server_encoding;
>  server_encoding
> -----------------
>  UTF8
> (1 row)
>
> lt=> SHOW lc_ctype;
>  lc_ctype
> ------------
>  lt_LT.utf8
> (1 row)
>
> lt=> SHOW lc_collate;
>  lc_collate
> ------------
>  lt_LT.utf8
> (1 row)
>
> lt=> SELECT 'ą' ~* '\w';
>  ?column?
> ----------
>  f
> (1 row)
>
>
> Is that what you are complaining about?
>
> Yours,
> Laurenz Albe

pgsql-general by date:

Previous
From: Samba
Date:
Subject: Re: Global Named Prepared Statements
Next
From: Tom Lane
Date:
Subject: Re: Concerning about Unicode-aware string handling