I've forgot to mention I'm working on Windows XP SP3
Yes, we are using UTF8 encoding and regexp works wrong. It looks like
you replicated that.
2012/5/21 Albe Laurenz <laurenz.albe@wien.gv.at>:
>
> I tried it with 9.1.3 on Linux:
>
> upper() and lower() works fine, no matter what the
> database encoding is:
>
> test=> SELECT upper('acząčž');
> upper
> --------
> ACZĄČŽ
> (1 row)
>
> And this seems OK with LATIN7:
>
> lt2=> SHOW server_encoding;
> server_encoding
> -----------------
> LATIN7
> (1 row)
>
> lt2=> SHOW lc_ctype;
> lc_ctype
> ----------
> lt_LT
> (1 row)
>
> lt2=> SHOW lc_collate;
> lc_collate
> ------------
> lt_LT
> (1 row)
>
> lt2=> SELECT 'ą' ~* '\w';
> ?column?
> ----------
> t
> (1 row)
>
> But it looks wrong with UTF8:
>
> lt=> SHOW server_encoding;
> server_encoding
> -----------------
> UTF8
> (1 row)
>
> lt=> SHOW lc_ctype;
> lc_ctype
> ------------
> lt_LT.utf8
> (1 row)
>
> lt=> SHOW lc_collate;
> lc_collate
> ------------
> lt_LT.utf8
> (1 row)
>
> lt=> SELECT 'ą' ~* '\w';
> ?column?
> ----------
> f
> (1 row)
>
>
> Is that what you are complaining about?
>
> Yours,
> Laurenz Albe