Re: Regexp match with accented character problem - Mailing list pgsql-novice

From Tom Lane
Subject Re: Regexp match with accented character problem
Date
Msg-id 3963.1276005182@sss.pgh.pa.us
Whole thread Raw
In response to Regexp match with accented character problem  (Laslo Forro <getforum@gmail.com>)
Responses Re: Regexp match with accented character problem  (Laslo Forro <getforum@gmail.com>)
List pgsql-novice
Laslo Forro <getforum@gmail.com> writes:
> It seems that accented characters are not recognized as \w.

Just FYI, that's a known problem with the regex operators if you're
using UTF8 database encoding (or more generally, any multibyte encoding,
but UTF8 is usually the one people complain about).  I don't believe
updating to 8.4 would have fixed it for you --- maybe the reason the
problem went away is you switched to a different encoding, such as one
of the LATINn family?

There is a tentative fix in 9.0, FWIW.

            regards, tom lane

pgsql-novice by date:

Previous
From: Thom Brown
Date:
Subject: Re: Regexp match with accented character problem
Next
From: Jon Jensen
Date:
Subject: Re: The Two Towers