BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char - Mailing list pgsql-bugs

From Grzegorz Daniluk
Subject BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char
Date
Msg-id 201011241643.oAOGhhB5083127@wwwmaster.postgresql.org
Responses Re: BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-bugs
The following bug has been logged online:

Bug reference:      5766
Logged by:          Grzegorz Daniluk
Email address:      gdaniluk@gmail.com
PostgreSQL version: 9.0.1
Operating system:   Windows 7 64-bit
Description:        regexp \y doesn't work properly when a word starts or
ends with a UTF-8 char
Details:

select regexp_replace('Foo Pasaż Bar', E'\\yPasaż\\y', '');

The query above does not replace the word 'Pasaż'. It returns the full
string 'Foo Pasaż Bar', whereas the correct result is 'Foo  Bar'.

When the 'ż' is replaced with a plain ASCII character such as 'z',
regexp_replace works as expected.
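
For comparison, the two cases can be run side by side. This is a minimal sketch that assumes the same database and the same E'' string syntax as the query above; the column aliases are illustrative only.

```sql
-- Non-ASCII case from the report: the word ends with 'ż'.
-- Per the report, this returns 'Foo Pasaż Bar' unchanged.
select regexp_replace('Foo Pasaż Bar', E'\\yPasaż\\y', '') as non_ascii_case;

-- ASCII-only variant: 'ż' replaced with 'z' in both the input and the pattern.
-- Per the report, this behaves as expected and returns 'Foo  Bar'.
select regexp_replace('Foo Pasaz Bar', E'\\yPasaz\\y', '') as ascii_case;
```

If only the first query leaves the input unchanged, the difference points at how the \y word-boundary check classifies the non-ASCII character, not at the rest of the pattern.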

My db details:
ENCODING = 'UTF8'
LC_COLLATE = 'Polish_Poland.1250'
LC_CTYPE = 'Polish_Poland.1250'
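
The settings above can be read back from the system catalog when reproducing this. The query below is a sketch assuming PostgreSQL 8.4 or later, where pg_database carries the datcollate and datctype columns.

```sql
-- Show the encoding and locale settings of the current database.
select datname,
       pg_encoding_to_char(encoding) as encoding,
       datcollate,
       datctype
from pg_database
where datname = current_database();
```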
