Re: BUG #18870: weird behavior with regexp_replace - Mailing list pgsql-bugs

From Tom Lane
Subject Re: BUG #18870: weird behavior with regexp_replace
Date
Msg-id 2153095.1743110638@sss.pgh.pa.us
Whole thread Raw
In response to BUG #18870: weird behavior with regexp_replace  (PG Bug reporting form <noreply@postgresql.org>)
List pgsql-bugs
PG Bug reporting form <noreply@postgresql.org> writes:
> select regexp_replace(LOWER('Örebro'), '\W', '_', 'g')  in postgres 15 the
> result is örebro which is correct since ö should fit in the \w for a
> regex.
> select regexp_replace(LOWER('Örebro'), '\W', '_', 'g') --> since postgres 17
> the result is _rebro which is incorrect since \w should also contain
> characters like ö, ä, ë.

This most likely indicates that you've got a different database
collation selected in the v17 installation.  Postgres defers to
the LC_CTYPE setting (or, in some configurations, the ICU collation)
to decide what is a letter.  See

https://www.postgresql.org/docs/current/charset.html

psql's "\l" command will give a quick overview of what collations
you have selected.

            regards, tom lane



pgsql-bugs by date:

Previous
From: Tom Lane
Date:
Subject: Re: BUG #18866: Running pg_freespace() on views triggers an Abort
Next
From: Daniel Gustafsson
Date:
Subject: Re: BUG #18845: DEREF_OF_NULL.RET guc_malloc possibly returns NULL