Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8) - Mailing list pgsql-bugs

From	Duncan Rance
Subject	Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8)
Date	February 15, 2012 08:21:39
Msg-id	35CBD9EE-B188-4FD2-B1D6-2576B06D3BC4@dunquino.com
In response to	Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8) (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-bugs

On 14 Feb 2012, at 18:28, Tom Lane wrote:
>
> Oh, I see the reason for this: the code in cclass() in regc_locale.c
> doesn't go further up than U+00FF, so no codes above that will be
> thought to be letters (or members of any other character class).
> Clearly we need to go further when we are dealing with UTF8.
> I'm not sure what a sane limit would be though.

The Basic Multilingual Plane goes up to U+FFFF:

https://en.wikipedia.org/wiki/Mapping_of_Unicode_characters#Planes
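
To make the effect of the limit concrete, here is a rough Python sketch. It is not the code from cclass() in regc_locale.c; it only mimics the idea of building a character class by scanning code points up to some ceiling. The function name alpha_members and the sample character U+0105 are my own choices for illustration, and Python's str.isalpha() stands in for whatever classification the server actually uses.

```python
def alpha_members(max_codepoint):
    """Collect every code point in [0, max_codepoint] that Python treats as a letter.

    Hypothetical helper: it imitates a class built by scanning up to a fixed ceiling.
    """
    return {cp for cp in range(max_codepoint + 1) if chr(cp).isalpha()}


# Ceiling at U+00FF, as described in the quoted message.
latin1_only = alpha_members(0x00FF)

# Ceiling at U+FFFF, the end of the Basic Multilingual Plane.
whole_bmp = alpha_members(0xFFFF)

sample = "\u0105"  # LATIN SMALL LETTER A WITH OGONEK, just above U+00FF

print(ord(sample) in latin1_only)  # False: the scan never reaches it
print(ord(sample) in whole_bmp)    # True: raising the ceiling picks it up
```

Scanning the whole BMP costs 65536 checks instead of 256, and it still leaves out anything in the supplementary planes, so U+FFFF is only one candidate for a sane limit.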
