Re: case insensitive match in unicode - Mailing list pgsql-general

From	SunWuKung
Subject	Re: case insensitive match in unicode
Date	April 6, 2006 18:12:43
Msg-id	MPG.1e9fa2a061975939989698@news.postgresql.org
In response to	case insensitive match in unicode (SunWuKung <Balazs.Klein@axelero.hu>)
Responses	Re: case insensitive match in unicode Re: case insensitive match in unicode
List	pgsql-general

In article <20060327114037.GD30791@svana.org>, kleptog@svana.org says...
> On Mon, Mar 27, 2006 at 12:45:05PM +0200, SunWuKung wrote:
> > This sounds like a very interesting concept.
> > It wouldn't be 'case insensitive' just insensitive.
> >
> > The way I imagine it now is a special case of the ~ function.
> > I create matchgroups in a table and check each character if it is in the
> > group. If it is I will replace the character with the group in [éÉE],
> > [oóOÓ??] and do a regexp with that.
>
> No need to reinvent the wheel. ICU provides a range of services to deal
> with this. For example the following filter in ICU:
>
>  NFD; [:Nonspacing Mark:] Remove; NFC.
>
> Will remove all accents from characters. And it works for all Unicode
> characters. With a bit more thinking you can work with case variations
> also.
>
> There is also a locale-independent case-mapping module there, plus
> various locale-specific ones.
>
> http://icu.sourceforge.net/userguide/Transform.html
> http://icu.sourceforge.net/userguide/caseMappings.html
> http://icu.sourceforge.net/userguide/normalization.html
>
> Have a nice day,
>
Thanks, I looked at this and it looks like something that would indeed
solve the problem.
However, so far I have been unable to figure out how I could use this
from within Postgres. If you have experience with it, could you give me
an example?

Thanks
Balázs
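
For reference, the quoted ICU filter (NFD; [:Nonspacing Mark:] Remove; NFC) can be approximated outside ICU. The following is a minimal sketch using only Python's standard unicodedata module, not ICU itself and not anything posted in this thread. The function names strip_accents and insensitive_key are hypothetical, and the case handling uses Python's locale-independent str.casefold() as a stand-in for ICU's case mapping.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Approximate 'NFD; [:Nonspacing Mark:] Remove; NFC' with the stdlib."""
    # NFD splits each accented character into a base character plus marks.
    decomposed = unicodedata.normalize("NFD", text)
    # General category "Mn" is Unicode's Nonspacing Mark class.
    kept = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    # NFC recomposes whatever is left into canonical composed form.
    return unicodedata.normalize("NFC", kept)


def insensitive_key(text: str) -> str:
    """Hypothetical helper: accent-free, case-folded comparison key."""
    return strip_accents(text).casefold()


if __name__ == "__main__":
    print(strip_accents("éÉE"))
    print(insensitive_key("Balázs") == insensitive_key("BALAZS"))
```

Two strings then match "insensitively" when their keys are equal. How to call this from within Postgres is not answered here; wrapping such a function in a procedural language such as PL/Python is one possible route, but that is an assumption rather than something confirmed in the thread.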
