Home > mailing lists

Re: text column indexing in UTF-8 database - Mailing list pgsql-general

From	Steve Atkins
Subject	Re: text column indexing in UTF-8 database
Date	March 12, 2009 21:33:28
Msg-id	4875B492-F3CF-425A-AE4A-6AE9CDB86125@blighty.com Whole thread Raw
In response to	text column indexing in UTF-8 database (Reece Hart <reece@harts.net>)
Responses	Re: text column indexing in UTF-8 database
List	pgsql-general

Tree view

On Mar 12, 2009, at 5:15 PM, Reece Hart wrote:

> Do I really need 4 indexes per column to handle the 4 combinations of
> {equality,like/regexp} x {unfolded,casefolded} in a UTF-8 encoded
> database?
>
> I have a column that I'd like to be able to search with equality and
> regexp (or like), optionally casefolded. The database is UTF-8
> encoded.
> The table and index defs are below.
>
> Jeff Davis gave me a tip to use text_pattern_ops on indexes to speed
> up
> regexp and like; that worked beautiful. But I discovered a caveat that
> t_p_o apparently doesn't handle equality. Thus, I think I need
> distinct
> indexes for the 4 cases above. Right?

If A=B then lower(A) = lower(B), and if A like B then lower(A) like
lower(B).

So, if nothing else, you could rewrite "where alias = 'Foo'" as
"where lower(alias) = lower('Foo') and alias='Foo'" and take advantage
of the lower() functional index.

I've read that 8.4 will be able to use a text_pattern_ops index for
equality.

Cheers,
   Steve

pgsql-general by date:

From: Adrian Klaver
Date: 12 March 2009, 21:23:34
Subject: Re: Fwd: Question about Privileges

From: Jeff Davis
Date: 12 March 2009, 22:04:28
Subject: Re: text column indexing in UTF-8 database

Re: text column indexing in UTF-8 database - Mailing list pgsql-general

Previous

Next