Re: LIKE search and performance - Mailing list pgsql-performance

From	PFC
Subject	Re: LIKE search and performance
Date	May 24, 2007 19:07:15
Msg-id	op.tsuqhpzycigqcu@apollo13
In response to	Re: LIKE search and performance (Mark Lewis <mark.lewis@mir3.com>)
List	pgsql-performance

> PG could scan the index looking for matches first and only load the
> actual rows if it found a match, but that could only be a possible win
> if there were very few matches, because the difference in cost between a
> full index scan and a sequential scan would need to be greater than the
> cost of randomly fetching all of the matching data rows from the table
> to look up the visibility information.

    If you need to do that kind of thing, i.e. scanning a large table of many
columns while checking only one column, then don't use an index. An index,
being a btree, has to be traversed in order (otherwise a lot of locking
problems come up), which means some random accesses.

    So, you could make a table with 2 columns, kept up to date via triggers:
your text field, and the primary key of your main table. Scanning that would
be faster, since it is much smaller than the main table.
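
    A minimal sketch of that side-table idea, for illustration only. It uses
Python's built-in sqlite3 module as a stand-in so it runs without a server;
PostgreSQL trigger syntax is different (trigger functions in PL/pgSQL), and
every table, column and trigger name here (main_table, main_text, body, ...)
is hypothetical, not taken from the thread.

```python
import sqlite3

# SQLite in-memory database used as a stand-in for PostgreSQL (assumption).
# All table, column and trigger names below are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE main_table (
    id     INTEGER PRIMARY KEY,
    body   TEXT NOT NULL,
    other1 TEXT,
    other2 TEXT
);

-- Narrow side table: only the primary key and the searchable text.
CREATE TABLE main_text (
    main_id INTEGER PRIMARY KEY,
    body    TEXT NOT NULL
);

-- Triggers keep the side table in sync with the main table.
CREATE TRIGGER main_table_ai AFTER INSERT ON main_table BEGIN
    INSERT INTO main_text (main_id, body) VALUES (NEW.id, NEW.body);
END;

CREATE TRIGGER main_table_au AFTER UPDATE ON main_table BEGIN
    UPDATE main_text SET main_id = NEW.id, body = NEW.body
    WHERE main_id = OLD.id;
END;

CREATE TRIGGER main_table_ad AFTER DELETE ON main_table BEGIN
    DELETE FROM main_text WHERE main_id = OLD.id;
END;
""")

conn.executemany(
    "INSERT INTO main_table (body, other1, other2) VALUES (?, ?, ?)",
    [
        ("John Smith called", "x", "y"),
        ("nothing here", "x", "y"),
        ("John Doe", "x", "y"),
    ],
)
conn.execute("UPDATE main_table SET body = 'Johnny went home' WHERE id = 2")
conn.execute("DELETE FROM main_table WHERE id = 3")


def search(pattern):
    """Scan only the narrow side table and return matching primary keys."""
    rows = conn.execute(
        "SELECT main_id FROM main_text WHERE body LIKE ? ORDER BY main_id",
        (pattern,),
    )
    return [row[0] for row in rows]


print(search("%John%"))  # [1, 2]
```

    The LIKE scan only ever reads main_text; the wide rows of main_table are
fetched afterwards, by primary key, for the few ids that matched.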

    Still, better solutions for searching in text are:

    - tsearch2 if you need whole words
    - trigrams for any substring match
    - xapian for full text search with wildcards (e.g. John* matches Johnny)

    Speed-wise, those three will beat any seq scan on a large table by a huge
margin.
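
    To show why trigrams help with substring matches, here is a small
pure-Python sketch of the general technique. It is not how any particular
PostgreSQL module implements it (real implementations normalise and pad words
differently); the class and function names are made up for this example. The
idea: index every 3-character window of each text, intersect the posting
lists for the search string's trigrams to get a small candidate set, then
verify only those candidates.

```python
from collections import defaultdict


def trigrams(text):
    """Return the set of 3-character windows of the lowercased text."""
    s = text.lower()
    return {s[i:i + 3] for i in range(len(s) - 2)}


class TrigramIndex:
    """Hypothetical in-memory inverted index: trigram -> set of document ids."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.docs = {}

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for gram in trigrams(text):
            self.postings[gram].add(doc_id)

    def search(self, needle):
        """Return sorted ids of documents containing needle (case-insensitive)."""
        grams = trigrams(needle)
        if grams:
            # A document containing the needle must contain all its trigrams.
            candidates = set.intersection(
                *(self.postings.get(gram, set()) for gram in grams)
            )
        else:
            # Needle shorter than 3 characters: the index cannot help,
            # so fall back to checking every document.
            candidates = set(self.docs)
        low = needle.lower()
        # Recheck candidates, since sharing trigrams does not prove a match.
        return sorted(d for d in candidates if low in self.docs[d].lower())


idx = TrigramIndex()
idx.add(1, "Johnny B. Goode")
idx.add(2, "Elton John")
idx.add(3, "Joan Jett")

print(idx.search("john"))  # [1, 2]
print(idx.search("jett"))  # [3]
```

    Only the candidates that share every trigram with the search string are
rechecked, so most rows are never looked at.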
