Home > mailing lists

Re: two questions about fulltext searchign / tsvector indexes - Mailing list pgsql-general

From	Jonathan Vanasco
Subject	Re: two questions about fulltext searchign / tsvector indexes
Date	June 10, 2014 17:53:05
Msg-id	B52906B4-04E7-44B7-AEEF-265C9F02521F@2xlp.com Whole thread Raw
In response to	Re: two questions about fulltext searchign / tsvector indexes (Vick Khera <vivek@khera.org>)
List	pgsql-general

Tree view

On Jun 10, 2014, at 8:26 AM, Vick Khera wrote:

Thanks so much for this.

We do a lot of searching on this column, so pre-computing seems to be the way.

I'm not worried about disk space for now, and can revisit that later if there is a problem

Just for clarification on this:

Option A (less fast):

create gin index on tsvector(searchable_column)

Option B (faster):

create tsvector column for `searchable_column`

create gin index on searchable_column

On Mon, Jun 9, 2014 at 8:55 PM, Jonathan Vanasco <postgres@2xlp.com> wrote:
I can't figure out which one to use. This is on a steadily growing table of around 20MM rows that gets 20-80k new records a day, but existing records are rarely updated.

The question as always is a time-space trade-off. How frequently do
you make the full text search? If you do it frequently, then with a
pre-computed tsv column you save all that time per row of computing
the tsvector on every search. If you do it infrequently, the space
savings (and not needing to maintain that column) may benefit you.

Personally in these days of cheap disks I'd go with the dedicated
column. Given that, you want to just have a GIN index on that one
column, and the query you want, given some plain text string like
"fluffy dog" is this:

select plainto_tsquery('fluffy dog') @@ my_tsv_column;

I always use a trigger on insert and update to maintain the ts_vector
column, so there is no doubt of how it was computed by various
programs.

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

// Jonathan Vanasco

c. 646.729.6436 | 415.501.9815

e. jonathan@2xlp.com

w. http://findmeon.com/user/jvanasco

linkedin. http://linkedin.com/in/jonathanvanasco

blog. http://destructuring.net

pgsql-general by date:

From: Keith Fiske
Date: 10 June 2014, 17:40:03
Subject: Re: How can I tell if pg_restore is running?

From: Martin French
Date: 10 June 2014, 17:54:15
Subject: Re: How can I tell if pg_restore is running?

Re: two questions about fulltext searchign / tsvector indexes - Mailing list pgsql-general

Previous

Next