Re: gsoc, text search selectivity and dllist enhancments - Mailing list pgsql-hackers

From Jan Urbański
Subject Re: gsoc, text search selectivity and dllist enhancments
Msg-id 487671DA.8070905@students.mimuw.edu.pl
In response to Re: gsoc, text search selectivity and dllist enhancments  (Alvaro Herrera <alvherre@commandprompt.com>)
Responses Re: gsoc, text search selectivity and dllist enhancments  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
Alvaro Herrera wrote:
> Jan Urbański wrote:
>
>> Oh, one important thing. You need to choose a bucket width for the LC
>> algorithm, that is decide after how many elements will you prune your
>> data structure. I chose to prune after every twenty tsvectors.
>
> Do you prune after X tsvectors regardless of the numbers of lexemes in
> them?  I don't think that preserves the algorithm properties; if there's
> a bunch of very short tsvectors and then long tsvectors, the pruning
> would take place too early for the initial lexemes.  I think you should
> count lexemes, not tsvectors.

Yes, that's what I was afraid of. I'm not sure why I was reluctant to
prune in the middle of a tsvector; maybe it's just in my head.
Still, there's a decision to be made: after how many lexemes should the
pruning occur?

--
Jan Urbanski
GPG key ID: E583D7D2

ouden estin


