Re: gsoc, text search selectivity and dllist enhancments - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: gsoc, text search selectivity and dllist enhancments
Date	July 4, 2008 15:54:03
Msg-id	23365.1215186836@sss.pgh.pa.us
In response to	Re: gsoc, text search selectivity and dllist enhancments ("Heikki Linnakangas" <heikki@enterprisedb.com>)
Responses	Re: gsoc, text search selectivity and dllist enhancments ("Heikki Linnakangas" <heikki@enterprisedb.com>)
List	pgsql-hackers

"Heikki Linnakangas" <heikki@enterprisedb.com> writes:
> Tom Lane wrote:
>> The data structure I'd suggest is a simple array of pointers
>> to the underlying hash table entries.  Since you have a predetermined
>> maximum number of lexemes to track, you can just palloc the array once
>> --- you don't need the expansibility properties of a list. 

> The number of lexemes isn't predetermined. It's 2 * (longest tsvector 
> seen so far), and we don't know beforehand how long the longest tsvector is.

Hmm, I had just assumed without looking too closely that it was stats
target times a fudge factor.  What is the rationale for doing it as
above?  I don't think I like the idea of the limit varying over the
course of the scan --- that means that lexemes in different places
in the input will have significantly different probabilities of
surviving to the final result.
        regards, tom lane
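
The concern about a limit that varies during the scan can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the algorithm from the patch under discussion: the `track` function, its evict-the-smallest-count policy, the toy stream, and the specific limits (2, 8, and 5) are all hypothetical and chosen purely to make the effect visible. It compares a limit that grows partway through the input with a limit that is fixed up front.

```python
def track(stream, limit_for):
    """Count lexemes while keeping at most limit_for(pos) entries.

    Hypothetical pruning policy: when the table is full and a new lexeme
    arrives, evict the entry with the smallest count (on ties, the entry
    that was inserted earliest).
    """
    counts = {}
    for pos, lexeme in enumerate(stream):
        if lexeme in counts:
            counts[lexeme] += 1
            continue
        limit = limit_for(pos)
        while len(counts) >= limit:
            # min() returns the first minimal key in insertion order
            victim = min(counts, key=counts.get)
            del counts[victim]
        counts[lexeme] = 1
    return counts


def make_stream():
    """Two equally frequent lexemes, one per half, each mixed with noise."""
    stream = []
    for name, noise_prefix in (("early", "n"), ("late", "m")):
        noise = iter(f"{noise_prefix}{i}" for i in range(9))
        for _ in range(3):
            stream.append(name)
            stream.extend(next(noise) for _ in range(3))
    return stream


stream = make_stream()

# Limit grows mid-scan, as it would if a longer tsvector showed up late.
growing = track(stream, lambda pos: 2 if pos < 12 else 8)

# Limit fixed before the scan starts.
fixed = track(stream, lambda pos: 5)

print("growing limit:", growing.get("early"), growing.get("late"))
print("fixed limit:  ", fixed.get("early"), fixed.get("late"))
```

In this toy input "early" and "late" each occur three times, yet with the growing limit only "late" survives with its full count, because "early" is repeatedly evicted while the table is still small. With the fixed limit both lexemes are retained with a count of 3.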
