Re: Dictionary chaining and stop words - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Dictionary chaining and stop words
Date
Msg-id 317.1188398575@sss.pgh.pa.us
Whole thread Raw
In response to Dictionary chaining and stop words  ("Heikki Linnakangas" <heikki@enterprisedb.com>)
List pgsql-hackers
"Heikki Linnakangas" <heikki@enterprisedb.com> writes:
> There's clearly need for transforming a word and passing on the
> transformed version to the next dictionary. dict_thesaurus does exactly
> that by supporting a subdictionary which is called before invoking the
> thesaurus, but it should be generic capability not specific to any
> dictionary. Let's modify the lexize API so that a dictionary can:
> - Accept the word (and possibly input with something else)
> - Reject the word
> - Transform word into another (or pass on as is)

This doesn't seem to be enough to solve thesaurus' problem though.
The difficulty there is that (1) it wants to look at several words
at once, (2) it wants to know which words were rejected as stopwords.
If filtering happens before it then how can it do that?
        regards, tom lane


pgsql-hackers by date:

Previous
From: Andrew Dunstan
Date:
Subject: Re: initdb failed on Windows 2000
Next
From: NikhilS
Date:
Subject: StringInfo misc. issues