Re: english parser in text search: support for multiple words in the same position - Mailing list pgsql-hackers

From Robert Haas
Subject Re: english parser in text search: support for multiple words in the same position
Date
Msg-id AANLkTimmykArc7DwhM6tyOGCQnwzYM8D2CR0tES7vtRY@mail.gmail.com
Whole thread Raw
In response to Re: english parser in text search: support for multiple words in the same position  (Sushant Sinha <sushant354@gmail.com>)
List pgsql-hackers
On Wed, Sep 29, 2010 at 1:29 AM, Sushant Sinha <sushant354@gmail.com> wrote:
> Any updates on this?
>
>
> On Tue, Sep 21, 2010 at 10:47 PM, Sushant Sinha <sushant354@gmail.com>
> wrote:
>>
>> > I looked at this patch a bit.  I'm fairly unhappy that it seems to be
>> > inventing a brand new mechanism to do something the ts parser can
>> > already do.  Why didn't you code the url-part mechanism using the
>> > existing support for compound words?
>>
>> I am not familiar with compound word implementation and so I am not sure
>> how to split a url with compound word support. I looked into the
>> documentation for compound words and that does not say much about how to
>> identify components of a token. Does a compound word split by matching
>> with a list of words? If yes, then we will not be able to use that as we
>> do not know all the words that can appear in a url/host/email/file.

It seems to me that you need to familiarize yourself with this stuff
and then post an analysis, or a new patch.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise Postgres Company


pgsql-hackers by date:

Previous
From: Gurjeet Singh
Date:
Subject: Patch to reindex primary keys
Next
From: Greg Stark
Date:
Subject: Re: Path question