Home > mailing lists

Re: english parser in text search: support for multiple words in the same position - Mailing list pgsql-hackers

From	Markus Wanner
Subject	Re: english parser in text search: support for multiple words in the same position
Date	August 2, 2010 10:27:40
Msg-id	4C56C78B.2060601@bluegap.ch Whole thread Raw
In response to	Re: english parser in text search: support for multiple words in the same position (Sushant Sinha <sushant354@gmail.com>)
List	pgsql-hackers

Tree view

Hi,

On 08/02/2010 03:12 PM, Sushant Sinha wrote:
> The current text parser already returns url and url_path. That already
> increases the number of unique tokens.

Well, I think I simply turned that off to be able to search for plain 
words. It still works for complete URLs, those are just treated like 
text, then.

> Earlier people have expressed the need to index urls/emails and
> currently the text parser already does so. Reverting that would be a
> regression of functionality. Further, a ranking function can take
> advantage of direct match of a token.

That's a point, yes. However, simply making the same string turn up 
twice in the tokenizer's output doesn't sound like the right solution to 
me. Especially considering that the query parser uses the very same 
tokenizer.

Regards

Markus Wanner

pgsql-hackers by date:

From: Etienne Dube
Date: 02 August 2010, 10:24:56
Subject: Re: Postgres as Historian

From: Robert Haas
Date: 02 August 2010, 10:33:08
Subject: Re: english parser in text search: support for multiple words in the same position

Re: english parser in text search: support for multiple words in the same position - Mailing list pgsql-hackers

Previous

Next