Re: Mailing list search engine: surprising missing results? - Mailing list pgsql-www

From Oleg Bartunov
Subject Re: Mailing list search engine: surprising missing results?
Date
Msg-id CAF4Au4yttKJ1KAP-cO+HMLQ2_66vmx0dLTBUbE4W8Aa64foafg@mail.gmail.com
Whole thread Raw
In response to Re: Mailing list search engine: surprising missing results?  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Mailing list search engine: surprising missing results?  (Laurenz Albe <laurenz.albe@cybertec.at>)
List pgsql-www


On Mon, Jan 24, 2022 at 11:47 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Bruce Momjian <bruce@momjian.us> writes:
> On Mon, Jan 24, 2022 at 08:27:41AM +0100, Laurenz Albe wrote:
>> The reason is that the 'moore' in 'boyer-moore' is stemmed, since it
>> is at the end of the word, while the 'moore' in 'Boyer-Moore-Horspool'
>> isn't:

> Wow, he showed me this problem earlier but I never suspected it was
> stemming issue because I never considered proper nowns could be
> stem-adjusted, but it is obvious they can.

I wonder if we should change that so that components of a compound
word are consistently stemmed the same way.


Something like this

SELECT to_tsvector('english', 'Boyer-Moore-Horspool');
                       to_tsvector
----------------------------------------------------------
 'boyer':2 'boyer-moore-horspool':1 'boyer-moore':1  'moore-horspool':1  'horspool':4 'moor':3
(1 row)




 

                        regards, tom lane




--
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company

pgsql-www by date:

Previous
From: Célestin Matte
Date:
Subject: Re: [PATCHES] pglister: make organization name generic
Next
From: Laurenz Albe
Date:
Subject: Re: Mailing list search engine: surprising missing results?