Re: Flexible configuration for full-text search - Mailing list pgsql-hackers

From Dmitry Dolgov
Subject Re: Flexible configuration for full-text search
Date
Msg-id CA+q6zcU1ruksx9c-QDvLJOnZe7sRq1Z7Bg5ukXhUJpqZJGMK1w@mail.gmail.com
Whole thread Raw
In response to Re: Flexible configuration for full-text search  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Flexible configuration for full-text search
List pgsql-hackers
> On Wed, Aug 29, 2018 at 10:38 AM Aleksandr Parfenov <a.parfenov@postgrespro.ru> wrote:
>
> On Tue, 28 Aug 2018 12:40:32 +0700
> Aleksandr Parfenov <a.parfenov@postgrespro.ru> wrote:
>
> >On Fri, 24 Aug 2018 18:50:38 +0300
> >Alexander Korotkov <a.korotkov@postgrespro.ru> wrote:
> >>Agreed, backward compatibility is important here.  Probably we should
> >>leave old dictionaries for that.  But I just meant that if we
> >>introduce new (better) way of stop words handling and encourage users
> >>to use it, then it would look strange if default configurations work
> >>the old way...
> >
> >I agree with Alexander. The only drawback I see is that after addition
> >of new dictionaries, there will be 3 dictionaries for each language:
> >old one, stop-word filter for the language, and stemmer dictionary.
>
> During work on the new version of the patch, I found an issue in
> proposed syntax. At the beginning of the conversation, there was a
> suggestion to split stop word filtering and words normalization. At this
> stage of development, we can use a different dictionary for stop word
> detection, but if we drop the word, the word counter wouldn't increase
> and the stop word will be processed as an unknown word.

Maybe it would be better if you or some of your colleagues (Alexander, Arthur?)
will post this new version, because the current one has some conflicts - so it
would be easier for a reviewers. For now I'll move it to the next CF.


pgsql-hackers by date:

Previous
From: Peter Eisentraut
Date:
Subject: Re: PostgreSQL Limits and lack of documentation about them.
Next
From: Stephen Frost
Date:
Subject: Re: pg_config wrongly marked as not parallel safe?