Home > mailing lists

Re: snowball ASCII stemmer configuration - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: snowball ASCII stemmer configuration
Date	June 19, 2020 13:44:20
Msg-id	1646699.1592574260@sss.pgh.pa.us Whole thread Raw
In response to	Re: snowball ASCII stemmer configuration (Peter Eisentraut <peter.eisentraut@2ndquadrant.com>)
List	pgsql-hackers

Tree view

Peter Eisentraut <peter.eisentraut@2ndquadrant.com> writes:
> Do we *have* to have an ASCII stemmer that corresponds to an actual 
> language?  Couldn't we use the simple stemmer or no stemmer at all?
> In my experience, ASCII text in, say, Russian or Greek will typically be 
> acronyms or brand names or the like, and there doesn't seem to be a 
> great need to stem that kind of thing.  Just doing nothing seems at 
> least as good.

Well, I have no horse in this race.  But the reason it's like this for
Russian is that Oleg, Teodor, and crew set it up that way ages ago.
I'd tend to defer to their opinion about what's the most usable
configuration for Russian.  You could certainly argue that the situation
is different for $other-language ... but without some hard evidence for
that position, making these cases all behave similarly seems like a
reasonable approach.

            regards, tom lane

pgsql-hackers by date:

From: Bruce Momjian
Date: 19 June 2020, 13:02:57
Subject: Re: Global snapshots

From: Tom Lane
Date: 19 June 2020, 14:00:51
Subject: Re: doing something about the broken dynloader.h symlink

Re: snowball ASCII stemmer configuration - Mailing list pgsql-hackers

Previous

Next