Re: new full text search configurations - Mailing list pgsql-hackers

From Emre Hasegeli
Subject Re: new full text search configurations
Date
Msg-id CAE2gYzzFT-jxYv2gRaQC-MVNvwpmYLr8NCFxYwjKOojPLP1vLg@mail.gmail.com
In response to new full text search configurations  (Oleg Bartunov <obartunov@gmail.com>)
List pgsql-hackers
> I checked new snowball site http://snowballstem.org/ and found several new
> stemmers appeared (as external contributions):
>
> Irish and Czech
> Object Pascal codegenerator for Snowball
> Two stemmers for Romanian
> Hungarian
> Turkish
> Armenian
> Basque (Euskera)
> Catalan
>
> Some of them we don't have in our list of default configurations. Since
> these are external, not official stemmers, it'd be nice if people look and
> test them. If they are fine, we can prepare new configurations for 9.6.

We already have configurations for the ones included in Snowball, namely
Romanian, Hungarian, and Turkish.  I don't know why the others are
listed on the page as external contributions but not included.  It might
be a good idea to wait for someone to commit them upstream.

I have checked the changes to the algorithms [1].  They don't seem
to have been updated much after 2007, but a new one for the Tamil
language was added recently.  It might be a good candidate for a new
configuration.

[1] https://github.com/snowballstem/snowball/commits/master/algorithms


