Re: Tsearch2 Dutch snowball stemmer in PG8.1 - Mailing list pgsql-general

From Oleg Bartunov
Subject Re: Tsearch2 Dutch snowball stemmer in PG8.1
Date
Msg-id Pine.LNX.4.64.0710031911200.3304@sn.sai.msu.ru
Whole thread Raw
In response to Re: Tsearch2 Dutch snowball stemmer in PG8.1  (Alban Hertroys <a.hertroys@magproductions.nl>)
List pgsql-general
On Wed, 3 Oct 2007, Alban Hertroys wrote:

> Alban Hertroys wrote:
>> The only odd thing is that to_tsvector('dutch', 'some dutch text') now
>> returns '|' for stop words...
>>
>> For example:
>>  select to_tsvector('nederlands', 'De beste stuurlui staan aan wal');
>>                   to_tsvector
>> ------------------------------------------------
>>  '|':1,5 'bes':2 'wal':6 'staan':4 'stuurlui':3
>
> I found the cause. The stop words list I found contained comments
> prefixed by '|' signs. Removing the contents and recreating the database
> solved the problem. Just updating the reference didn't seem to help...

you need to recreate tsvector field and index, after changing any dicts.

>
> There's undoubtedly some cleaner way to replace the stop words list, but
> at the current stage of our project this was the simplest to achieve.
>
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

pgsql-general by date:

Previous
From: Oleg Bartunov
Date:
Subject: Re: Tsearch2 Dutch snowball stemmer in PG8.1
Next
From: Erik Jones
Date:
Subject: Re: pg_cancel_backend() does not work with buzz queries