Tsearch2 slovak UTF-8 - Mailing list pgsql-general

From Jan Sunavec
Subject Tsearch2 slovak UTF-8
Date
Msg-id op.t49ybjyetuteoe@akela-pc
Whole thread Raw
List pgsql-general
Hi All

I have PostgreSQL 8.2.6 running on Windows. I tryed install slovak
dictionary for tsearch2.

INSERT INTO pg_ts_dict
VALUES('ispell_slovak','spell_init(internal)','DictFile="C:/slovak_utf8.dict",
AffFile="C:/slovak_utf8.aff", StopFile="C:/slovak_utf8.stop"',
'spell_lexize(internal,internal,integer)','Slovak ISpell. UTF8 Encoding');

INSERT INTO pg_ts_cfg VALUES('utf8_slovak', 'default', 'Slovak_Slovakia');


INSERT INTO pg_ts_cfgmap
   SELECT 'utf8_slovak',tok_alias,dict_name
     FROM pg_ts_cfgmap WHERE ts_name='utf8_russian';

UPDATE pg_ts_cfgmap SET dict_name='{ispell_slovak,simple}'
   WHERE ('ru_stem'=ANY(dict_name) OR 'ru_stem_utf8'=ANY(dict_name) OR
'en_stem' = ANY(dict_name))
     AND ts_name='utf8_slovak';

There are few problems.
1. It's not possible use path DictFile="C:/Program
Files/PostgreSQL/8.2/slovak_utf8.dict"
2. I always got "ERROR:  Affix parse error at 63 line

In statement:
SELECT to_tsvector('utf8_slovak', 'žltá vody');"

Second problem is most important. On that line is first UTF8 character. I
read that it's necessary install patch tsearch_snowball_82.gz. But I am
running under windows and I am not so skill for windows compilation. I
thought that lastest PostgreSQL version has this patch included.

Thanks for help.

   Jan Sunavec


pgsql-general by date:

Previous
From: Sam Mason
Date:
Subject: handling of COUNT(record) vs IS NULL
Next
From: Calum
Date:
Subject: Bug (?) in JDB