Thread: Tsearch2 slovak UTF-8

Tsearch2 slovak UTF-8

From
"Jan Sunavec"
Date:
Hi All

I have PostgreSQL 8.2.6 running on Windows. I tryed install slovak
dictionary for tsearch2.

INSERT INTO pg_ts_dict
VALUES('ispell_slovak','spell_init(internal)','DictFile="C:/slovak_utf8.dict",
AffFile="C:/slovak_utf8.aff", StopFile="C:/slovak_utf8.stop"',
'spell_lexize(internal,internal,integer)','Slovak ISpell. UTF8 Encoding');

INSERT INTO pg_ts_cfg VALUES('utf8_slovak', 'default', 'Slovak_Slovakia');


INSERT INTO pg_ts_cfgmap
   SELECT 'utf8_slovak',tok_alias,dict_name
     FROM pg_ts_cfgmap WHERE ts_name='utf8_russian';

UPDATE pg_ts_cfgmap SET dict_name='{ispell_slovak,simple}'
   WHERE ('ru_stem'=ANY(dict_name) OR 'ru_stem_utf8'=ANY(dict_name) OR
'en_stem' = ANY(dict_name))
     AND ts_name='utf8_slovak';

There are few problems.
1. It's not possible use path DictFile="C:/Program
Files/PostgreSQL/8.2/slovak_utf8.dict"
2. I always got "ERROR:  Affix parse error at 63 line

In statement:
SELECT to_tsvector('utf8_slovak', 'žltá vody');"

Second problem is most important. On that line is first UTF8 character. I
read that it's necessary install patch tsearch_snowball_82.gz. But I am
running under windows and I am not so skill for windows compilation. I
thought that lastest PostgreSQL version has this patch included.

Thanks for help.

   Jan Sunavec