Tsearch2 custom dictionaries - Mailing list pgsql-general

From psql-mail@freeuk.com
Subject Tsearch2 custom dictionaries
Date
Msg-id E19klSc-000OPk-00@buckaroo.freeuk.net
Whole thread Raw
Responses Re: Tsearch2 custom dictionaries  (Oleg Bartunov <oleg@sai.msu.su>)
List pgsql-general
Part1.

I have created a dictionary called 'webwords' which checks all words
and curtails them to 300 chars (for now)

after running
make
make install

I then copied the lib_webwords.so into my $libdir

I have run

psql mybd < dict_webwords.sql

The tutorial shows how to install the intdict for integer types. How
should i install my custom dictionary?


Part2.

The dictionary I am trying to create is to be used for searching
multilingual text. My aim is to have fast search over all text, but
ignore binary encoded data which is also present. (i will probably move
to ignoring long words in the text eventually).
What is the best approach to tackle this problem?
As the text can be multilingual I don't think stemming is possible?
I also need to include many none-standard words in the index such as
urls and message ID's contained in the text.

I get the feeling that building these indexs will by no means be an
easy task so any suggestions will be gratefully recieved!

Thanks...

--

pgsql-general by date:

Previous
From: Hubert Fröhlich
Date:
Subject: Re: crashing Xeon?
Next
From: Jonathan Bartlett
Date:
Subject: Re: ext3 block size