Full text search advice requested - Mailing list pgsql-general

From Johann Spies
Subject Full text search advice requested
Date
Msg-id 20120712104200.GB18111@sun.ac.za
Whole thread Raw
List pgsql-general
I have a table with bibliometric information on published articles.
Fields of interest for full text searches are the 'title' and 'abstract'
fields.

Those fields can contain several languages but most of the entries use
English. A grouped query on the 'language' field reveals that the
following languages are involved:

Afrikaans
Chinese
Dutch
English
French
Gaelic (?)
German
Hungarian
Italian
Japanese
Korean
Polish
Portuguese
Rumanian
Russian
Slovene
Sotho
Spanish
Turkish
Xhosa
Zulu

Now my questions:

1. Is it possible at all to use full text search in such a setup?
2. If so, how would I approach the different languages in indexing and
   querying.
3. How do I ask postgresql which dictionaries are already available in
   the installation for full text search?
4. If full text searches cannot be utilised in such a setup, can
   trgm-related indexing using 'similarity' be a replacement?  I think
   not.

Regards
Johann
--
Johann Spies                            Telefoon: 021-808 4699
Databestuurder /  Data manager

Sentrum vir Navorsing oor Evaluasie, Wetenskap en Tegnologie
Centre for Research on Evaluation, Science and Technology
Universiteit Stellenbosch.

     "Delight thyself also in the LORD: and he shall give
      thee the desires of thine heart."
                                  Psalms 37:4
E-pos vrywaringsklousule

Hierdie e-pos mag vertroulike inligting bevat en mag regtens geprivilegeerd wees en is slegs bedoel vir die persoon aan
wiedit geadresseer is. Indien u nie die bedoelde ontvanger is nie, word u hiermee in kennis gestel dat u hierdie
dokumentgeensins mag gebruik, versprei of kopieer nie. Stel ook asseblief die sender onmiddellik per telefoon in kennis
envee die e-pos uit. Die Universiteit aanvaar nie aanspreeklikheid vir enige skade, verlies of uitgawe wat voortspruit
uithierdie e-pos en/of die oopmaak van enige l��s aangeheg by hierdie e-pos nie. 

E-mail disclaimer

This e-mail may contain confidential information and may be legally privileged and is intended only for the person to
whomit is addressed. If you are not the intended recipient, you are notified that you may not use, distribute or copy
thisdocument in any manner whatsoever. Kindly also notify the sender immediately by telephone, and delete the e-mail.
TheUniversity does not accept liability for any damage, loss or expense arising from this e-mail and/or accessing any
filesattached to this e-mail. 

pgsql-general by date:

Previous
From: Craig Ringer
Date:
Subject: Re: PostgreSQL limitations question
Next
From: Sachin Srivastava
Date:
Subject: Re: question about installation