Re: TSearch queries with multiple languages - Mailing list pgsql-general

From Tom Lane
Subject Re: TSearch queries with multiple languages
Date
Msg-id 19463.1234483622@sss.pgh.pa.us
Whole thread Raw
In response to TSearch queries with multiple languages  (Gordon Callan <gordon_callan@hotmail.com>)
Responses Re: TSearch queries with multiple languages
List pgsql-general
Gordon Callan <gordon_callan@hotmail.com> writes:
> Next we create an index on the ts_vector column:
>  CREATE INDEX node_ts_body on node USING gin(ts_body);

> From the documentation, it seems this index will know what config each row has.

No, actually the index doesn't know and doesn't care.  The tsvector
representation is language-independent --- it contains "just strings".
All the language-dependent processing happens during reduction of the
document text to tsvector (or reduction of a search string to tsquery).
So if words from different languages happen to reduce to the same
string, searches in both languages will find that entry.

Usually this works the way people want; but if not, you could add an
additional WHERE condition to your queries to match only documents in
the desired language.

            regards, tom lane

pgsql-general by date:

Previous
From: Gordon Callan
Date:
Subject: TSearch queries with multiple languages
Next
From: Craig Ringer
Date:
Subject: Re: audit table