Re: pg_trgm - Mailing list pgsql-hackers

From Tom Lane
Subject Re: pg_trgm
Date
Msg-id 14655.1274969745@sss.pgh.pa.us
Whole thread Raw
In response to Re: pg_trgm  (Tatsuo Ishii <ishii@postgresql.org>)
Responses Re: pg_trgm
List pgsql-hackers
Tatsuo Ishii <ishii@postgresql.org> writes:
>> It's not a problem, it's just pilot error, or possibly inadequate
>> documentation.  pg_trgm uses the locale's definition of "alpha",
>> "digit", etc.  In C locale only basic ASCII letters and digits will be
>> recognized as word constituents.

> That means there is no chance to make pg_trgm work with multibyte + C
> locale?  If so, I will leave pg_trgm as it is and provide private
> patches for those who need the functionality.

Exactly what do you consider to be the missing functionality?
You need a notion of word vs non-word character from somewhere,
and the locale setting is the standard place to get that.  The
core text search functionality behaves the same way.
        regards, tom lane


pgsql-hackers by date:

Previous
From: Robert Haas
Date:
Subject: Re: Streaming Replication: Checkpoint_segment and wal_keep_segments on standby
Next
From: Tatsuo Ishii
Date:
Subject: Re: pg_trgm