Re: something better than pgtrgm? - Mailing list pgsql-general

From Andrew Sullivan
Subject Re: something better than pgtrgm?
Date
Msg-id 20121009151601.GH594@crankycanuck.ca
Whole thread Raw
In response to Re: something better than pgtrgm?  (Willy-Bas Loos <willybas@gmail.com>)
Responses Re: something better than pgtrgm?  (Willy-Bas Loos <willybas@gmail.com>)
List pgsql-general
On Tue, Oct 09, 2012 at 03:54:35PM +0200, Willy-Bas Loos wrote:
>
> >  If so, I
> > can almost imagine a way this could work
> >
>
> Great! How?

Well, it involves very large tables.  But basically, you work out a
"variant" table for any language you like, and then query across it
with subsets of the trigrams you were just working with.  It probably
sucks in performance, but at least you're likely to get valid
sequences this way.

For inspiration on this (and why I have so much depressing news on the
subject of internationalization in a multi-script and multi-lingual
environment), see RFC 3743 and RFC 4290.  These are related (among
other things) to how to make "variants" of different DNS labels
somehow hang together.  The problem is not directly related to what
you're working on, but it's a similar sort of problem: people have
rough ideas of what they're entering, and they need an exact match.
You have the good fortune of being able to provide them with a hint!
I wish I were in your shoes.

A

--
Andrew Sullivan
ajs@crankycanuck.ca


pgsql-general by date:

Previous
From: Tom Lane
Date:
Subject: Re: Dump/restore indexes and functions in public schema
Next
From: José Pedro Santos
Date:
Subject: PostgreSQL and WMS/WFS Service