Re: Fuzzy string matching of product names - Mailing list pgsql-general

From Dimitri Fontaine
Subject Re: Fuzzy string matching of product names
Msg-id m2d3y8kaag.fsf@hi-media.com
In response to Re: Fuzzy string matching of product names  (Leif Biberg Kristensen <leif@solumslekt.org>)
List pgsql-general
Leif Biberg Kristensen <leif@solumslekt.org> writes:
> On Monday 5. April 2010 22.00.41 Peter Geoghegan wrote:
>> similar they sound. How can that actually be applied to get the
>> functionality that I've described?
>
> I've got a similar problem in my 18th century research, when clerks usually
> took pride in being able to spell a name in any number of ways. I've landed on
> a solution where I'm sending search strings to SIMILAR TO. I usually get far
> too many hits, but it's much easier to browse through 100 hits than the entire
> dataset which is approaching 60,000 records.

In both your cases I'd play with trigram search. The idea is dead simple
and it performs really well. It's the poor man's Full Text Search, but
for catalog lookups I suppose it's exactly what you want:

  http://www.postgresql.org/docs/8.3/static/pgtrgm.html
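
To make the idea concrete, here is a rough Python sketch of trigram
matching. It is only an illustration, not the pg_trgm implementation: the
function names (trigrams, similarity, best_matches) and the 0.3 threshold
are assumptions chosen for this example, and the padding and scoring only
approximate what the module does. In the database itself you would rely on
pg_trgm's similarity() function and its % operator instead.

```python
import re


def trigrams(text):
    """Return the set of 3-character substrings of each padded word.

    Assumption: words are lowercased and padded with two leading spaces
    and one trailing space, roughly as the pg_trgm docs describe.
    """
    grams = set()
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        padded = "  " + word + " "
        for i in range(len(padded) - 2):
            grams.add(padded[i:i + 3])
    return grams


def similarity(a, b):
    """Score two strings from 0.0 to 1.0: shared trigrams over all trigrams."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def best_matches(query, candidates, threshold=0.3):
    """Return (score, candidate) pairs at or above threshold, best first.

    The threshold value is a hypothetical default for this sketch.
    """
    scored = [(similarity(query, c), c) for c in candidates]
    return sorted([(s, c) for s, c in scored if s >= threshold], reverse=True)


if __name__ == "__main__":
    names = ["Christensen", "Olsen", "Kristenson"]
    for score, name in best_matches("Kristensen", names):
        print(f"{score:.2f}  {name}")
```

Spelling variants share most of their trigrams, so they score high even
when no single rule (same first letter, same length) would catch them,
while unrelated names fall below the threshold and drop out.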

Regards,
--
dim
