Re: How to find double entries - Mailing list pgsql-sql

From Craig Ringer
Subject Re: How to find double entries
Msg-id 48057F21.30607@postnewspapers.com.au
In response to How to find double entries  (Andreas <maps.on@gmx.net>)
List pgsql-sql
Andreas wrote:
> Hi,
> 
> how can I find double entries in varchar columns where the content is
> not 100% identical because of a spelling error or the person considered
> it "looked nicer" that way?

While eliminating near-duplicates as part of converting a legacy data
set to PostgreSQL, I found the `fuzzystrmatch' contrib module immensely
helpful. It provides fuzzy string comparison functions such as
soundex(), metaphone() and levenshtein().

http://www.postgresql.org/docs/current/static/fuzzystrmatch.html
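
As a rough sketch of how the module could be applied to this problem, the
query below lists pairs of rows whose values are within a small edit
distance of each other. The table "customers" and its columns "id" and
"name" are hypothetical, the threshold of 2 is an arbitrary starting point,
and CREATE EXTENSION assumes PostgreSQL 9.1 or later (older releases load
the contrib SQL script instead).

```sql
-- Assumption: PostgreSQL 9.1+; on older releases, run the contrib
-- module's SQL script in the target database instead.
CREATE EXTENSION IF NOT EXISTS fuzzystrmatch;

-- "customers", "id" and "name" are hypothetical names for illustration.
SELECT a.id,
       a.name,
       b.id   AS other_id,
       b.name AS other_name,
       levenshtein(lower(a.name), lower(b.name)) AS distance
FROM customers a
JOIN customers b
  ON a.id < b.id                                      -- visit each pair once
WHERE levenshtein(lower(a.name), lower(b.name)) <= 2  -- threshold is a guess
ORDER BY distance, a.name;
```

The self-join compares every row with every other row, so it may be slow
on large tables. Replacing the WHERE condition with
soundex(a.name) = soundex(b.name) would instead match names that sound
alike, which tends to suit English names best. Either way, the results are
candidates to review by hand rather than confirmed duplicates.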

--
Craig Ringer

