Re: request a new feature in fuzzystrmatch - Mailing list pgsql-hackers

From Atri Sharma
Subject Re: request a new feature in fuzzystrmatch
Date
Msg-id CAOeZVifPtHw2e79n9HHwx-B_X6UTpZTsoJb1W5mEtnc6K5fOsA@mail.gmail.com
Whole thread Raw
In response to Re: request a new feature in fuzzystrmatch  (Liming Hu <dawninghu@gmail.com>)
Responses Re: request a new feature in fuzzystrmatch
List pgsql-hackers
On Fri, May 17, 2013 at 8:51 PM, Liming Hu <dawninghu@gmail.com> wrote:
> Hi Atri,
>
> Thanks for the quick response.
>
> levenshtein edit distance defines operations of: insertion, deletion,
> modification.
> Levenshtein-Damerau edit distance defines operations of:  insertion,
> deletion, modification
> and transposition (
> it will be two operations in levenshtein edit distance:
> one deletion, and one insertion,
> ).
>
> In spelling checker/corrector, i.e.
>  levenshtein('cta', 'cat') will return 2.
> but Levenshtein-Damerau('cta', 'cat') will return 1.
> if the maximum error is 1, we can not get 'cat'.
>
> In practice, Levenshtein-Damerau is more widely used than Levenshtein.
> I believe you notice "Google automated search suggestions", they use
> Levenshtein-Damerau.
>

Sounds interesting. How can we build this over our current
implementation, or do we need to build it from scratch?

Regards,

Atri



pgsql-hackers by date:

Previous
From: Liming Hu
Date:
Subject: Re: request a new feature in fuzzystrmatch
Next
From: Liming Hu
Date:
Subject: Re: request a new feature in fuzzystrmatch