Re: contrib/levenshtein() has a bug? - Mailing list pgsql-general

From Ben
Subject Re: contrib/levenshtein() has a bug?
Date
Msg-id Pine.LNX.4.64.0609281421510.21293@GRD.cube42.tai.silentmedia.com
In response to Re: contrib/levenshtein() has a bug?  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-general
Ah, yes, you are correct.

Hm, it's too bad levenshtein() is ASCII-only.

On Thu, 28 Sep 2006, Tom Lane wrote:

> Ben <bench@silentmedia.com> writes:
>> The levenshtein function from contrib/fuzzystrmatch.sql has a max arg
>> length of 255. OK, that's cool. But check this out:
>
>> mbrainz_db=> select max(length(name)) from public.track;
>>   max
>> -----
>>   255
>> (1 row)
>
>> mbrainz_db=> select levenshtein(name,'foo') from public.track;
>> ERROR:  argument exceeds max length: 255
>
>> That seems odd.
>
> length() measures in characters whereas the limit in question is being
> enforced in bytes.  You got any multibyte characters in there?
>
> (It looks to me like levenshtein() is utterly non-multibyte-aware,
> which is probably a bug in itself.)
>
>             regards, tom lane
>
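
To illustrate the character/byte mismatch Tom describes, here is a minimal sketch outside the database. It assumes the column data is stored as UTF-8, which the thread does not state; the helper name char_and_byte_length and the sample strings are made up for the example. In PostgreSQL itself, octet_length() reports the byte count that corresponds to the second number below.

```python
from typing import Tuple

# Limit taken from the error message quoted above; per Tom's reply it is
# enforced in bytes, not characters.
MAX_BYTES = 255


def char_and_byte_length(s: str, encoding: str = "utf-8") -> Tuple[int, int]:
    """Return (character count, byte count) of s in the given encoding."""
    return len(s), len(s.encode(encoding))


# Hypothetical sample values: both are exactly 255 characters long.
ascii_name = "a" * 255
accented_name = "a" * 254 + "\u00e9"  # U+00E9 takes 2 bytes in UTF-8

for name in (ascii_name, accented_name):
    chars, nbytes = char_and_byte_length(name)
    # A byte-based check rejects the second string even though a
    # character-based length() reports 255 for both.
    print(chars, nbytes, "too long" if nbytes > MAX_BYTES else "ok")
```

So a single multibyte character in a 255-character name is enough to push it past a 255-byte limit, which would match the error seen in the query above.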
