Thread: spelling errors in query terms

spelling errors in query terms

From
Joel Rodrigues
Date:
Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

Anyone know what I'm talking about ?

Cheers,
Joel




_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T�l�charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1�re messagerie instantan�e de France


Re: spelling errors in query terms

From
"Craig O'Shannessy"
Date:
Hi Joel,

If your using java, and full text, fuzzy searching is an important part of
your app, you might want to check out Lucene.  I evaluated it for a
project, and it was very good (the project was a fuzzy search of huge
pages of text).

http://jakarta.apache.org/lucene/docs/index.html


On Mon, 8 Dec 2003, Joel Rodrigues wrote:

> Hi,
>
> I seem to recall once coming across & using functionality in PostgreSQL
> that allowed for some flexibility in the spelling of a query term. For
> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
> was even a 'looseness' factor of sorts available. I've spent a lot of
> time trying to find it in the docs and various articles & tutorials on
> my hard drive, even an hour on Google in vain. It was not
> contrib/fuzzystrmatch.
>
> Anyone know what I'm talking about ?
>
> Cheers,
> Joel
>
>
>
>
> _____________________________________________________________________
> Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
> http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 3: if posting/reading through Usenet, please send an appropriate
>       subscribe-nomail command to majordomo@postgresql.org so that your
>       message can get through to the mailing list cleanly
>


Re: spelling errors in query terms

From
Oleg Bartunov
Date:
On Mon, 8 Dec 2003, Joel Rodrigues wrote:

> Hi,
>
> I seem to recall once coming across & using functionality in PostgreSQL
> that allowed for some flexibility in the spelling of a query term. For
> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
> was even a 'looseness' factor of sorts available. I've spent a lot of
> time trying to find it in the docs and various articles & tutorials on
> my hard drive, even an hour on Google in vain. It was not
> contrib/fuzzystrmatch.

You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.


>
> Anyone know what I'm talking about ?
>
> Cheers,
> Joel
>
>
>
>
> _____________________________________________________________________
> Envie de discuter en "live" avec vos amis ? TИlИcharger MSN Messenger
> http://www.ifrance.com/_reloc/m la 1Хre messagerie instantanИe de France
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 3: if posting/reading through Usenet, please send an appropriate
>       subscribe-nomail command to majordomo@postgresql.org so that your
>       message can get through to the mailing list cleanly
>

    Regards,
        Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

Re: spelling errors in query terms

From
Joel Rodrigues
Date:
Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.


Cheers ! - Joel


On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:

> On Mon, 8 Dec 2003, Joel Rodrigues wrote:
>
>> Hi,
>>
>> I seem to recall once coming across & using functionality in PostgreSQL
>> that allowed for some flexibility in the spelling of a query term. For
>> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
>> was even a 'looseness' factor of sorts available. I've spent a lot of
>> time trying to find it in the docs and various articles & tutorials on
>> my hard drive, even an hour on Google in vain. It was not
>> contrib/fuzzystrmatch.
>
> You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
> which uses trigram.

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T�l�charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1�re messagerie instantan�e de France


Re: spelling errors in query terms

From
Joel Rodrigues
Date:
Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.


Cheers ! - Joel


On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:

> On Mon, 8 Dec 2003, Joel Rodrigues wrote:
>
>> Hi,
>>
>> I seem to recall once coming across & using functionality in PostgreSQL
>> that allowed for some flexibility in the spelling of a query term. For
>> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
>> was even a 'looseness' factor of sorts available. I've spent a lot of
>> time trying to find it in the docs and various articles & tutorials on
>> my hard drive, even an hour on Google in vain. It was not
>> contrib/fuzzystrmatch.
>
> You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
> which uses trigram.

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T�l�charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1�re messagerie instantan�e de France


Re: spelling errors in query terms

From
Joel Rodrigues
Date:
Thanks for the tip. However I use Python with psycopg to interface with
PostgreSQL, so unfortunately it's not an option.

It's really strange, I could swear I used a function within PostgreSQL
that could handle small spelling errors in the query term.

Cheers,
Joel

On Monday, December 8, 2003, at 11:34 , Craig O'Shannessy wrote:

> Hi Joel,
>
> If your using java, and full text, fuzzy searching is an important part
> of
> your app, you might want to check out Lucene.  I evaluated it for a
> project, and it was very good (the project was a fuzzy search of huge
> pages of text).
>
> http://jakarta.apache.org/lucene/docs/index.html

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T�l�charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1�re messagerie instantan�e de France