Re: Use of ISpell dictionaries with tsearch2 - what is the point? - Mailing list pgsql-general

From Don Walker
Subject Re: Use of ISpell dictionaries with tsearch2 - what is the point?
Date
Msg-id 001901c66b00$90c35410$dbd849c6@donxp
In response to Use of ISpell dictionaries with tsearch2 - what is the point?  ("Don Walker" <don.walker@versaterm.com>)
List pgsql-general
I realized shortly after I sent this email that I could use a synonym
dictionary to solve problem #2. To construct it myself I'd have to determine
the common misspellings and create synonyms for them. So I have two more
questions:

2.1 Are there any canned synonym dictionaries available that deal with
misspellings in English and/or French?

2.2 Are there any clever linguistic algorithms that can partly solve the
same problem?
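
For illustration, the hand-built approach described above could be sketched
as follows. This is a minimal sketch, assuming the synonym dictionary reads a
plain file with one "word synonym" pair per line; the misspelling list and
the helper names (MISSPELLINGS, synonym_lines, normalize) are hypothetical
and not taken from any existing dictionary.

```python
# Hypothetical misspelling -> correct-spelling pairs (illustrative only,
# not taken from any published dictionary).
MISSPELLINGS = {
    "recieve": "receive",
    "seperate": "separate",
    "adress": "address",
}


def synonym_lines(pairs):
    """Return one 'word synonym' line per pair, sorted for stable output."""
    return [f"{wrong} {right}" for wrong, right in sorted(pairs.items())]


def normalize(token, pairs):
    """Map a known misspelling to its correction; pass other tokens through."""
    lowered = token.lower()
    return pairs.get(lowered, lowered)


if __name__ == "__main__":
    # Lines in the assumed synonym-file format, one pair per line.
    print("\n".join(synonym_lines(MISSPELLINGS)))
```

Only misspellings that appear in the list are corrected, so the hard part
remains collecting the common misspellings in the first place.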

-----Original Message-----
From: pgsql-general-owner@postgresql.org
[mailto:pgsql-general-owner@postgresql.org] On Behalf Of Don Walker
Sent: April 28, 2006 15:11
To: pgsql-general@postgresql.org
Subject: [GENERAL] Use of ISpell dictionaries with tsearch2 - what is the
point?


I'm new to using tsearch2 and am trying to understand why I would want to
use an ISpell dictionary before the stemming dictionary. I'd originally
hoped that ISpell would suggest corrections for misspelled words as the
documents that I will be indexing will contain a lot of spelling mistakes.
From what I now understand, ISpell dictionaries only recognize properly
spelled words. This means that any misspelled word will be handled by the
stemming dictionary and usually just passed through as is.
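
The fall-through behaviour described above can be sketched with toy
stand-ins. This is only an illustration under stated assumptions: the word
list, the crude suffix stripper, and the function names are hypothetical and
do not reproduce the real ISpell or stemming dictionaries. It shows only that
a token the first dictionary does not recognize is handed to the next one.

```python
# Toy dictionary chain: each dictionary returns a lexeme, or None when it
# does not recognize the token. All entries and names are hypothetical.
KNOWN_WORDS = {"stories": "story", "story": "story"}  # ISpell-like word list


def ispell_like(token):
    """Recognize only correctly spelled words from the word list."""
    return KNOWN_WORDS.get(token)


def stemmer_like(token):
    """Crude suffix stripping; accepts every token, so it ends the chain."""
    for suffix in ("ing", "es", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def lexize(token, chain=(ispell_like, stemmer_like)):
    """Try each dictionary in order and return the first non-None result."""
    token = token.lower()
    for dictionary in chain:
        result = dictionary(token)
        if result is not None:
            return result
    return token


if __name__ == "__main__":
    print(lexize("stories"))  # recognized by the ISpell-like dictionary
    print(lexize("stroy"))    # misspelling falls through to the stemmer
```

In this sketch the misspelling "stroy" is indexed unchanged, so a query for
"story" would not match it.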

1. If I am correct about this then what is the point of using the ISpell
dictionary in the first place?

2. Is there a solution for correcting spelling mistakes in the documents you
index? I have seen the readme files for pg_trgm,
http://www.sai.msu.su/~megera/postgres/gist/, which would allow me to
suggest other terms for a query if the misspellings were common enough. I'd
rather fix the problem at index time so that querying with the proper term
would find any misspelled terms (within reason).
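
The trigram idea behind pg_trgm can be sketched roughly as below. This is a
minimal sketch, not the module's implementation: it assumes single-word
input, pads each word with two leading spaces and one trailing space, and
scores similarity as shared trigrams over all distinct trigrams. The suggest
helper and the 0.3 threshold are assumptions for illustration.

```python
def trigrams(word):
    """Return the set of 3-character substrings of the padded, lowercased word."""
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def similarity(a, b):
    """Shared trigrams divided by all distinct trigrams of both words."""
    ta, tb = trigrams(a), trigrams(b)
    return len(ta & tb) / len(ta | tb)


def suggest(term, vocabulary, threshold=0.3):
    """Return vocabulary words at or above the threshold, best match first."""
    scored = [(similarity(term, word), word) for word in vocabulary]
    return [word for score, word in sorted(scored, reverse=True)
            if score >= threshold]


if __name__ == "__main__":
    print(suggest("recieve", ["receive", "index"]))
```

This works at query time. Applying it at index time would mean running the
same comparison against a list of correctly spelled words for every token
indexed.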



