Re: full text indexing - Mailing list pgsql-general

From Gilles DAROLD
Subject Re: full text indexing
Msg-id 39D3007D.8B383C81@darold.net
In response to full text indexing  ("Poul L. Christiansen" <poulc@cs.auc.dk>)
Responses Re: full text indexing
List pgsql-general
Hi,

After looking at many ways to do fast full-text search (a flat field
with regexp matching, UDMSearch, and other things like contrib FTI),
I have developed my own indexer and fast search.
It is still specific to my database but can easily be ported to any other
database structure.
It also uses stop words.
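
To illustrate the general idea of a word index with stop words, here is a minimal Python sketch. It is not the lincat-indexer.pl code (which is Perl and was not posted here); the function names, the stop-word set, and the sample articles are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Hypothetical stop-word list; the real indexer reads its list from a file (-s).
STOP_WORDS = {"the", "a", "of", "and", "in"}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"\w+", text.lower())

def build_index(documents, stop_words=STOP_WORDS):
    """Map each non-stop word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in tokenize(text):
            if word not in stop_words:
                index[word].add(doc_id)
    return index

# Made-up sample data standing in for rows of the indexed table.
articles = {
    1: "The price of oil in the North Sea",
    2: "Oil and gas exports",
}
index = build_index(articles)
print(sorted(index["oil"]))   # [1, 2]
print("the" in index)         # False
```

Skipping stop words keeps very common words out of the index, which is where most of the size and lookup cost would otherwise go.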

The first load is very slow for a big database, but I have added incremental
indexing, so
subsequent loads are faster.
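
One common way to make later loads faster is to remember which rows have already been indexed and skip them. The Python sketch below shows that idea only; how the -i option of the actual script tracks its state is not described in this message, so the class and its fields are hypothetical.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"\w+", text.lower())

class IncrementalIndex:
    """Word -> document ids, remembering which documents were already indexed."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.indexed_ids = set()

    def update(self, documents):
        """Index only documents not seen before; return how many were added."""
        added = 0
        for doc_id, text in documents.items():
            if doc_id in self.indexed_ids:
                continue  # already indexed in an earlier load
            for word in tokenize(text):
                self.postings[word].add(doc_id)
            self.indexed_ids.add(doc_id)
            added += 1
        return added

idx = IncrementalIndex()
first = idx.update({1: "election results", 2: "football results"})
second = idx.update({1: "election results", 2: "football results",
                     3: "weather report"})
print(first, second)  # 2 1
```

The second call touches only the new document, so its cost depends on what changed rather than on the size of the whole table.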

Here is the usage of the indexer:

Usage: ./lincat-indexer.pl [-c -i file -h host -p port] -t table -f field -l language -u user -s file -d dbname
        -c        => drop search tables and indexes, reset all indexing
        -i file   => do incremental index with file
        -v        => print a dot for each word processed
        -h host   => database hostname [default: localhost]
        -p port   => database service port [default: 5432]
        -t table  => table to index
        -f field  => field to index
        -l fr     => language to use (can be fr, en, ...)
        -u user   => postgres user who need read grant on search tables
        -s file   => file with a list of all words not to be indexed
        -d dbname => name of the database
        -q        => quiet mode, do not print anything else than error message

I also have a search script that can be used to test the index, and from
which you can cut and paste
some code to add to your program.

Usage is: ./lincat-search.pl "essai test"

I don't remember exactly how it combines the search words (AND/OR), but that
depends on the results you want,
so hack it.
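
The difference between AND and OR matching over a word index can be sketched in Python as follows. This is an illustration only, not what lincat-search.pl does; the `search` function, its `mode` parameter, and the tiny index are assumptions made for the example.

```python
def search(index, query, mode="and"):
    """Return ids of documents matching all ("and") or any ("or") query words."""
    words = query.lower().split()
    postings = [index.get(word, set()) for word in words]
    if not postings:
        return set()
    if mode == "and":
        return set.intersection(*postings)
    return set.union(*postings)

# Made-up index: word -> ids of the documents containing it.
index = {"essai": {1, 2}, "test": {2, 3}}
print(sorted(search(index, "essai test", mode="and")))  # [2]
print(sorted(search(index, "essai test", mode="or")))   # [1, 2, 3]
```

AND gives fewer, more precise results; OR gives more results and usually needs some ranking on top.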

Of course it's Perl... Search is very fast!
And in any case you can easily hack it to match your needs.

Let me know if you are interested and I will post it to the list.

Regards,

Gilles DAROLD

"Poul L. Christiansen" wrote:

> Hi
>
> In my PostgreSQL database I have a lot of newspaper articles (size:
> 100 MB now, growing beyond 1 GB within a few months).
> I want to use full text indexing so that users can search the articles
> with a keyword and have the results in less than one second.
>
> How do I accomplish that?
>
> Does PostgreSQL have this feature?
>
> Which 3rd party indexing tools are available that easily interoperate
> with PostgreSQL?
>
> Thanks,
> Poul L. Christiansen
> Dynamic Paper

