Home > mailing lists

Large Text Search Help - Mailing list pgsql-performance

From	psql-mail@freeuk.com
Subject	Large Text Search Help
Date	October 14, 2003 16:44:58
Msg-id	E1A7GXx-0007J5-PM@buckaroo.freeuk.net Whole thread Raw
Responses	Re: Large Text Search Help
List	pgsql-performance

Tree view

Hi,
I am trying to design a large text search database.

It will have upwards of 6 million documents, along with meta data on
each.

I am currently looking at tsearch2 to provide fast text searching and
also playing around with different hardware configurations.

1. With tsearch2 I get very good query times up until I insert more
records. For example with 100,000 records tsearch2 returns in around 6
seconds, with 200,000 records tsearch2 returns in just under a minute.
Is this due to the indices fitting entirely in memory with 100,000
records?

2. As well as whole word matching i also need to be able to do
substring matching. Is the FTI module the way to approach this?

3. I have just begun to look into distibuted queries. Is there an
existing solution for distibuting a postgresql database amongst
multiple servers, so each has the same schema but only a subset of the
total data?

Any other helpful comments or sugestions on how to improve query times
using different hardware or software techniques would be appreciated.

Thanks,

Mat

pgsql-performance by date:

From: Tom Lane
Date: 14 October 2003, 16:14:56
Subject: Re: [SQL] sql performance and cache

From: Peter Eisentraut
Date: 14 October 2003, 16:45:07
Subject: Re: [HACKERS] Sun performance - Major discovery!

Large Text Search Help - Mailing list pgsql-performance

Previous

Next