Re: Writting a "search engine" for a pgsql DB - Mailing list pgsql-performance

From Dave Page
Subject Re: Writting a "search engine" for a pgsql DB
Msg-id 45E53E5A.5060600@postgresql.org
In response to Re: Writting a "search engine" for a pgsql DB  (Charles Sprickman <spork@bway.net>)
List pgsql-performance
Charles Sprickman wrote:
> On Tue, 27 Feb 2007, Dave Page wrote:
>
>> Magnus Hagander wrote:
>>>
>>> Just as a datapoint, we did try to use mnogosearch for the
>>> postgresql.org website+archives search, and it fell over completely.
>>> Indexing took way too long, and we had search times several thousand
>>> times longer than with tsearch2.
>>>
>>> That said, I'm sure there are cases when it works fine :-)
>>
>> There are - in fact before your time the site did use Mnogosearch. We
>> moved to our own port of ASPSeek when we outgrew Mnogo's capabilities,
>> and then to your TSearch code when we outgrew ASPSeek.
>
> At risk of pulling this way too far off topic, may I ask how many
> documents (mail messages) you were dealing with when things started to
> fall apart with mnogo?

I honestly don't remember now, but it would have been in the tens or
maybe low hundreds of thousands. Don't get me wrong, I've built sites
where Mnogo is still running fine and does a great job - it just doesn't
scale well.

> We're looking at it for a new project that will
> hopefully get bigger and bigger.  We will be throwing groups of mailing
> lists into their own mnogo config/tables...  If we should save ourselves
> the pain and look at something more homebrew, then we'll start
> investigating "Tsearch".

Well, put it this way: the PostgreSQL mailing list archives outgrew Mnogo
years ago, and even ASPSeek was beginning to struggle when it got removed
a few months back.

Regards, Dave
