Re: Postgresql.org search engine. - Mailing list pgsql-www

From Oleg Bartunov
Subject Re: Postgresql.org search engine.
Date
Msg-id Pine.GSO.4.58.0401302007180.19778@ra.sai.msu.su
Whole thread Raw
In response to Re: Postgresql.org search engine.  ("Dave Page" <dpage@vale-housing.co.uk>)
Responses Re: Postgresql.org search engine.
List pgsql-www
On Fri, 30 Jan 2004, Dave Page wrote:

> Hi Oleg,
>
> > -----Original Message-----
> > From: Oleg Bartunov [mailto:oleg@sai.msu.su]
> > Sent: 30 January 2004 16:03
> > To: Dave Page
> > Cc: pgsql-www@postgresql.org
> > Subject: Re: [pgsql-www] Postgresql.org search engine.
> >
> >
> > I'd recommend to use ispell dictionaries, so 'databases' and
> > 'database'
> > will produce the same results.
>
> Thanks, installed.
>
> BTW, searching for 'database' really makes it think! Other queries that
> generate less hits (eg. Mvcc or psqlodbc) seem to be far quicker.

It would think much longer if you search 'pgsql database' :(
Just tried and got ~100 sec.

This is feature of search engines based on inverted indices.
tsearch2 does just the other way - the more words in query the faster
searching.

I suggest to include 'postgresql', 'pgsql', 'postgres' into stop words
list :(  btw, you may look at word statistics and let top N words
as stop words.

>
> I have also added some weighting to the indexed sites to try to give
> preference to those that are more 'authoritative' and of global interest
> than others. Any comments or suggestions for changes welcome as always!

Hmm, I thought aspseek has sort of page rank, so let him works.


>
> # Primary sites
> SiteWeight http://www.postgresql.org/ 100
> SiteWeight http://advocacy.postgresql.org/ 100
> SiteWeight http://jdbc.postgresql.org/ 100
> SiteWeight http://developer.postgresql.org/ 100
>
> # Authoritiative project sites
> SiteWeight http://gborg.postgresql.org/ 75
> SiteWeight http://pgadmin.postgresql.org/ 75
> SiteWeight http://phppgadmin.sourceforge.net/ 75
>
> # User contributed stuff
> SiteWeight http://techdocs.postgresql.org/ 50
> SiteWeight http://archives.postgresql.org/ 50
>
> # Outside but reliable
> SiteWeight http://www.varlena.com/ 25
>
> # And the rest...
> SiteWeight http://www.postgresql.cl/ 0
> SiteWeight http://postgresql.ok.cz/ 0
> SiteWeight http://www.postgresql.jp/ 0
> SiteWeight http://pgsql-fr.tuxfamily.org/ 0
> SiteWeight http://www.linuxshare.ru/ 0
> SiteWeight http://www.postgres.de/ 0
> SiteWeight http://www.pgsqldb.org/ 0
> SiteWeight http://www.postgresql.org.br/ 0
>
> Regards, Dave.
>

    Regards,
        Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

pgsql-www by date:

Previous
From: "Marc G. Fournier"
Date:
Subject: Re: Postgresql.org search engine.
Next
From: "Dave Page"
Date:
Subject: Re: Postgresql.org search engine.