Re: Postgresql.org search engine. - Mailing list pgsql-www
From | Oleg Bartunov |
---|---|
Subject | Re: Postgresql.org search engine. |
Date | |
Msg-id | Pine.GSO.4.58.0401302007180.19778@ra.sai.msu.su |
In response to | Re: Postgresql.org search engine. ("Dave Page" <dpage@vale-housing.co.uk>) |
Responses | Re: Postgresql.org search engine. ("Dave Page" <dpage@vale-housing.co.uk>) |
List | pgsql-www |
On Fri, 30 Jan 2004, Dave Page wrote:

> Hi Oleg,
>
> > -----Original Message-----
> > From: Oleg Bartunov [mailto:oleg@sai.msu.su]
> > Sent: 30 January 2004 16:03
> > To: Dave Page
> > Cc: pgsql-www@postgresql.org
> > Subject: Re: [pgsql-www] Postgresql.org search engine.
> >
> >
> > I'd recommend to use ispell dictionaries, so 'databases' and
> > 'database' will produce the same results.
>
> Thanks, installed.
>
> BTW, searching for 'database' really makes it think! Other queries that
> generate fewer hits (e.g. MVCC or psqlodbc) seem to be far quicker.

It would think much longer if you searched for 'pgsql database' :(
I just tried it and got ~100 sec. This is a feature of search engines based
on inverted indices. tsearch2 works the other way around: the more words in
the query, the faster the search.

I suggest adding 'postgresql', 'pgsql' and 'postgres' to the stop word
list :(

BTW, you could look at the word statistics and use the top N words as
stop words.

>
> I have also added some weighting to the indexed sites to try to give
> preference to those that are more 'authoritative' and of global interest
> than others. Any comments or suggestions for changes welcome as always!

Hmm, I thought ASPseek has a sort of page rank, so let it do its work.

>
> # Primary sites
> SiteWeight http://www.postgresql.org/ 100
> SiteWeight http://advocacy.postgresql.org/ 100
> SiteWeight http://jdbc.postgresql.org/ 100
> SiteWeight http://developer.postgresql.org/ 100
>
> # Authoritative project sites
> SiteWeight http://gborg.postgresql.org/ 75
> SiteWeight http://pgadmin.postgresql.org/ 75
> SiteWeight http://phppgadmin.sourceforge.net/ 75
>
> # User contributed stuff
> SiteWeight http://techdocs.postgresql.org/ 50
> SiteWeight http://archives.postgresql.org/ 50
>
> # Outside but reliable
> SiteWeight http://www.varlena.com/ 25
>
> # And the rest...
> SiteWeight http://www.postgresql.cl/ 0
> SiteWeight http://postgresql.ok.cz/ 0
> SiteWeight http://www.postgresql.jp/ 0
> SiteWeight http://pgsql-fr.tuxfamily.org/ 0
> SiteWeight http://www.linuxshare.ru/ 0
> SiteWeight http://www.postgres.de/ 0
> SiteWeight http://www.pgsqldb.org/ 0
> SiteWeight http://www.postgresql.org.br/ 0
>
> Regards, Dave.
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
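
The suggestion in this message to take the top N words from the word statistics as stop words could be sketched roughly as below. This is plain Python for illustration only, not ASPseek or tsearch2 code; the function name `top_n_stop_words`, the tokenizing regex and the sample documents are all assumptions made for the example.

```python
from collections import Counter
import re


def top_n_stop_words(documents, n):
    """Return the n most frequent words across documents.

    Hypothetical helper: counts raw term frequency over lowercased
    alphanumeric tokens. A real indexer would read these statistics
    from its own word tables instead of re-tokenizing the text.
    """
    counts = Counter()
    for text in documents:
        counts.update(re.findall(r"[a-z0-9]+", text.lower()))
    return [word for word, _ in counts.most_common(n)]


# Made-up sample corpus; on a PostgreSQL site the project name
# and 'database' are expected to dominate the statistics.
sample_docs = [
    "PostgreSQL database tuning",
    "pgsql database backup",
    "PostgreSQL database MVCC",
    "PostgreSQL psqlodbc driver",
]

stop_word_candidates = top_n_stop_words(sample_docs, 2)
print(stop_word_candidates)
```

The resulting list is only a set of candidates: words such as 'database' are frequent but may still be worth keeping searchable, so the list would need a manual review before going into a stop word file.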