If I strip all HTML tags and filter out more stop words, will the search
be more accurate? Currently my full-text stats return entries like "font"
(from <font> tags, I guess) and other garbage.
If I do that, will it also improve the speed of my search?
Thanks!
PS: I cannot use other tools like mnoGoSearch, Lucene, etc., because I
have no root password for my server.
On 22 Aug, 02:20, o...@sai.msu.su (Oleg Bartunov) wrote:
> On Fri, 21 Aug 2009, xaviergxf wrote:
> > Hi,
>
> > I'm using PHP and full text search on PostgreSQL 8.3 for indexing HTML
> > descriptions. I have no access to the PostgreSQL server, since I use a
> > shared hosting service.
> > To improve search and performance, I want to do the following:
>
> > Strip all HTML tags, then use my PHP script to remove more stop words
> > (because I can't edit the stop words file on the server).
>
> > My question: does what I'm planning to do have any side effects? Any
> > suggestions?
>
> You shouldn't bother to strip all HTML tags; just create your own text search
> configuration, which indexes only what you want. Read the documentation for
> details.
>
> Regards,
> Oleg
> _____________________________________________________________
> Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
> Sternberg Astronomical Institute, Moscow University, Russia
> Internet: o...@sai.msu.su, http://www.sai.msu.su/~megera/
> phone: +007(495)939-16-83, +007(495)939-23-83
>
> --
> Sent via pgsql-general mailing list (pgsql-gene...@postgresql.org)
> To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-general
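
For anyone reading this thread later: the custom text search configuration Oleg mentions might look roughly like the sketch below. This is only an illustration under some assumptions. The configuration name html_english, the table items, and its description column are hypothetical, and the token types dropped here are just examples of what one might not want indexed. As far as I know, creating a configuration needs only CREATE privilege on the schema, not superuser rights, so it should be possible on shared hosting.

```sql
-- Hypothetical name: html_english. Start from the built-in english configuration.
CREATE TEXT SEARCH CONFIGURATION html_english (COPY = pg_catalog.english);

-- Stop indexing token types that tend to be noise in HTML descriptions.
-- The list is an example; adjust it to what your own data contains.
ALTER TEXT SEARCH CONFIGURATION html_english
    DROP MAPPING IF EXISTS FOR url, url_path, host, file, email;

-- Inspect how a sample fragment is tokenized and which tokens yield lexemes.
SELECT alias, token, lexemes
FROM ts_debug('html_english', '<font color="red">Hello world</font>');

-- Hypothetical table and column: index and query with the new configuration.
CREATE INDEX items_description_fts_idx
    ON items USING gin (to_tsvector('html_english', description));

SELECT *
FROM items
WHERE to_tsvector('html_english', description)
      @@ plainto_tsquery('html_english', 'red shoes');
```

The ts_debug query is the useful part for diagnosing where entries like "font" come from, since it shows the token type assigned to each piece of the input. Note that a custom stop word list still seems to require a file on the server, so filtering extra stop words in PHP before indexing may remain necessary on a host without filesystem access.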