Thread: tsearch2 parser configuration
Is it possible to configure the tsearch2 parser? I'd like a very dumb parser that splits on everything that is not [a-zA-Z0-9.]. The default parser seems to work well on my dataset except for the '/' character: it doesn't split mike/john into two lexemes. Any ideas? Thanks!
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/HOWTO-parser-tsearch2.html

On Tue, 6 Feb 2007, Worky Workerson wrote:
> Is it possible to configure the tsearch2 parser? I'd like a very dumb
> parser that splits on everything that is not [a-zA-Z0-9.]. The
> default parser seems to work well on my dataset except for the '/'
> character ... it doesn't split mike/john into two lexemes. Any ideas?
>
> Thanks!

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
On 06.02.2007 21:39, Worky Workerson wrote:
> Is it possible to configure the tsearch2 parser? I'd like a very dumb
> parser that splits on everything that is not [a-zA-Z0-9.]. The
> default parser seems to work well on my dataset except for the '/'
> character ... it doesn't split mike/john into two lexemes. Any ideas?

Sure, it's ugly, but I'd just use:

    SELECT to_tsvector(REPLACE('mike/john', '/', ' '));

and the same with to_tsquery();

--
Regards,
Hannes Dorbath
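
The REPLACE() workaround above handles only '/'. A minimal sketch of how it might be extended to the rule from the original question (split on everything that is not [a-zA-Z0-9.]) is shown below. It assumes a PostgreSQL version that provides regexp_replace() with the 'g' flag (8.1 or later) and the default tsearch2 configuration; the sample strings are illustrative only. One caveat worth checking on your own setup: to_tsquery() expects operators between terms, so on the query side the separators are replaced with ' & ' rather than a plain space.

```sql
-- Indexing side: collapse every run of characters outside [a-zA-Z0-9.]
-- into a single space before handing the text to the parser.
SELECT to_tsvector(
    regexp_replace('mike/john', '[^a-zA-Z0-9.]+', ' ', 'g')
);

-- Query side: to_tsquery() needs an operator between terms, so the same
-- runs are replaced with ' & ' (AND). This assumes the input does not
-- begin or end with a separator; trim such input first.
SELECT to_tsquery(
    regexp_replace('mike/john', '[^a-zA-Z0-9.]+', ' & ', 'g')
);
```

If the preprocessing is needed in several places, wrapping the regexp_replace() call in a small SQL function keeps the indexing and query sides consistent. A custom parser, as described in the HOWTO linked earlier in the thread, avoids the preprocessing step altogether.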