Thread: tsearch2 parser configuration

tsearch2 parser configuration

From

"Worky Workerson"

Date:

06 February 2007, 16:39:20

Is it possible to configure the tsearch2 parser?  I'd like a very dumb
parser that splits on everything that is not [a-zA-Z0-9.].  The
default parser seems to work well on my dataset except for the '/'
character ... it doesn't split mike/john into two lexemes.  And ideas?

Thanks!

Re: tsearch2 parser configuration

From

Oleg Bartunov

Date:

07 February 2007, 01:31:27

http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/HOWTO-parser-tsearch2.html

On Tue, 6 Feb 2007, Worky Workerson wrote:

> Is it possible to configure the tsearch2 parser?  I'd like a very dumb
> parser that splits on everything that is not [a-zA-Z0-9.].  The
> default parser seems to work well on my dataset except for the '/'
> character ... it doesn't split mike/john into two lexemes.  And ideas?
>
> Thanks!
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: don't forget to increase your free space map settings
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

Re: tsearch2 parser configuration

From

Hannes Dorbath

Date:

07 February 2007, 04:12:35

On 06.02.2007 21:39, Worky Workerson wrote:
> Is it possible to configure the tsearch2 parser?  I'd like a very dumb
> parser that splits on everything that is not [a-zA-Z0-9.].  The
> default parser seems to work well on my dataset except for the '/'
> character ... it doesn't split mike/john into two lexemes.  And ideas?

Sure that's an ugly, but I'd just use:

SELECT to_tsvector(REPLACE('mike/john', '/', ' '));

and the same with to_tsquery();


--
Regards,
Hannes Dorbath