This is how the default parser works. See the output from
select * from ts_debug('gallery2-httpd-conf');
and
select * from ts_debug('httpd-2.2.3-5.src.rpm');
All token types:
select * from token_type();
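
To see the token type next to each token, the two outputs can be joined. This is only a sketch: it assumes the contrib tsearch2 functions parse() and token_type() are installed, that the parser is registered as 'default', and that they return the columns (tokid, token) and (tokid, alias, descr). In 8.3 and later the corresponding functions are ts_parse() and ts_token_type().

```sql
-- Sketch only: assumes contrib/tsearch2 and a parser named 'default'.
-- parse() is assumed to return (tokid, token);
-- token_type() is assumed to return (tokid, alias, descr).

-- Tokens and their types for the string that is split on '-'
SELECT p.token, t.alias, t.descr
FROM parse('default', 'gallery2-httpd-conf') AS p
JOIN token_type('default') AS t ON t.tokid = p.tokid;

-- Same query for the string that comes back as a single lexeme
SELECT p.token, t.alias, t.descr
FROM parse('default', 'httpd-2.2.3-5.src.rpm') AS p
JOIN token_type('default') AS t ON t.tokid = p.tokid;
```

Comparing the alias column of the two results should show which token types the parser assigned in each case, and so why only the first string is broken into parts.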
On Thu, 6 Sep 2007, RC Gobeille wrote:
> I'm having trouble understanding to_tsvector. (PostgreSQL 8.1.9 contrib)
>
> In this first case converting 'gallery2-httpd-conf' makes sense to me and is
> exactly what I want. It looks like the entire string is indexed plus the
> substrings broken by '-' are indexed.
>
>
> ossdb=# select to_tsvector('gallery2-httpd-conf');
> to_tsvector
> ---------------------------------------------------------
> 'conf':4 'httpd':3 'gallery2':2 'gallery2-httpd-conf':1
>
>
> However, I'd expect the same to happen in the httpd example - but it does not
> appear to.
>
> ossdb=# select to_tsvector('httpd-2.2.3-5.src.rpm');
> to_tsvector
> ---------------------------
> 'httpd-2.2.3-5.src.rpm':1
>
> Why don't I get: 'httpd', 'src', 'rpm', 'httpd-2.2.3-5.src.rpm' ?
>
> Is this a bug or design?
>
>
> Thank you!
> Bob
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83