Re: Fragments in tsearch2 headline - Mailing list pgsql-general

From Catalin Marinas
Subject Re: Fragments in tsearch2 headline
Date
Msg-id b0943d9e0710300642u60033bb2m3a2b0cef4308c1a1@mail.gmail.com
Whole thread Raw
In response to Re: Fragments in tsearch2 headline  (Richard Huxton <dev@archonet.com>)
Responses Re: Fragments in tsearch2 headline  (Oleg Bartunov <oleg@sai.msu.su>)
List pgsql-general
On 30/10/2007, Richard Huxton <dev@archonet.com> wrote:
> Oleg Bartunov wrote:
> > Catalin,
> >
> > what is your need ? What's wrong with this ?
> >
> > postgres=# select ts_headline('1 2 3 4 5 3 4 abc abc 2 3
> > xyz','2'::tsquery, 'StartSel=...,StopSel=...')
> > ;
> >                 ts_headline
> > -------------------------------------------
> >  1 ...2... 3 4 5 3 4 abc abc ...2... 3 xyz
>
> I think he want's something like: "1 2 3 ... abc 2 3 ..."
>
> A few characters of context around each match and then ... between. Kind
> of like grep -C.

That's pretty much correct (with the difference that I'd like context
of words rather than lines as in "grep" and StartSel=<b>,
StopSel=</b>).

Since the text I want a headline for might be pretty long (tens of
lines), I'd like to only show the excerpts around the matching words.
Similar to the above example:

select ts_headline('1 2 3 4 5 3 4 abc x y z 2 3', '2 & abc'::tsquery);

should give:

'1 <b>2</b> 3 4 ... 3 4 <b>abc</b> x y'

Currently, if you limit the maximum words so that 'abc' is too far, it
only highlights the first match.

Many of the search engines (including google) show the headline this
way. I think Lucene can do this as well but I've never used it to be
sure.

--
Catalin

pgsql-general by date:

Previous
From: Peter Eisentraut
Date:
Subject: Re: Collation sequence and use of operatings system's locale
Next
From: Douglas McNaught
Date:
Subject: Re: function and passing the table name to be used with SQL stmnt